Available Models

View as Markdown

Available Models

This page is auto-generated from GET /v1/models so the catalog stays aligned with the live Mesh API inventory.

Total models: 325

Showing 1-25 of 325 models

NameModel IDProviderTierContextInput (USD)Output (USD)Description
AI21: Jamba 1.5 Large
ai21/jamba-1-5-large-v1Ai21Paid262 K0.0020.008

No description available.

AI21: Jamba 1.5 Mini
ai21/jamba-1-5-mini-v1Ai21Paid262 K0.00020.0004

No description available.

AI21: Jamba Large 1.7
ai21/jamba-large-1.7Ai21Paid256 K0.0020.008

Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding…

AionLabs: Aion-1.0
aion-labs/aion-1.0Aion LabsPaid131 K0.0040.008

Aion-1.0 is a multi-model system designed for high performance across various tasks, including r…

AionLabs: Aion-1.0-Mini
aion-labs/aion-1.0-miniAion LabsPaid131 K0.00070.0014

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for…

AionLabs: Aion-2.0
aion-labs/aion-2.0Aion LabsPaid131 K0.00080.0016

Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytelling. It…

AionLabs: Aion-RP 1.0 (8B)
aion-labs/aion-rp-llama-3.1-8bAion LabsPaid32.8 K0.00080.0016

Aion-RP-Llama-3.1-8B ranks the highest in the character evaluation portion of the RPBench-Auto b…

Amazon: Nova 2 Lite
amazon/nova-2-lite-v1AmazonPaid1 M0.000060.00024

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process te…

Amazon: Nova Lite
amazon/nova-lite-v1AmazonPaid3 K0.000060.00024

No description available.

Amazon: Nova Micro
amazon/nova-micro-v1AmazonPaid128 K0.000040.00014

No description available.

Amazon: Nova Premier 1.0
amazon/nova-premier-v1AmazonPaid1 M0.00250.0125

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning task…

Amazon: Nova Pro
amazon/nova-pro-v1AmazonPaid3 K0.00080.0032

No description available.

Anthropic: Claude 3.5 Haiku
anthropic/claude-3.5-haikuAnthropicPaid2 K0.00080.004

Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use…

Anthropic: Claude Haiku 4.5
anthropic/claude-haiku-4.5AnthropicPaid2 K0.00080.004

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intel…

Anthropic: Claude Opus 4
anthropic/claude-opus-4AnthropicPaid2 K0.0150.075

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sust…

Anthropic: Claude Opus 4.1
anthropic/claude-opus-4.1AnthropicPaid2 K0.0150.075

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performan…

Anthropic: Claude Opus 4.5
anthropic/claude-opus-4.5AnthropicPaid2 K0.0150.075

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineeri…

Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6AnthropicPaid1 M0.0150.075

Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is bu…

Anthropic: Claude Opus 4.7
anthropic/claude-opus-4.7AnthropicPaid1 M0.0050.025

Anthropic’s latest flagship, now live on Mesh API. Opus 4.7 is a big step up on hard coding work…

Anthropic: Claude Opus 4.8
anthropic/claude-opus-4.8AnthropicPaid1 M0.0050.025

Anthropic’s latest flagship, now live on Mesh API. Opus 4.7 is a big step up on hard coding work…

Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5AnthropicPaid1 M0.0030.015

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world ag…

Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6AnthropicPaid1 M0.0030.015

Sonnet 4.6 is Anthropic’s most capable Sonnet-class model yet, with frontier performance across…

Baidu: ERNIE 4.5 300B A47B
baidu/ernie-4.5-300b-a47bBaiduPaid123 K0.000280.0011

ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (MoE) language model developed by Bai…

Baidu: ERNIE 4.5 VL 424B A47B
baidu/ernie-4.5-vl-424b-a47bBaiduPaid123 K0.000420.00125

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 ser…

bedrock/amazon.titan-embed-g1-text-02
bedrock/amazon.titan-embed-g1-text-02BedrockPaidN/A0.000020

No description available.

bedrock/amazon.titan-embed-text-v2:0
bedrock/amazon.titan-embed-text-v2:0BedrockPaidN/A0.000020

No description available.

bedrock/cohere.embed-english-v3
bedrock/cohere.embed-english-v3BedrockPaidN/A0.00010

No description available.

bedrock/cohere.embed-multilingual-v3
bedrock/cohere.embed-multilingual-v3BedrockPaidN/A0.00010

No description available.

bedrock/cohere.embed-v4:0
bedrock/cohere.embed-v4:0BedrockPaidN/A0.000120

No description available.

ByteDance Seed: Seed 1.6
bytedance-seed/seed-1.6Bytedance SeedPaid262 K0.000250.002

Seed 1.6 is a general-purpose model released by the ByteDance Seed team. It incorporates multimo…

ByteDance Seed: Seed 1.6 Flash
bytedance-seed/seed-1.6-flashBytedance SeedPaid262 K0.0000750.0003

Seed 1.6 Flash is an ultra-fast multimodal deep thinking model by ByteDance Seed, supporting bot…

ByteDance Seed: Seed-2.0-Lite
bytedance-seed/seed-2.0-liteBytedance SeedPaid262 K0.000250.002

Seed-2.0-Lite is a versatile, cost‑efficient enterprise workhorse that delivers strong multimoda…

ByteDance Seed: Seed-2.0-Mini
bytedance-seed/seed-2.0-miniBytedance SeedPaid262 K0.00010.0004

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasi…

ByteDance: UI-TARS 7B
bytedance/ui-tars-1.5-7bBytedancePaid128 K0.00010.0002

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, includin…

Claude 3 Haiku
anthropic/claude-3-haikuAnthropicPaid2 K0.000250.00125

No description available.

Claude Sonnet 4
anthropic/claude-sonnet-4AnthropicPaid2 K0.0030.015

No description available.

Cohere: Command A
cohere/command-aCoherePaid256 K0.00250.01

Command A is an open-weights 111B parameter model with a 256k context window focused on deliveri…

Cohere: Command R (08-2024)
cohere/command-r-08-2024CoherePaid128 K0.000150.0006

command-r-08-2024 is an update of the Command R with improved perfor…

Cohere: Command R+ (08-2024)
cohere/command-r-plus-08-2024CoherePaid128 K0.00250.01

command-r-plus-08-2024 is an update of the Command R+ with roug…

Cohere: Command R7B (12-2024)
cohere/command-r7b-12-2024CoherePaid128 K0.00003750.00015

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 202…

Deep Cogito: Cogito v2.1 671B
deepcogito/cogito-v2.1-671bDeepcogitoPaid128 K0.001250.00125

Cogito v2.1 671B MoE represents one of the strongest open models globally, matching performance…

DeepSeek: DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324DeepseekPaid164 K0.00020.00077

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship…

DeepSeek: DeepSeek V3.1
deepseek/deepseek-chat-v3.1DeepseekPaid32.8 K0.000150.00075

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both…

DeepSeek: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminusDeepseekPaid164 K0.000210.00079

DeepSeek-V3.1 Terminus is an update to DeepSeek V3.1 that mainta…

DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2DeepseekPaid164 K0.000620.00185

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with…

DeepSeek: DeepSeek V3.2 Exp
deepseek/deepseek-v3.2-expDeepseekPaid164 K0.000270.00041

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediat…

DeepSeek: DeepSeek V4 Flash
deepseek/deepseek-v4-flashDeepseekPaid1.05 M0.000140.00028

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B to…

DeepSeek: DeepSeek V4 Pro
deepseek/deepseek-v4-proDeepseekPaid1.05 M0.0013920.002784

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total paramete…

DeepSeek: R1
deepseek/deepseek-r1DeepseekPaid64 K0.001350.0054

DeepSeek R1 is here: Performance on par with OpenAI o1, but open-sourced and with…

EssentialAI: Rnj 1 Instruct
essentialai/rnj-1-instructEssentialaiPaid32.8 K0.000150.00015

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained…

Gemini 3 1 Flash Lite
google/gemini-3.1-flash-liteGooglePaidN/A0.000250.0015

No description available.

Gemini 3 1 Flash Lite Preview
google/gemini-3.1-flash-lite-previewGooglePaid1.05 M0.000250.0015

Gemini 3.1 Flash Lite Preview is Google’s high-efficiency model optimized for high-volume use ca…

Gemini 3 1 Pro Preview
google/gemini-3.1-pro-previewGooglePaid1.05 M0.0020.012

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engine…

Gemini 3 Flash Preview
google/gemini-3-flash-previewGooglePaid1.05 M0.00050.003

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows…

Glm 4 7
zai/glm-4-7ZaiPaid131 K0.00060.0022

No description available.

Glm 4 7 Flash
zai/glm-4-7-flashZaiPaid131 K0.000070.0004

No description available.

Google: Gemini 2.0 Flash Lite
google/gemini-2.0-flash-lite-001GooglePaid1.05 M0.0000750.0003

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemi…

Google: Gemini 2.5 Flash Lite Preview 09-2025
google/gemini-2.5-flash-lite-preview-09-2025GooglePaid1.05 M0.00010.0004

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for u…

Google: Gemini 2.5 Pro Preview 05-06
google/gemini-2.5-pro-preview-05-06GooglePaid1.05 M0.001250.01

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, ma…

Google: Gemini 2.5 Pro Preview 06-05
google/gemini-2.5-pro-previewGooglePaid1.05 M0.001250.01

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, ma…

Google: Gemma 2 27B
google/gemma-2-27b-itGooglePaid8.19 K0.000650.00065

Gemma 2 27B by Google is an open model built from the same research and technology used to creat…

Google: Gemma 3 12B
google/gemma-3-12b-itGooglePaid131 K0.000090.00029

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles…

Google: Gemma 3 27B
google/gemma-3-27b-itGooglePaid131 K0.000080.00016

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles…

Google: Gemma 3 4B
google/gemma-3-4b-itGooglePaid131 K0.000040.00008

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles…

Google: Gemma 3n 4B
google/gemma-3n-e4b-itGooglePaid32.8 K0.000020.00004

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as…

Google: Gemma 4 26B A4B
google/gemma-4-26b-a4b-itGooglePaid262 K0.000130.0004

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind…

Google: Nano Banana (Gemini 2.5 Flash Image)
google/gemini-2.5-flash-imageGooglePaid32.8 K0.00030.03

Gemini 2.5 Flash Image, a.k.a. “Nano Banana,” is now generally available. It is a state of the a…

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-image-previewGooglePaid65.5 K0.00050.049585

Gemini 3.1 Flash Image Preview, a.k.a. “Nano Banana 2,” is Google’s latest state of the art imag…

Gpt 3 5 Turbo 0125
openai/gpt-3.5-turbo-0125OpenaiPaid16.4 K0.00050.0015

No description available.

Gpt 3 5 Turbo 1106
openai/gpt-3.5-turbo-1106OpenaiPaid16.4 K0.0010.002

No description available.

Gpt 4 0613
openai/gpt-4-0613OpenaiPaid8.19 K0.030.06

No description available.

Gpt 4 Turbo 2024 04 09
openai/gpt-4-turbo-2024-04-09OpenaiPaid131 K0.010.03

No description available.

Gpt 5 4 Image
openai/gpt-5.4-imageOpenaiPaidN/A0.0080.015

No description available.

Gpt 5 4 Image 2
openai/gpt-5.4-image-2OpenaiPaidN/A0.0080.015

No description available.

Gpt 5 4 Image Mini
openai/gpt-5.4-image-miniOpenaiPaidN/A0.0080.015

No description available.

Gpt 5 Mini
openai/gpt-5-miniOpenaiPaid131 K0.000250.002

No description available.

Gpt 5 Nano
openai/gpt-5-nanoOpenaiPaid131 K0.000050.0004

No description available.

Gpt Audio
openai/gpt-audioOpenaiPaidN/A0.00250.01

No description available.

Gpt Audio 1 5
openai/gpt-audio-1.5OpenaiPaidN/A0.00250.01

No description available.

Gpt Audio Mini
openai/gpt-audio-miniOpenaiPaidN/A0.00060.0024

No description available.

Gpt Image 1
openai/gpt-image-1OpenaiPaidN/A0.0050

No description available.

Gpt Image 1 5
openai/gpt-image-1.5OpenaiPaidN/A0.0050.01

No description available.

Gpt Image 1 Mini
openai/gpt-image-1-miniOpenaiPaidN/A0.0020

No description available.

Gpt Image 2
openai/gpt-image-2OpenaiPaidN/A0.0050

No description available.

Gpt Oss 120B
openai/gpt-oss-120bOpenaiPaid131 K0.000150.0006

No description available.

GPT Realtime 1.5
openai/gpt-realtime-1.5OpenaiPaidN/A0.0040.016

OpenAI GPT Realtime 1.5 — speech-to-speech real-time model with text, audio, and image input.

GPT Realtime 2
openai/gpt-realtime-2OpenaiPaidN/A0.0040.024

OpenAI GPT Realtime 2 — speech-to-speech real-time model with text, audio, and image input.

GPT Realtime Mini
openai/gpt-realtime-miniOpenaiPaidN/A0.00060.0024

OpenAI GPT Realtime Mini — cost-efficient speech-to-speech real-time model.

GPT Realtime Translate
openai/gpt-realtime-translateOpenaiPaidN/APricing unavailablePricing unavailable

OpenAI GPT Realtime Translate — real-time audio translation, billed per minute of output audio.

GPT Realtime Whisper
openai/gpt-realtime-whisperOpenaiPaidN/APricing unavailablePricing unavailable

OpenAI GPT Realtime Whisper — real-time audio transcription, billed per minute of input audio.

GPT-5-mini
gpt-5-miniUnknownPaid4 K0.000250.002

GPT-5 mini is a faster, more cost-efficient version of GPT-5. It’s great for well-defined tasks…

GPT-5.5
openai/gpt-5.5OpenaiPaid1.05 M0.0050.03

GPT-5.5 is OpenAI’s newest frontier model for the most complex professional work. Reasoning.effo…

Grok 4.3
x-ai/grok-4.3X AiPaidN/A0.0030.015

No description available.

IBM: Granite 4.0 Micro
ibm-granite/granite-4.0-h-microIbm GranitePaid131 K0.0000170.00011

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the…

Imagen 3
google/imagen-3GooglePaidN/APricing unavailablePricing unavailable

No description available.

Imagen 3 Fast
google/imagen-3-fastGooglePaidN/APricing unavailablePricing unavailable

No description available.

Imagen 3 V1
google/imagen-3-v1GooglePaidN/APricing unavailablePricing unavailable

No description available.

Imagen 4
google/imagen-4GooglePaidN/APricing unavailablePricing unavailable

No description available.

Imagen 4 Fast
google/imagen-4-fastGooglePaidN/APricing unavailablePricing unavailable

No description available.

Imagen 4 Ultra
google/imagen-4-ultraGooglePaidN/APricing unavailablePricing unavailable

No description available.

Inception: Mercury 2
inception/mercury-2InceptionPaid128 K0.000250.00075

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Inst…

Inflection: Inflection 3 Productivity
inflection/inflection-3-productivityInflectionPaid8 K0.00250.01

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requir…

Kwaipilot: KAT-Coder-Pro V2
kwaipilot/kat-coder-pro-v2KwaipilotPaid256 K0.00030.0012

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed fo…

Magnum v4 72B
anthracite-org/magnum-v4-72bAnthracite OrgPaid16.4 K0.0030.005

This is a series of models designed to replicate the prose quality of the Claude 3 models, speci…

Mancer: Weaver (alpha)
mancer/weaverMancerPaid8 K0.000750.001

An attempt to recreate Claude-style verbosity, but don’t expect the same level of coherence or m…

Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instructMeta LlamaPaid8.19 K0.000720.00072

Meta’s latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B inst…

Meta: Llama 3 8B Instruct
meta-llama/llama-3-8b-instructMeta LlamaPaid8.19 K0.00030.0006

Meta’s latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instr…

Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instructMeta LlamaPaid131 K0.000720.00072

Meta’s latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B in…

Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instructMeta LlamaPaid16.4 K0.00020.0002

Meta’s latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B ins…

Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instructMeta LlamaPaid131 K0.0000490.000049

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks…

Meta: Llama 3.2 1B Instruct
meta-llama/llama-3.2-1b-instructMeta LlamaPaid60 K0.0000270.0002

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural l…

Meta: Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instructMeta LlamaPaid80 K0.0000510.00034

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced…

Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instructMeta LlamaPaid131 K0.000720.00072

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned…

Meta: Llama 4 Maverick
meta-llama/llama-4-maverickMeta LlamaPaid1.05 M0.000240.00097

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, bui…

Meta: Llama 4 Scout
meta-llama/llama-4-scoutMeta LlamaPaid328 K0.000170.00017

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta,…

Meta: Llama Guard 4 12B
meta-llama/llama-guard-4-12bMeta LlamaPaid164 K0.000180.00018

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content saf…

Microsoft: Phi 4
microsoft/phi-4MicrosoftPaid16.4 K0.0000650.00014

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks an…

Minimax M2
minimax/minimax-m2MinimaxPaid197 K0.00030.0012

No description available.

Minimax M2 1
minimax/minimax-m2-1MinimaxPaid197 K0.00030.0012

No description available.

Minimax M2 5
minimax/minimax-m2-5MinimaxPaid197 K0.00030.0012

No description available.

MiniMax: MiniMax M2-her
minimax/minimax-m2-herMinimaxPaid65.5 K0.00030.0012

MiniMax M2-her is a dialogue-first large language model built for immersive roleplay, character-…

MiniMax: MiniMax-01
minimax/minimax-01MinimaxPaid1 M0.00020.0011

MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image underst…

Mistral Large
mistralai/mistral-largeMistralaiPaid128 K0.0020.006

This is Mistral AI’s flagship model, Mistral Large 2 (version mistral-large-2407). It’s a prop…

Mistral Large 2407
mistralai/mistral-large-2407MistralaiPaid131 K0.0020.006

This is Mistral AI’s flagship model, Mistral Large 2 (version mistral-large-2407). It’s a propri…

Mistral Large 2411
mistralai/mistral-large-2411MistralaiPaid131 K0.0020.006

Mistral Large 2 2411 is an update of Mistral Large 2 released togeth…

Mistral: 7B Instruct (legacy)
mistral/mistral-7b-instruct-v0MistralPaid32.8 K0.000150.0002

No description available.

Mistral: Codestral 2508
mistralai/codestral-2508MistralaiPaid256 K0.00030.0009

Mistral’s cutting-edge language model for coding released end of July 2025. Codestral specialize…

Mistral: Devstral 2 123B
mistral/devstral-2-123bMistralPaid262 K0.00040.002

No description available.

Mistral: Devstral 2 2512
mistralai/devstral-2512MistralaiPaid262 K0.00040.002

Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding…

Mistral: Devstral Medium
mistralai/devstral-mediumMistralaiPaid131 K0.00040.002

Devstral Medium is a high-performance code generation and agentic reasoning model developed join…

Mistral: Devstral Small 1.1
mistralai/devstral-smallMistralaiPaid131 K0.00010.0003

Devstral Small 1.1 is a 24B parameter open-weight language model for software engineering agents…

Mistral: Large 2402 (legacy)
mistral/mistral-large-2402-v1MistralPaid32.8 K0.0040.012

No description available.

Mistral: Large 3 675B
mistral/mistral-large-3-675b-instructMistralPaid262 K0.00050.0015

No description available.

Mistral: Magistral Small 2509
mistral/magistral-small-2509MistralPaid131 K0.00050.0015

No description available.

Mistral: Ministral 3 14B
mistral/ministral-3-14b-instructMistralPaid131 K0.00020.0002

No description available.

Mistral: Ministral 3 14B 2512
mistralai/ministral-14b-2512MistralaiPaid262 K0.00020.0002

The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and pe…

Mistral: Ministral 3 3B
mistral/ministral-3-3b-instructMistralPaid131 K0.00010.0001

No description available.

Mistral: Ministral 3 3B 2512
mistralai/ministral-3b-2512MistralaiPaid131 K0.00010.0001

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny langu…

Mistral: Ministral 3 8B
mistral/ministral-3-8b-instructMistralPaid131 K0.000150.00015

No description available.

Mistral: Ministral 3 8B 2512
mistralai/ministral-8b-2512MistralaiPaid262 K0.000150.00015

A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny languag…

Mistral: Mistral 7B Instruct v0.1
mistralai/mistral-7b-instruct-v0.1MistralaiPaid2.82 K0.000110.00019

A 7.3B parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for sp…

Mistral: Mistral Large 3 2512
mistralai/mistral-large-2512MistralaiPaid262 K0.00050.0015

Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-expe…

Mistral: Mistral Medium 3
mistralai/mistral-medium-3MistralaiPaid131 K0.00040.002

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver front…

Mistral: Mistral Medium 3.1
mistralai/mistral-medium-3.1MistralaiPaid131 K0.00040.002

Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance enterp…

Mistral: Mistral Nemo
mistralai/mistral-nemoMistralaiPaid131 K0.000020.00004

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NV…

Mistral: Mistral Small 3
mistralai/mistral-small-24b-instruct-2501MistralaiPaid32.8 K0.000050.00008

Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across c…

Mistral: Mistral Small 3.1 24B
mistralai/mistral-small-3.1-24b-instructMistralaiPaid131 K0.000030.00011

Mistral Small 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 bi…

Mistral: Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instructMistralaiPaid128 K0.0000750.0002

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for…

Mistral: Mistral Small 4
mistralai/mistral-small-2603MistralaiPaid262 K0.000150.0006

Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities…

Mistral: Mixtral 8x22B Instruct
mistralai/mixtral-8x22b-instructMistralaiPaid65.5 K0.0020.006

Mistral’s official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22…

Mistral: Pixtral Large (2502)
mistral/pixtral-large-2502-v1MistralPaid131 K0.0020.006

No description available.

Mistral: Pixtral Large 2411
mistralai/pixtral-large-2411MistralaiPaid131 K0.0020.006

Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large…

Mistral: Saba
mistralai/mistral-sabaMistralaiPaid32.8 K0.00020.0006

Mistral Saba is a 24B-parameter language model specifically designed for the Middle East and Sou…

Mistral: Small 2402 (legacy)
mistral/mistral-small-2402-v1MistralPaid32.8 K0.0010.003

No description available.

Mistral: Voxtral Mini 3B
mistral/voxtral-mini-3b-2507MistralPaid32.8 K0.000040.00004

No description available.

Mistral: Voxtral Small 24B 2507
mistralai/voxtral-small-24b-2507MistralaiPaid32 K0.00010.0003

Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input c…

MoonshotAI: Kimi K2 0711
moonshotai/kimi-k2MoonshotaiPaid131 K0.000570.0023

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot…

MoonshotAI: Kimi K2 0905
moonshotai/kimi-k2-0905MoonshotaiPaid131 K0.00040.002

Kimi K2 0905 is the September update of Kimi K2 0711. It is a large-scale…

MoonshotAI: Kimi K2 Thinking
moonshotai/kimi-k2-thinkingMoonshotaiPaid131 K0.00060.0025

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 s…

MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5MoonshotaiPaid262 K0.00060.003

Kimi K2.5 is Moonshot AI’s native multimodal model, delivering state-of-the-art visual coding ca…

Morph: Morph V3 Fast
morph/morph-v3-fastMorphPaid81.9 K0.00080.0012

Morph’s fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code…

Morph: Morph V3 Large
morph/morph-v3-largeMorphPaid262 K0.00090.0019

Morph’s high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy fo…

MythoMax 13B
gryphe/mythomax-l2-13bGryphePaid4.1 K0.000060.00006

One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions…

Nous: Hermes 3 405B Instruct
nousresearch/hermes-3-llama-3.1-405bNousresearchPaid131 K0.0010.001

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced…

Nous: Hermes 3 70B Instruct
nousresearch/hermes-3-llama-3.1-70bNousresearchPaid131 K0.00030.0003

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresea…

Nous: Hermes 4 405B
nousresearch/hermes-4-405bNousresearchPaid131 K0.0010.003

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Rese…

Nous: Hermes 4 70B
nousresearch/hermes-4-70bNousresearchPaid131 K0.000130.0004

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It int…

NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3bNvidiaPaid262 K0.000060.00024

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and…

NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12bNvidiaPaid262 K0.00010.0005

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameter…

O1
openai/o1OpenaiPaid2 K0.0150.06

No description available.

O3
openai/o3OpenaiPaid2 K0.0020.008

No description available.

O3 Mini
openai/o3-miniOpenaiPaid2 K0.00110.0044

No description available.

O4 Mini
openai/o4-miniOpenaiPaid2 K0.00110.0044

No description available.

openai/gpt-5-pro
openai/gpt-5-proOpenaiPaid4 K0.0150.12

High-compute version of GPT-5 for complex reasoning tasks

openai/gpt-5.5-pro
openai/gpt-5.5-proOpenaiPaidN/A0.030.18

No description available.

openai/text-embedding-3-large
openai/text-embedding-3-largeOpenaiPaidN/A0.000130

No description available.

openai/text-embedding-3-small
openai/text-embedding-3-smallOpenaiPaidN/A0.000020

No description available.

openai/text-embedding-ada-002
openai/text-embedding-ada-002OpenaiPaidN/A0.00010

No description available.

OpenAI: GPT-3.5 Turbo
openai/gpt-3.5-turboOpenaiPaid16.4 K0.00050.0015

GPT-3.5 Turbo is OpenAI’s fastest model. It can understand and generate natural language or code…

OpenAI: GPT-3.5 Turbo (older v0613)
openai/gpt-3.5-turbo-0613OpenaiPaid4.09 K0.0010.002

GPT-3.5 Turbo is OpenAI’s fastest model. It can understand and generate natural language or code…

OpenAI: GPT-3.5 Turbo 16k
openai/gpt-3.5-turbo-16kOpenaiPaid16.4 K0.0030.004

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approxi…

OpenAI: GPT-3.5 Turbo Instruct
openai/gpt-3.5-turbo-instructOpenaiPaid4.09 K0.00150.002

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-relat…

OpenAI: GPT-4
openai/gpt-4OpenaiPaid8.19 K0.030.06

OpenAI’s flagship model, GPT-4 is a large-scale multimodal language model capable of solving dif…

OpenAI: GPT-4 Turbo
openai/gpt-4-turboOpenaiPaid128 K0.010.03

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and…

OpenAI: GPT-4.1
openai/gpt-4.1OpenaiPaid1.05 M0.0020.008

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-wo…

OpenAI: GPT-4.1 Mini
openai/gpt-4.1-miniOpenaiPaid1.05 M0.00040.0016

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantiall…

OpenAI: GPT-4.1 Nano
openai/gpt-4.1-nanoOpenaiPaid1.05 M0.00010.0004

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1…

OpenAI: GPT-4o
openai/gpt-4oOpenaiPaid128 K0.00250.01

GPT-4o (“o” for “omni”) is OpenAI’s latest AI model, supporting both text and image inputs with…

OpenAI: GPT-4o (2024-05-13)
openai/gpt-4o-2024-05-13OpenaiPaid128 K0.0050.015

GPT-4o (“o” for “omni”) is OpenAI’s latest AI model, supporting both text and image inputs with…

OpenAI: GPT-4o (2024-08-06)
openai/gpt-4o-2024-08-06OpenaiPaid128 K0.00250.01

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the abi…

OpenAI: GPT-4o (2024-11-20)
openai/gpt-4o-2024-11-20OpenaiPaid128 K0.00250.01

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural,…

OpenAI: GPT-4o Search Preview
openai/gpt-4o-search-previewOpenaiPaid128 K0.00250.01

GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to…

OpenAI: GPT-4o-mini
openai/gpt-4o-miniOpenaiPaid128 K0.000150.0006

GPT-4o mini is OpenAI’s newest model after GPT-4 Omni, supporting both…

OpenAI: GPT-4o-mini (2024-07-18)
openai/gpt-4o-mini-2024-07-18OpenaiPaid128 K0.000150.0006

GPT-4o mini is OpenAI’s newest model after GPT-4 Omni, supporting both…

OpenAI: GPT-4o-mini Search Preview
openai/gpt-4o-mini-search-previewOpenaiPaid128 K0.000150.0006

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trai…

OpenAI: GPT-5 Chat
openai/gpt-5-chatOpenaiPaid128 K0.001250.01

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for en…

OpenAI: GPT-5.1
openai/gpt-5.1OpenaiPaid4 K0.001250.01

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpos…

OpenAI: GPT-5.1 Chat
openai/gpt-5.1-chatOpenaiPaid128 K0.001250.01

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-l…

OpenAI: GPT-5.1-Codex
openai/gpt-5.1-codexOpenaiPaid4 K0.001250.01

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding…

OpenAI: GPT-5.2
openai/gpt-5.2OpenaiPaid4 K0.001750.014

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and lo…

OpenAI: GPT-5.2 Chat
openai/gpt-5.2-chatOpenaiPaid128 K0.001750.014

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-…

OpenAI: GPT-5.2 Pro
openai/gpt-5.2-proOpenaiPaid4 K0.0210.168

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and l…

OpenAI: GPT-5.2-Codex
openai/gpt-5.2-codexOpenaiPaid4 K0.001750.014

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and cod…

OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chatOpenaiPaid128 K0.001750.014

GPT-5.3 Chat is an update to ChatGPT’s most-used model that makes everyday conversations smoothe…

OpenAI: GPT-5.3-Codex
openai/gpt-5.3-codexOpenaiPaid4 K0.001750.014

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software en…

OpenAI: GPT-5.4
openai/gpt-5.4OpenaiPaid1.05 M0.00250.015

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system…

OpenAI: GPT-5.4 Mini
openai/gpt-5.4-miniOpenaiPaid4 K0.000750.0045

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized…

OpenAI: GPT-5.4 Nano
openai/gpt-5.4-nanoOpenaiPaid4 K0.00020.00125

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized…

OpenAI: GPT-5.4 Pro
openai/gpt-5.4-proOpenaiPaid1.05 M0.030.18

GPT-5.4 Pro is OpenAI’s most advanced model, building on GPT-5.4’s unified architecture with enh…

OpenAI: gpt-oss-safeguard-20b
openai/gpt-oss-safeguard-20bOpenaiPaid131 K0.000090.00039

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-…

OpenAI: o3 Pro
openai/o3-proOpenaiPaid2 K0.020.08

The o-series of models are trained with reinforcement learning to think before they answer and p…

Perplexity: Sonar
perplexity/sonarPerplexityPaid127 K0.0010.001

Sonar is lightweight, affordable, fast, and simple to use — now featuring citations and the abil…

Perplexity: Sonar Pro
perplexity/sonar-proPerplexityPaid2 K0.0030.015

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perp

Perplexity: Sonar Pro Search
perplexity/sonar-pro-searchPerplexityPaid2 K0.0030.015

Exclusively available on the OpenRouter API, Sonar Pro’s new Pro Search mode is Perplexity’s mos…

Qwen Flash
qwen/qwen-flashQwenPaid1 M0.0000220.000216

No description available.

Qwen Flash 2025 07 28
qwen/qwen-flash-2025-07-28QwenPaid131 K0.0000220.000216

No description available.

Qwen Mt Flash
qwen/qwen-mt-flashQwenPaid131 K0.0001010.00028

No description available.

Qwen Mt Lite
qwen/qwen-mt-liteQwenPaid131 K0.0000860.000229

No description available.

Qwen Mt Plus
qwen/qwen-mt-plusQwenPaid131 K0.0002590.000775

No description available.

Qwen Plus
qwen/qwen-plusQwenPaid1 M0.0001150.000287

No description available.

Qwen Plus 2025 07 28:Non Thinking
qwen/qwen-plus-2025-07-28:non-thinkingQwenPaid131 K0.0001150.000287

No description available.

Qwen Plus 2025 09 11
qwen/qwen-plus-2025-09-11QwenPaid131 K0.0003450.002868

No description available.

Qwen Plus 2025 09 11:Non Thinking
qwen/qwen-plus-2025-09-11:non-thinkingQwenPaid131 K0.0001150.000287

No description available.

Qwen Plus 2025 09 11:Thinking
qwen/qwen-plus-2025-09-11:thinkingQwenPaid131 K0.0001150.001147

No description available.

Qwen Plus 2025 12 01
qwen/qwen-plus-2025-12-01QwenPaid131 K0.0001150.000287

No description available.

Qwen Plus 2025 12 01:Non Thinking
qwen/qwen-plus-2025-12-01:non-thinkingQwenPaid131 K0.0003450.002868

No description available.

Qwen Plus 2025 12 01:Thinking
qwen/qwen-plus-2025-12-01:thinkingQwenPaid131 K0.0001150.001147

No description available.

Qwen Plus:Non Thinking
qwen/qwen-plus:non-thinkingQwenPaid131 K0.0006890.006881

No description available.

Qwen Plus:Thinking
qwen/qwen-plus:thinkingQwenPaid131 K0.0001150.001147

No description available.

qwen/text-embedding-v3
qwen/text-embedding-v3QwenPaidN/A0.000070

No description available.

qwen/text-embedding-v4
qwen/text-embedding-v4QwenPaidN/A0.000070

No description available.

Qwen2.5 72B Instruct
qwen/qwen-2.5-72b-instructQwenPaid32.8 K0.000120.00039

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following imp…

Qwen2.5 Coder 32B Instruct
qwen/qwen-2.5-coder-32b-instructQwenPaid32.8 K0.000660.001

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known a…

Qwen3 14B:Non Thinking
qwen/qwen3-14b:non-thinkingQwenPaid131 K0.0001440.000574

No description available.

Qwen3 14B:Thinking
qwen/qwen3-14b:thinkingQwenPaid131 K0.0001440.001434

No description available.

Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-instruct-2507QwenPaid131 K0.000230.00092

No description available.

Qwen3 235B A22B:Non Thinking
qwen/qwen3-235b-a22b:non-thinkingQwenPaid131 K0.0002870.001147

No description available.

Qwen3 235B A22B:Thinking
qwen/qwen3-235b-a22b:thinkingQwenPaid131 K0.0002870.002868

No description available.

Qwen3 30B A3B:Non Thinking
qwen/qwen3-30b-a3b:non-thinkingQwenPaid131 K0.0001080.000431

No description available.

Qwen3 30B A3B:Thinking
qwen/qwen3-30b-a3b:thinkingQwenPaid131 K0.0001080.001076

No description available.

Qwen3 32B V1
qwen/qwen3-32b-v1QwenPaid131 K0.00020.0006

No description available.

Qwen3 32B:Non Thinking
qwen/qwen3-32b:non-thinkingQwenPaid131 K0.000160.00064

No description available.

Qwen3 32B:Thinking
qwen/qwen3-32b:thinkingQwenPaid131 K0.000160.00064

No description available.

Qwen3 5 Flash
qwen/qwen3.5-flashQwenPaid1 M0.0000290.000287

No description available.

Qwen3 5 Flash 2026 02 23
qwen/qwen3.5-flash-2026-02-23QwenPaid131 K0.0000290.000287

No description available.

Qwen3 5 Plus
qwen/qwen3.5-plusQwenPaid1 M0.0001150.000688

No description available.

Qwen3 5 Plus 2026 02 15
qwen/qwen3.5-plus-2026-02-15QwenPaid131 K0.0001150.000688

No description available.

Qwen3 6 Plus
qwen/qwen3.6-plusQwenPaid131 K0.0002760.001651

No description available.

Qwen3 6 Plus 2026 04 02
qwen/qwen3.6-plus-2026-04-02QwenPaid131 K0.0002760.001651

No description available.

Qwen3 8B:Non Thinking
qwen/qwen3-8b:non-thinkingQwenPaid131 K0.0000720.000287

No description available.

Qwen3 8B:Thinking
qwen/qwen3-8b:thinkingQwenPaid131 K0.0000720.000717

No description available.

Qwen3 Coder 30B A3B V1
qwen/qwen3-coder-30b-a3b-v1QwenPaid131 K0.000150.00062

No description available.

Qwen3 Coder 480B A35B Instruct
qwen/qwen3-coder-480b-a35b-instructQwenPaid262 K0.0008610.003441

No description available.

Qwen3 Coder Flash 2025 07 28
qwen/qwen3-coder-flash-2025-07-28QwenPaid131 K0.0001440.000574

No description available.

Qwen3 Coder Plus 2025 07 22
qwen/qwen3-coder-plus-2025-07-22QwenPaid131 K0.0005740.002294

No description available.

Qwen3 Coder Plus 2025 09 23
qwen/qwen3-coder-plus-2025-09-23QwenPaid131 K0.0005740.002294

No description available.

Qwen3 Max 2025 09 23
qwen/qwen3-max-2025-09-23QwenPaid131 K0.0008610.003441

No description available.

Qwen3 Max 2026 01 23
qwen/qwen3-max-2026-01-23QwenPaid262 K0.0003590.001434

No description available.

Qwen3 Max Preview
qwen/qwen3-max-previewQwenPaid131 K0.0008610.003441

No description available.

Qwen3 Next 80B A3B
qwen/qwen3-next-80b-a3bQwenPaid131 K0.000150.0012

No description available.

Qwen3 Vl 235B A22B Thinking:Thinking
qwen/qwen3-vl-235b-a22b-thinking:thinkingQwenPaid131 K0.0002870.002868

No description available.

Qwen3 Vl 30B A3B Thinking:Thinking
qwen/qwen3-vl-30b-a3b-thinking:thinkingQwenPaid131 K0.0001080.001076

No description available.

Qwen3 Vl 32B Thinking:Thinking
qwen/qwen3-vl-32b-thinking:thinkingQwenPaid131 K0.000160.00064

No description available.

Qwen3 Vl 8B Thinking:Thinking
qwen/qwen3-vl-8b-thinking:thinkingQwenPaid131 K0.0000720.000717

No description available.

Qwen3 Vl Flash
qwen/qwen3-vl-flashQwenPaid131 K0.0000220.000215

No description available.

Qwen3 Vl Flash 2025 10 15
qwen/qwen3-vl-flash-2025-10-15QwenPaid131 K0.0000220.000215

No description available.

Qwen3 Vl Plus
qwen/qwen3-vl-plusQwenPaid131 K0.0001440.001434

No description available.

Qwen3 Vl Plus 2025 09 23
qwen/qwen3-vl-plus-2025-09-23QwenPaid131 K0.0001440.001434

No description available.

Qwen: Qwen Plus 0728
qwen/qwen-plus-2025-07-28QwenPaid1 M0.0003450.002868

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning mod…

Qwen: Qwen Plus 0728 (thinking)
qwen/qwen-plus-2025-07-28:thinkingQwenPaid1 M0.0001150.001147

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning mod…

Qwen: Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instructQwenPaid32.8 K0.000040.0001

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following impr…

Qwen: Qwen3 235B A22B
qwen/qwen3-235b-a22bQwenPaid131 K0.0004550.00182

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating…

Qwen: Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507QwenPaid262 K0.0000710.0001

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language m…

Qwen: Qwen3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507QwenPaid131 K0.00014950.001495

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) langua…

Qwen: Qwen3 30B A3B
qwen/qwen3-30b-a3bQwenPaid41 K0.000080.00028

Qwen3, the latest generation in the Qwen large language model series, features both dense and mi…

Qwen: Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507QwenPaid262 K0.0001080.000431

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, wi…

Qwen: Qwen3 32B
qwen/qwen3-32bQwenPaid41 K0.000080.00024

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for…

Qwen: Qwen3 8B
qwen/qwen3-8bQwenPaid41 K0.000050.0004

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for bot…

Qwen: Qwen3 Coder 30B A3B Instruct
qwen/qwen3-coder-30b-a3b-instructQwenPaid16 K0.0002160.000861

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 expert…

Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coderQwenPaid262 K0.000220.001

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by…

Qwen: Qwen3 Coder Flash
qwen/qwen3-coder-flashQwenPaid1 M0.0001440.000574

Qwen3 Coder Flash is Alibaba’s fast and cost efficient version of their proprietary Qwen3 Coder…

Qwen: Qwen3 Coder Next
qwen/qwen3-coder-nextQwenPaid262 K0.000120.00075

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local d…

Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plusQwenPaid1 M0.0005740.002294

Qwen3 Coder Plus is Alibaba’s proprietary version of the Open Source Qwen3 Coder 480B A35B. It i…

Qwen: Qwen3 Max
qwen/qwen3-maxQwenPaid262 K0.0003590.001434

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reason…

Qwen: Qwen3 Max Thinking
qwen/qwen3-max-thinkingQwenPaid262 K0.000780.0039

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes…

Qwen: Qwen3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instructQwenPaid262 K0.0001440.000574

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimize…

Qwen: Qwen3 Next 80B A3B Thinking
qwen/qwen3-next-80b-a3b-thinkingQwenPaid131 K0.00009750.00078

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs…

Qwen: Qwen3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instructQwenPaid262 K0.0002870.001147

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generati…

Qwen: Qwen3 VL 235B A22B Thinking
qwen/qwen3-vl-235b-a22b-thinkingQwenPaid131 K0.000260.0026

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visua…

Qwen: Qwen3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instructQwenPaid131 K0.0001080.000431

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual…

Qwen: Qwen3 VL 30B A3B Thinking
qwen/qwen3-vl-30b-a3b-thinkingQwenPaid131 K0.000130.00156

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual…

Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instructQwenPaid131 K0.000160.00064

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precis…

Qwen: Qwen3 VL 8B Instruct
qwen/qwen3-vl-8b-instructQwenPaid131 K0.0000720.000287

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for h…

Qwen: Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinkingQwenPaid131 K0.0001170.001365

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, des…

Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17bQwenPaid262 K0.0001720.001032

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that…

Qwen: Qwen3.5 Plus 2026-02-15
qwen/qwen3.5-plus-02-15QwenPaid1 M0.000260.00156

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that in…

Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10bQwenPaid262 K0.0001150.000917

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integr…

Qwen: Qwen3.5-27B
qwen/qwen3.5-27bQwenPaid262 K0.0000860.000688

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, de…

Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3bQwenPaid262 K0.0000570.000459

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture…

Qwen: Qwen3.5-Flash
qwen/qwen3.5-flash-02-23QwenPaid1 M0.0000650.00026

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrat…

Reka Edge
rekaai/reka-edgeRekaaiPaid16.4 K0.00010.0001

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video…

Relace: Relace Search
relace/relace-searchRelacePaid256 K0.0010.003

The relace-search model uses 4-12 view_file and grep tools in parallel to explore a codebase…

ReMM SLERP 13B
undi95/remm-slerp-l2-13bUndi95Paid6.14 K0.000450.00065

A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge

Sao10K: Llama 3 8B Lunaris
sao10k/l3-lunaris-8bSao10KPaid8.19 K0.000040.00005

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It’s a strategic me…

Sao10k: Llama 3 Euryale 70B v2.1
sao10k/l3-euryale-70bSao10KPaid8.19 K0.001480.00148

Euryale 70B v2.1 is a model focused on creative roleplay from Sao10k

Sao10K: Llama 3.1 70B Hanami x1
sao10k/l3.1-70b-hanami-x1Sao10KPaid16 K0.0030.003

This is Sao10K’s experiment over Euryale v2.2.

Sao10K: Llama 3.1 Euryale 70B v2.2
sao10k/l3.1-euryale-70bSao10KPaid131 K0.000850.00085

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sa

StepFun: Step 3.5 Flash
stepfun/step-3.5-flashStepfunPaid262 K0.00010.0003

Step 3.5 Flash is StepFun’s most capable open-source foundation model. Built on a sparse Mixture…

Tencent: Hunyuan A13B Instruct
tencent/hunyuan-a13b-instructTencentPaid131 K0.000140.00057

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tenc…

TheDrummer: Cydonia 24B V4.1
thedrummer/cydonia-24b-v4.1ThedrummerPaid131 K0.00030.0005

Uncensored and creative writing model based on Mistral Small 3.2 24B with good recall, prompt ad…

TheDrummer: Rocinante 12B
thedrummer/rocinante-12bThedrummerPaid32.8 K0.000170.00043

Rocinante 12B is designed for engaging storytelling and rich prose. Early testers have reported:…

TheDrummer: Skyfall 36B V2
thedrummer/skyfall-36b-v2ThedrummerPaid32.8 K0.000550.0008

Skyfall 36B v2 is an enhanced iteration of Mistral Small 2501, specifically fine-tuned for impro…

TheDrummer: UnslopNemo 12B
thedrummer/unslopnemo-12bThedrummerPaid32.8 K0.00040.0004

UnslopNemo v4.1 is the latest addition from the creator of Rocinante, designed for adventure wri…

vertex/gemini-embedding-001
vertex/gemini-embedding-001VertexPaidN/A0.000150

No description available.

vertex/text-embedding-005
vertex/text-embedding-005VertexPaidN/A0.0000250

No description available.

vertex/text-multilingual-embedding-002
vertex/text-multilingual-embedding-002VertexPaidN/A0.0000250

No description available.

WizardLM-2 8x22B
microsoft/wizardlm-2-8x22bMicrosoftPaid65.5 K0.000620.00062

WizardLM-2 8x22B is Microsoft AI’s most advanced Wizard model. It demonstrates highly competitiv…

Writer: Palmyra X4
writer/palmyra-x4-v1WriterPaid131 K0.0050.015

No description available.

Writer: Palmyra X5
writer/palmyra-x5WriterPaid1.04 M0.00060.006

Palmyra X5 is Writer’s most advanced model, purpose-built for building and scaling AI agents acr…

Writer: Palmyra X5
writer/palmyra-x5-v1WriterPaid1 M0.0060.03

No description available.

xAI: Grok 4.20
x-ai/grok-4.20X AiPaid2 M0.0020.006

Grok 4.20 is xAI’s newest flagship model with industry-leading speed and agentic tool calling ca…

xAI: Grok 4.20 Multi-Agent
x-ai/grok-4.20-multi-agentX AiPaid2 M0.0020.006

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based wo…

Xiaomi: MiMo-V2-Flash
xiaomi/mimo-v2-flashXiaomiPaid262 K0.000090.00029

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-o…

Z.ai: GLM 4 32B
z-ai/glm-4-32bZ AiPaid128 K0.00010.0001

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex task…

Z.ai: GLM 4.5 Air
z-ai/glm-4.5-airZ AiPaid131 K0.000130.00085

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built f…

Page 1 of 1