Model Explorer

449+ Models

100% official direct connections. Real-time pricing. Zero quantization.

Ai21Ai21

AI21: Jamba Large 1.7

ai21/jamba-large-1.7 from Ai21, optimized for chat completions workloads and available through Yi-AI.

Input

$2.0000

chatresponses
Aion LabsAion Labs

AionLabs: Aion-1.0

aion-labs/aion-1.0 from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$4.0000

chatresponses
Aion LabsAion Labs

AionLabs: Aion-1.0-Mini

aion-labs/aion-1.0-mini from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$0.7000

chatresponses
Aion LabsAion Labs

AionLabs: Aion-2.0

aion-labs/aion-2.0 from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$0.8000

chatresponses
Aion LabsAion Labs

AionLabs: Aion-RP 1.0 (8B)

aion-labs/aion-rp-llama-3.1-8b from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$0.8000

chatresponses
AlfredprosAlfredpros

AlfredPros: CodeLLaMa 7B Instruct Solidity

alfredpros/codellama-7b-instruct-solidity from Alfredpros, optimized for chat completions workloads and available through Yi-AI.

Input

$0.8000

chatresponses
AlibabaAlibaba

alibaba/wan-2.6

alibaba/wan-2.6 from Alibaba, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
AlibabaAlibaba

alibaba/wan-2.7

alibaba/wan-2.7 from Alibaba, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
AlibabaAlibaba

Tongyi DeepResearch 30B A3B

alibaba/tongyi-deepresearch-30b-a3b from Alibaba, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0900

chatresponses
AllenaiAllenai

AllenAI: Olmo 3 32B Think

allenai/olmo-3-32b-think from Allenai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1500

chatresponses
AllenaiAllenai

AllenAI: Olmo 3.1 32B Instruct

allenai/olmo-3.1-32b-instruct from Allenai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2000

chatresponses
AlpindaleAlpindale

Goliath 120B

alpindale/goliath-120b from Alpindale, optimized for chat completions workloads and available through Yi-AI.

Input

$3.7500

chatresponses
AmazonAmazon

Amazon: Nova 2 Lite

amazon/nova-2-lite-v1 from Amazon, optimized for chat completions workloads and available through Yi-AI.

Input

$0.3000

chatresponses
AmazonAmazon

Amazon: Nova Lite 1.0

amazon/nova-lite-v1 from Amazon, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0600

chatresponses
AmazonAmazon

Amazon: Nova Micro 1.0

amazon/nova-micro-v1 from Amazon, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0350

chatresponses
AmazonAmazon

Amazon: Nova Premier 1.0

amazon/nova-premier-v1 from Amazon, optimized for chat completions workloads and available through Yi-AI.

Input

$2.5000

chatresponses
AmazonAmazon

Amazon: Nova Pro 1.0

amazon/nova-pro-v1 from Amazon, optimized for chat completions workloads and available through Yi-AI.

Input

$0.8000

chatresponses
Anthracite OrgAnthracite Org

Magnum v4 72B

anthracite-org/magnum-v4-72b from Anthracite Org, optimized for chat completions workloads and available through Yi-AI.

Input

$3.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude 3 Haiku

anthropic/claude-3-haiku from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2500

chatresponses
AnthropicAnthropic

Anthropic: Claude 3.5 Haiku

anthropic/claude-3.5-haiku from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$0.8000

chatresponses
AnthropicAnthropic

Anthropic: Claude 3.7 Sonnet

anthropic/claude-3.7-sonnet from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude 3.7 Sonnet (thinking)

anthropic/claude-3.7-sonnet:thinking from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Haiku 4.5

anthropic/claude-haiku-4.5 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$1.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4

anthropic/claude-opus-4 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$15.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.1

anthropic/claude-opus-4.1 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$15.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.5

anthropic/claude-opus-4.5 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.6

anthropic/claude-opus-4.6 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.6 (Fast)

anthropic/claude-opus-4.6-fast from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$30.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.7

anthropic/claude-opus-4.7 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Sonnet 4

anthropic/claude-sonnet-4 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Sonnet 4.5

anthropic/claude-sonnet-4.5 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.0000

chatresponses
AnthropicAnthropic

Anthropic: Claude Sonnet 4.6

anthropic/claude-sonnet-4.6 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.0000

chatresponses
Arcee AiArcee Ai

Arcee AI: Coder Large

arcee-ai/coder-large from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.5000

chatresponses
Arcee AiArcee Ai

Arcee AI: Maestro Reasoning

arcee-ai/maestro-reasoning from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.9000

chatresponses
Arcee AiArcee Ai

Arcee AI: Spotlight

arcee-ai/spotlight from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1800

chatresponses
Arcee AiArcee Ai

Arcee AI: Trinity Large Preview

arcee-ai/trinity-large-preview from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1500

chatresponses
Arcee AiArcee Ai

Arcee AI: Trinity Large Thinking

arcee-ai/trinity-large-thinking from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2200

chatresponses
Arcee AiArcee Ai

Arcee AI: Trinity Mini

arcee-ai/trinity-mini from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0450

chatresponses
Arcee AiArcee Ai

Arcee AI: Virtuoso Large

arcee-ai/virtuoso-large from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.7500

chatresponses
BaaiBaai

baai/bge-base-en-v1.5

baai/bge-base-en-v1.5 from Baai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0050

chatresponses
BaaiBaai

baai/bge-large-en-v1.5

baai/bge-large-en-v1.5 from Baai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0100

chatresponses
BaaiBaai

baai/bge-m3

baai/bge-m3 from Baai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0100

chatresponses
BaiduBaidu

Baidu: ERNIE 4.5 21B A3B

baidu/ernie-4.5-21b-a3b from Baidu, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0700

chatresponses
BaiduBaidu

Baidu: ERNIE 4.5 21B A3B Thinking

baidu/ernie-4.5-21b-a3b-thinking from Baidu, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0700

chatresponses
BaiduBaidu

Baidu: ERNIE 4.5 300B A47B

baidu/ernie-4.5-300b-a47b from Baidu, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2800

chatresponses
BaiduBaidu

Baidu: ERNIE 4.5 VL 28B A3B

baidu/ernie-4.5-vl-28b-a3b from Baidu, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1400

chatresponses
BaiduBaidu

Baidu: ERNIE 4.5 VL 424B A47B

baidu/ernie-4.5-vl-424b-a47b from Baidu, optimized for chat completions workloads and available through Yi-AI.

Input

$0.4200

chatresponses
BaiduBaidu

Baidu: Qianfan-OCR-Fast (free)

baidu/qianfan-ocr-fast:free from Baidu, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
Black Forest LabsBlack Forest Labs

black-forest-labs/flux.2-flex

black-forest-labs/flux.2-flex from Black Forest Labs, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
Black Forest LabsBlack Forest Labs

black-forest-labs/flux.2-klein-4b

black-forest-labs/flux.2-klein-4b from Black Forest Labs, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
Black Forest LabsBlack Forest Labs

black-forest-labs/flux.2-max

black-forest-labs/flux.2-max from Black Forest Labs, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
Black Forest LabsBlack Forest Labs

black-forest-labs/flux.2-pro

black-forest-labs/flux.2-pro from Black Forest Labs, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
BytedanceBytedance

ByteDance: UI-TARS 7B

bytedance/ui-tars-1.5-7b from Bytedance, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1000

chatresponses
BytedanceBytedance

bytedance/seedance-1-5-pro

bytedance/seedance-1-5-pro from Bytedance, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
BytedanceBytedance

bytedance/seedance-2.0

bytedance/seedance-2.0 from Bytedance, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
BytedanceBytedance

bytedance/seedance-2.0-fast

bytedance/seedance-2.0-fast from Bytedance, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed 1.6

bytedance-seed/seed-1.6 from Bytedance Seed, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2500

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed 1.6 Flash

bytedance-seed/seed-1.6-flash from Bytedance Seed, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0750

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed-2.0-Lite

bytedance-seed/seed-2.0-lite from Bytedance Seed, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2500

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed-2.0-Mini

bytedance-seed/seed-2.0-mini from Bytedance Seed, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1000

chatresponses
Bytedance SeedBytedance Seed

bytedance-seed/seedream-4.5

bytedance-seed/seedream-4.5 from Bytedance Seed, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
CanopylabsCanopylabs

canopylabs/orpheus-3b-0.1-ft

canopylabs/orpheus-3b-0.1-ft from Canopylabs, optimized for chat completions workloads and available through Yi-AI.

Input

$7.0000

chatresponses
CognitivecomputationsCognitivecomputations

Venice: Uncensored (free)

cognitivecomputations/dolphin-mistral-24b-venice-edition:free from Cognitivecomputations, optimized for chat completions workloads and available through Yi-AI.

Input

Free

chatresponses
CohereCohere

Cohere: Command A

cohere/command-a from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$2.5000

chatresponses
CohereCohere

Cohere: Command R (08-2024)

cohere/command-r-08-2024 from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1500

chatresponses
CohereCohere

Cohere: Command R+ (08-2024)

cohere/command-r-plus-08-2024 from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$2.5000

chatresponses
CohereCohere

Cohere: Command R7B (12-2024)

cohere/command-r7b-12-2024 from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$0.0375

chatresponses
CohereCohere

cohere/rerank-4-fast

cohere/rerank-4-fast from Cohere, optimized for rerank scoring workloads and available through Yi-AI.

Input

Free

rerank
CohereCohere

cohere/rerank-4-pro

cohere/rerank-4-pro from Cohere, optimized for rerank scoring workloads and available through Yi-AI.

Input

Free

rerank
CohereCohere

cohere/rerank-v3.5

cohere/rerank-v3.5 from Cohere, optimized for rerank scoring workloads and available through Yi-AI.

Input

Free

rerank
DeepcogitoDeepcogito

Deep Cogito: Cogito v2.1 671B

deepcogito/cogito-v2.1-671b from Deepcogito, optimized for chat completions workloads and available through Yi-AI.

Input

$1.2500

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3

deepseek/deepseek-chat from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.3200

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3 0324

deepseek/deepseek-chat-v3-0324 from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2000

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.1

deepseek/deepseek-chat-v3.1 from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1500

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.1 Terminus

deepseek/deepseek-v3.1-terminus from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2100

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.2

deepseek/deepseek-v3.2 from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2520

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.2 Exp

deepseek/deepseek-v3.2-exp from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.2700

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.2 Speciale

deepseek/deepseek-v3.2-speciale from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.4000

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V4 Flash

deepseek/deepseek-v4-flash from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.1400

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V4 Pro

deepseek/deepseek-v4-pro from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.4350

chatresponses
Showing 80 of 449 models. Use search to narrow results.
YiAI Router

Disclaimer: this gateway provides B2B routing infrastructure only. Data transit is completed through offshore non-jurisdictional nodes. Developers are responsible for complying with local laws and for the content they generate.

Account

Language

© 2026 YiAI Infrastructure.Zero Logs · Volatile Compute · Offshore Routed