Model Catalog

Compare leading Chinese and global model routes by input/output pricing and use case. Data is for display; availability follows console and admin configuration.

Not sure where to start? Read theintegration docs

DeepSeek

DeepSeek V4 Flash

Hot

A highly watched Chinese frontier model for coding, reasoning and cost-sensitive production workloads.

Context
128K+
Speed
Balanced
Input $/1M
$0.5
Output $/1M
$1.5
Chinacodingreasoning
Routes depend on console configuration

Moonshot AI

Kimi K2.6

Hot

Strong long-context and agentic workflow model from Moonshot AI, useful for research, coding and complex software tasks.

Context
256K+
Speed
Balanced
Input $/1M
$0.6
Output $/1M
$2.5
Chinalong contextagents
Routes depend on console configuration

Alibaba Cloud

Qwen3.5 Plus

Hot

Alibaba's broad model family is popular for multimodal work, open-weight coverage and developer ecosystem depth.

Context
128K+
Speed
Fast
Input $/1M
$1.2
Output $/1M
$4.8
Chinamultilingualopen weights
Routes depend on console configuration

Z.ai

GLM-4.7 Flash

A Chinese enterprise-focused model family with strong reasoning, coding and low-latency deployment flexibility.

Context
128K
Speed
Balanced
Input $/1M
$0.8
Output $/1M
$3.2
Chinaenterprisecoding
Routes depend on console configuration

ByteDance

Doubao Seed2.0

New

Popular in ByteDance's AI ecosystem for conversational, multimodal and consumer-facing workloads at scale.

Context
128K
Speed
Fast
Input $/1M
$0.7
Output $/1M
$2.8
Chinamultimodalconsumer
Routes depend on console configuration

Baidu

ERNIE 4.5

Baidu's enterprise model line is useful for Chinese-language knowledge, search-adjacent and business workflows.

Context
128K
Speed
Balanced
Input $/1M
$0.9
Output $/1M
$3.6
Chinaenterpriseknowledge
Routes depend on console configuration

OpenAI

GPT-4o

Flagship multimodal model with strong speed, reasoning and coding coverage.

Context
128K
Speed
Fast
Input $/1M
$5
Output $/1M
$15
multimodalreasoning
Routes depend on console configuration

Anthropic

Claude 3.5 Sonnet

Reliable long-context reasoning for complex instructions, review and coding tasks.

Context
200K
Speed
Balanced
Input $/1M
$3
Output $/1M
$15
long contextcoding
Routes depend on console configuration

Google

Gemini 2.5 Flash

Fast, cost-efficient responses for high-frequency Q&A and batch generation.

Context
1M
Speed
Very fast
Input $/1M
$0.35
Output $/1M
$1.05
fastlow cost
Routes depend on console configuration

Meta

Llama 3.1 405B

Large open-weight family for private deployments and benchmark comparisons.

Context
128K
Speed
Slower
Input $/1M
$3
Output $/1M
$3
open weightsreasoning
Routes depend on console configuration

Mistral AI

Mistral Large

European general-purpose model with balanced tool calling and multilingual coverage.

Context
128K
Speed
Balanced
Input $/1M
$2
Output $/1M
$6
tool calling
Routes depend on console configuration

Need another model? Submit a request and operations can evaluate the route.

Submit request