One LLM API gateway
for all your model calls

Use DeepSeek, Qwen, GLM and Kimi through one OpenAI-compatible API. Unified tokens, billing, rate limits and routing for developers.

$0.14/MDeepSeek V4 input from
30+models connected
99.9%30-day availability target
OpenAI-compatibledrop-in base URL switch

Supported Models

One token for mainstream LLMs. View the full model list.

Usage-based pricing

No subscription lock-in. Recharge what you need, spend by token, and keep your balance valid.

Default

Basic access

1.0x vendor price
  • DeepSeek V4 Flash
  • GLM-4-Flash
  • Kimi 8K
  • Enabled for all users
Recommended

Pro models

1.3x vendor price
  • DeepSeek V4 Pro
  • Qwen-Max
  • GLM-4-Plus
  • Higher reasoning capability
Advanced

VIP models

1.6x vendor price
  • Kimi 128K
  • Qwen-VL-Max
  • Multimodal and long context
  • Priority support for teams

Start in three steps

Register, create a token, and switch your client base URL.

1

Create an account

Email registration. Recharge by the available payment methods in your region.

2

Create an API token

Set quota, expiration and allowed model groups from the dashboard.

3

Call the API

curl https://we2ai.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-xxxx" \
  -d '{"model":"deepseek-chat",
       "messages":[{"role":"user",
       "content":"hello"}]}'

FAQ

How is this different from using each vendor directly?

You get one token, one bill, one endpoint and one set of logs instead of registering and funding every provider separately.

Do you store prompts and responses?

We only keep request metadata such as time, model and token usage for billing and troubleshooting. Prompt and response bodies are not retained by default.

What happens when a model provider fails?

Routes can be configured with multiple upstream channels. If one channel fails, traffic can be shifted to another compatible channel.