Quick access to view and manage your account.
Copy and manage your API keys.
Add a credit card to unlock Build Tier 1.
View and deploy your fine-tuned models and dedicated endpoints.
| MODEL / ENDPOINT | ENDPOINTS | TYPE | CREATED | PRICE |
|---|---|---|---|---|
| Your fine-tuned models and dedicated endpoint models will appear here. | | | | |
Start using additional features of the agirouter API or request your own dedicated capacity.
Access 100+ chat, language, image, and code models through serverless endpoints or our playgrounds.
Host open-source, fine-tuned, or custom-trained models on a dedicated endpoint configured to your needs and specifications.
Build your own RAG applications with access to leading embedding models.
Customize leading open-source models with your own private data for higher accuracy on your domain tasks.
Clusters with 16-1000+ interconnected NVIDIA H100 or H200 GPUs, from $1.99 / hour.
Track total requests, errors, latency, tokens per minute, and success rates per model.
Choose the plan and tier that fits your generative AI project, and scale seamlessly as you grow.
Rate limits up to 6,000 requests per minute
Rate limits up to 9,000 requests per minute
Rate limits up to UNLIMITED requests per minute
Build plan rates and requirements. For more details, please read our docs.
| Build Tiers | Total Spend | LLMs | Embeddings | Re-Rank |
|---|---|---|---|---|
| Build Tier 1 | Add Credit Card | 600 RPM | 3,000 RPM | 500,000 |
| Build Tier 2 | $50.00 | 1,800 RPM | 5,000 RPM | 1,500,000 |
| Build Tier 3 | $100.00 | 3,000 RPM | 5,000 RPM | 2,000,000 |
| Build Tier 4 | $250.00 | 4,500 RPM | 10,000 RPM | 3,000,000 |
| Build Tier 5 | $1,000.00 | 6,000 RPM | 10,000 RPM | 5,000,000 |
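The tier table above can be read as a simple lookup from total spend to rate limits. The following is a minimal sketch, not an official client: the names `BuildTier`, `tier_for_spend`, and `min_seconds_between_requests` are hypothetical, the spend thresholds are assumed to be inclusive, Build Tier 1 is assumed to apply at $0 spend once a credit card is on file, and the Re-Rank column is stored without a unit because the table does not state one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BuildTier:
    name: str
    min_spend_usd: float   # total spend required (assumed inclusive)
    llm_rpm: int           # LLM requests per minute
    embeddings_rpm: int    # embeddings requests per minute
    rerank_limit: int      # unit is not stated in the table


# Values copied from the Build tier table, in ascending order of spend.
# Tier 1 requires a credit card on file; 0.0 is an assumed threshold.
BUILD_TIERS = [
    BuildTier("Build Tier 1", 0.0, 600, 3_000, 500_000),
    BuildTier("Build Tier 2", 50.0, 1_800, 5_000, 1_500_000),
    BuildTier("Build Tier 3", 100.0, 3_000, 5_000, 2_000_000),
    BuildTier("Build Tier 4", 250.0, 4_500, 10_000, 3_000_000),
    BuildTier("Build Tier 5", 1_000.0, 6_000, 10_000, 5_000_000),
]


def tier_for_spend(total_spend_usd: float) -> BuildTier:
    """Return the highest tier whose spend threshold has been met."""
    if total_spend_usd < 0:
        raise ValueError("total spend cannot be negative")
    eligible = [t for t in BUILD_TIERS if total_spend_usd >= t.min_spend_usd]
    return eligible[-1]


def min_seconds_between_requests(rpm: int) -> float:
    """Evenly spaced request interval that stays within an RPM limit."""
    return 60.0 / rpm


if __name__ == "__main__":
    tier = tier_for_spend(75.0)
    print(tier.name, tier.llm_rpm, min_seconds_between_requests(tier.llm_rpm))
```

Spacing requests evenly is only one way to stay under a per-minute limit; a client that sends bursts would need a different strategy, such as a token bucket with retries.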
Harness the power of dedicated AI hardware tailored to your needs. Ensure peak performance and seamless operations with our monthly reservation plan.