Icon MANAGE ACCOUNT

Quick access to view and manage your account.

API KEY

Copy and manage your keys

b361••••••••••••••••••••••b19b

FREE

Add a credit card to unlock Build Tier 1

60 RPMRate Limit ⓘ

BALANCE

Your current balance is

$0.95Credits remaining

Icon MY MODLES

View and deploy your fine-tuned models and dedicated endpoints.

MODEL / ENDPOINTENDPOINTSTYPECREATEDPRICE
Your fine-tuned models and Dedicated endpoint models will appear here.

Icon START HERE

Start using additional features of agirouter API or request your own dedicated capacity.

SERVERLESS MODELS

Access 100+ chat, language, image, and code models through serverless endpoints or our playgrounds

GO TO PLAYGROUND

DEDICATED ENDPOINTS

Host open-source, fine-tuned, or custom trained models on a dedicated endpoint configured to your needs and specifications

CREATE ENDPOINTS

EMBEDDINGS

Build your own RAG applications with access to leading embeddings models

VIEW EMBEDDINGS

FINE-TUNE

Customize leading open-source models with your own private data for higher accuracy on your domain tasks.

FINE-TUNE DOCS

GPU CLUSTERS

Clusters with 16-1000+ interconnected NVIDIA H100 or H200 GPUs, from $1.99 / hour.

REQUEST CLUSTER

ANALYTICS

Track total requests, errors, latency, tokens per minute, and success rates per model.

VIEW ANALYTICS

Icon PLAN CARDS

Choose the plan and tier that fits your generative AI project, and scale seamlessly as you grow.

Build

Rate limits up to

6,000

Requests per minute

  • $1.00 free credits
  • Fully pay-as-you-go, and easily add credits
  • Monitoring dashboard with 24hr data
  • Email and in-app chat support
  • No daily rate limits, up to 6000 requests
  • Deploy on-demand dedicated endpoints (no rate limits)

Scale

Rate limits up to

9,000

Requests per minute

  • Premium support
  • HIPAA compliance
  • Support via private Slack channel
  • Monitoring dashboard with 30-day data (Coming soon)
  • Advanced dedicated endpoint configuration
  • Up to 9,000 requests per minute
  • 99% availability dedicated endpoints SLA

Enterprise

Rate limits up to

UNLIMITED

Requests per minute

  • Enterprise grade security & compliance
  • VPC and on-prem deployments
  • Monitoring dashboard with 1 year data (Coming soon)
  • Dedicated success representative
  • Custom rate limits
  • 99.9% dedicated endpoints SLA with geo redundancy
  • Priority access to hardware including H100 & H200 GPUs

Icon BUILD TIERS

Build plan rates and requirements. For more details, please read our docs.

Build TiersTotal SpendLLMSEmbeddingsRe-Rank
Build Tier 1Add Credit Card600 RPM3,000 RPM500,000
Build Tier 2$50.001,800 RPM5,000 RPM1,500,000
Build Tier 3$100.003,000 RPM5,000 RPM2,000,000
Build Tier 4$250.004,500 RPM10,000 RPM3,000,000
Build Tier 5$1,000.006,000 RPM10,000 RPM5,000,000

WE'RE HERE TO HELP

Harness the power of dedicated AI hardware tailored for your needs. Ensure peak performance and seamless operations with our monthly reservation plan.

CONTACT US
Flower Illustration