Model Directory

Every Top AI Model, One API

Access GPT-5, Claude Opus, Gemini 3 Pro, Grok 4, and more through a single, unified endpoint.

GPT-5

OpenAI · 128k context

OpenAI's most capable model. Excels at complex reasoning, creative writing, and multi-step problem solving. Successor to GPT-4o with significantly improved accuracy and reduced hallucination.

GPT-5 Mini

OpenAI · 128k context

5% off

Smaller, faster version of GPT-5. Great balance of capability and cost for production applications. Ideal for chatbots, summarization, and classification tasks.

GPT-4o

OpenAI · 128k context

5% off

OpenAI's multimodal flagship. Processes text, images, and audio natively. Fast response times with strong reasoning capabilities.

GPT-4o Mini

OpenAI · 128k context

5% off

Ultra-affordable model for high-volume tasks. Best cost-per-quality ratio in the GPT family. Perfect for classification, extraction, and simple generation.

GPT-4.1

OpenAI · 1M context

5% off

1 million token context window. Optimized for long-document analysis, codebases, and retrieval tasks. Strong coding performance.

GPT-4.1 Mini

OpenAI · 1M context

5% off

Budget-friendly model with 1M context. Excellent for document processing and code review where long context is needed but top-tier reasoning isn't.

o3

OpenAI · 200k context

5% off

OpenAI's most powerful reasoning model. Uses extended thinking to solve complex math, science, and coding problems. Best-in-class for STEM tasks.

o4-mini

OpenAI · 200k context

5% off

Fast, affordable reasoning model. Matches o3 on many benchmarks at a fraction of the cost. Great for code generation and structured problem solving.

Claude Opus 4.6

Anthropic · 200k context

5% off

Anthropic's most capable model. Exceptional at nuanced writing, complex analysis, and following detailed instructions. Extended thinking for difficult problems.

Claude Sonnet 4

Anthropic · 200k context

5% off

Best balance of intelligence and speed in the Claude family. Strong coding abilities, reliable instruction following, and excellent for production use cases.

Claude Haiku 3.5

Anthropic · 200k context

5% off

Fast, cost-effective Claude model. Near-instant responses for classification, extraction, and customer-facing applications where speed matters.

Gemini 3 Pro

Google · 1M context

5% off

Google's flagship model with native multimodal capabilities. Excels at image understanding, code generation, and multi-step reasoning across 1M token context.