Model Directory
Every Top AI Model, One API
Access GPT-5, Claude Opus, Gemini 3 Pro, Grok 4, and more through a single, unified endpoint.
GPT-5
OpenAI · 128k context
OpenAI's most capable model. Excels at complex reasoning, creative writing, and multi-step problem solving. Successor to GPT-4o, with significantly improved accuracy and fewer hallucinations.
GPT-5 Mini
OpenAI · 128k context
Smaller, faster version of GPT-5. Great balance of capability and cost for production applications. Ideal for chatbots, summarization, and classification tasks.
GPT-4o
OpenAI · 128k context
OpenAI's multimodal flagship. Processes text, images, and audio natively. Fast response times with strong reasoning capabilities.
GPT-4o Mini
OpenAI · 128k context
Ultra-affordable model for high-volume tasks. Best quality for the cost in the GPT family. Perfect for classification, extraction, and simple generation.
GPT-4.1
OpenAI · 1M context
Offers a 1-million-token context window. Optimized for long-document analysis, large codebases, and retrieval tasks. Strong coding performance.
GPT-4.1 Mini
OpenAI · 1M context
Budget-friendly model with 1M context. Excellent for document processing and code review where long context is needed but top-tier reasoning isn't.
o3
OpenAI · 200k context
OpenAI's most powerful reasoning model. Uses extended thinking to solve complex math, science, and coding problems. Best-in-class for STEM tasks.
o4-mini
OpenAI · 200k context
Fast, affordable reasoning model. Matches o3 on many benchmarks at a fraction of the cost. Great for code generation and structured problem solving.
Claude Opus 4.6
Anthropic · 200k context
Anthropic's most capable model. Exceptional at nuanced writing, complex analysis, and following detailed instructions. Supports extended thinking for difficult problems.
Claude Sonnet 4
Anthropic · 200k context
Best balance of intelligence and speed in the Claude family. Strong coding abilities, reliable instruction following, and excellent for production use cases.
Claude Haiku 3.5
Anthropic · 200k context
Fast, cost-effective Claude model. Near-instant responses for classification, extraction, and customer-facing applications where speed matters.
Gemini 3 Pro
Google · 1M context
Google's flagship model with native multimodal capabilities. Excels at image understanding, code generation, and multi-step reasoning across a 1M-token context.
Gemini 2.5 Flash
Google · 1M context
Ultra-fast and ultra-cheap. Best model for high-volume inference with 1M context. Ideal for RAG pipelines, classification, and data processing.
Gemini 2.5 Pro
Google · 1M context
Previous-generation flagship with proven reliability. Strong coding and analysis capabilities at a lower price point than Gemini 3 Pro.
Grok 4
xAI · 256k context
xAI's most powerful model. Strong at reasoning, real-time knowledge, and unconventional problem solving.
Grok 4.1 Fast
xAI · 256k context
Blazing fast inference at rock-bottom prices. Great for real-time applications and high-volume processing.
Llama 4 Scout
Meta · 512k context
Meta's open-weight model with 512k context. Strong general capabilities and excellent value for open-weight advocates who want API convenience.
DeepSeek V3
DeepSeek · 128k context
High-performance model at exceptional value. Strong coding and reasoning from the DeepSeek team. Popular choice for budget-conscious developers.
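Not sure which models can hold your prompt? Here is a minimal sketch that filters the directory above by context window. It assumes "k" means 1,000 tokens and "M" means 1,000,000 tokens; the `models_that_fit` helper and the `CONTEXT_WINDOWS` table are illustrative names, not part of any API.

```python
# Context windows as listed in the directory above.
# Assumption: "128k" is read as 128,000 tokens and "1M" as 1,000,000 tokens.
CONTEXT_WINDOWS = {
    "GPT-5": 128_000,
    "GPT-5 Mini": 128_000,
    "GPT-4o": 128_000,
    "GPT-4o Mini": 128_000,
    "GPT-4.1": 1_000_000,
    "GPT-4.1 Mini": 1_000_000,
    "o3": 200_000,
    "o4-mini": 200_000,
    "Claude Opus 4.6": 200_000,
    "Claude Sonnet 4": 200_000,
    "Claude Haiku 3.5": 200_000,
    "Gemini 3 Pro": 1_000_000,
    "Gemini 2.5 Flash": 1_000_000,
    "Gemini 2.5 Pro": 1_000_000,
    "Grok 4": 256_000,
    "Grok 4.1 Fast": 256_000,
    "Llama 4 Scout": 512_000,
    "DeepSeek V3": 128_000,
}


def models_that_fit(required_tokens: int) -> list[str]:
    """Return models whose listed window is at least required_tokens.

    Results are ordered from the smallest sufficient window to the largest,
    with ties broken alphabetically by model name.
    """
    fits = [
        (window, name)
        for name, window in CONTEXT_WINDOWS.items()
        if window >= required_tokens
    ]
    return [name for _window, name in sorted(fits)]


if __name__ == "__main__":
    # Hypothetical workload: a prompt of roughly 300,000 tokens.
    for name in models_that_fit(300_000):
        print(name)
```

The listed window is an upper bound on prompt plus response, so leave headroom for the model's output when choosing `required_tokens`.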
Don't See Your Model?
We add new models weekly. Request a model and we'll prioritize it.
Get Free API Key →