Discover the right AI model

Compare specifications, benchmarks, and pricing for 650+ large language models. Make informed decisions for your AI projects.

650+
Models
90+
Companies
38+
Providers

Latest News

Recent announcements and model releases

All news

Featured Models

Top performing models from leading AI labs

View all

OpenAI: GPT-5.1

OpenAI

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. The model produces clearer, more grounded explanations with reduced jargon, making it easier to follow even on technical or multi-step problems. Built for broad task coverage, GPT-5.1 delivers consistent gains across math, coding, and structured analysis workloads, with more coherent long-form answers and improved tool-use reliability. It also features refined conversational alignment, enabling warmer, more intuitive responses without compromising precision. GPT-5.1 serves as the primary full-capability successor to GPT-5

400K

Google: Gemini 3 Pro

Google DeepMind

Gemini 3 Pro is Google DeepMind's flagship frontier AI model released November 18, 2025. It achieves 91.9% on GPQA Diamond (PhD-level reasoning), 1501 Elo on LMArena, and introduces Deep Think mode for extended reasoning (93.8% GPQA Diamond, 45.1% ARC-AGI-2). Features include 1M token context, multimodal input (text, images, video, audio, PDFs), Vibe Coding for autonomous development, and Generative UI capabilities.

1.0M

xAI: Grok 4

xAI

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not exposed, reasoning cannot be disabled, and the reasoning effort cannot be specified. Pricing increases once the total tokens in a given request is greater than 128k tokens. See more details on the [xAI docs](https://docs.x.ai/docs/models/grok-4-0709)

256K

Anthropic: Claude Opus 4.5

Anthropic

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and reasoning benchmarks, and improved robustness to prompt injection. The model is designed to operate efficiently across varied effort levels, enabling developers to trade off speed, depth, and token usage depending on task requirements. It comes with a new parameter to control token efficiency, which can be accessed using the OpenRouter Verbosity parameter with low, medium, or high. Opus 4.5 supports advanced tool use, extended context management, and coordinated multi-agent setups, making it well-suited for autonomous research, debugging, multi-step planning, and spreadsheet/browser manipulation. It delivers substantial gains in structured reasoning, execution reliability, and alignment compared to prior Opus generations, while reducing token overhead and improving performance on long-running tasks.

200K

Meta: Llama 4 Maverick

Meta AI

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass (400B total). It supports multilingual text and image input, and produces multilingual text and code output across 12 supported languages. Optimized for vision-language tasks, Maverick is instruction-tuned for assistant-like behavior, image reasoning, and general-purpose multimodal interaction. Maverick features early fusion for native multimodality and a 1 million token context window. It was trained on a curated mixture of public, licensed, and Meta-platform data, covering ~22 trillion tokens, with a knowledge cutoff in August 2024. Released on April 5, 2025 under the Llama 4 Community License, Maverick is suited for research and commercial applications requiring advanced multimodal understanding and high model throughput.

171.0M

DeepSeek R1

DeepSeek

DeepSeek's reasoning model with extended thinking for complex problem solving.

671.0B64K

Why LLMIndex?

Everything you need to evaluate and choose the right AI model for your use case.

Comprehensive Data

Detailed specifications, benchmarks, and pricing for every major LLM.

Always Current

Daily updates ensure you have the latest model information.

Open & Unbiased

Independent data from official sources. No vendor preference.

Ready to find your model?

Browse our database or use our comparison tool to evaluate models side by side.