AI Changelog

Every major AI model update, in one place. Updated through Feb 2026.

35updates tracked
6providers
24high impact
๐Ÿ”
Filter:|
Showing 35 of 35 entries

February 2026

3
๐Ÿงก AnthropicClaude 3.7 Sonnet๐Ÿš€ Model Release๐Ÿ”ด high impact
2026-02-24

Claude 3.7 Sonnet with Extended Thinking

Anthropic releases Claude 3.7 Sonnet, featuring extended thinking mode that can reason for up to 128K tokens before responding. Sets new SOTA on coding and math benchmarks. Available on API immediately.

Read announcement โ†—
๐Ÿ–ค xAIGrok 3๐Ÿš€ Model Release๐Ÿ”ด high impact
2026-02-17

Grok 3 announced โ€” trained on 100K GPUs

xAI announces Grok 3, trained on a 100K H100 cluster (10x Grok 2). Claims state-of-the-art on AIME and GPQA benchmarks. Available to Premium+ subscribers first.

Read announcement โ†—
๐Ÿ’™ GoogleGemini 2.0๐Ÿš€ Model Release๐Ÿ”ด high impact
2026-02-05

Gemini 2.0 Flash generally available

Google releases Gemini 2.0 Flash as GA, featuring native tool use (search, code execution), multimodal live API, and 1M context window. Twice the speed of 1.5 Flash at same price.

Read announcement โ†—

January 2026

5
๐Ÿ’š OpenAIo3-mini๐Ÿš€ Model Release๐Ÿ”ด high impact
2026-01-31

o3-mini released โ€” fast reasoning at low cost

OpenAI releases o3-mini, a compact reasoning model that competes with larger models on STEM tasks. Three effort levels: low, medium, high. Dramatically cheaper than o3 full.

Read announcement โ†—
๐Ÿ–ค xAIโœจ Feature๐ŸŸก medium impact
2026-01-20

Grok gets DeepSearch โ€” multi-step web reasoning

xAI launches DeepSearch, a feature allowing Grok to iteratively search the web and reason across multiple sources before answering. Similar to Perplexity Deep Research.

Read announcement โ†—
๐Ÿงก AnthropicClaude 3.5 Haiku๐Ÿš€ Model Release๐Ÿ”ด high impact
2026-01-15

Claude 3.5 Haiku generally available

Claude 3.5 Haiku is now GA after limited beta. Outperforms the previous Claude 3 Opus on most benchmarks at a fraction of the cost. Priced at $0.80/M input and $4/M output tokens.

Read announcement โ†—
๐Ÿ’™ Google๐Ÿ’ฐ Pricing๐Ÿ”ด high impact
2026-01-14

Gemini Flash 2.0 pricing โ€” effectively free

Google prices Gemini 2.0 Flash at $0.075/M input and $0.30/M output โ€” 10x cheaper than GPT-4o. Free tier: 15 RPM, 1M TPM with no cost. Aggressive push for developer adoption.

Read announcement โ†—
โค๏ธ MistralMistral Small 3๐Ÿš€ Model Release๐ŸŸก medium impact
2026-01-13

Mistral Small 3 โ€” best small open-source model

Mistral releases Small 3 (24B), trained with a novel distillation technique. Outperforms Llama 3.1 70B on most benchmarks at 3x fewer parameters. Apache 2.0 licensed.

Read announcement โ†—

December 2025

4
๐Ÿ’š OpenAIo3๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-12-20

o3 announced โ€” beats ARC-AGI benchmark

OpenAI announces o3, achieving 87.5% on ARC-AGI (a test of general reasoning). Not yet released publicly, but marks a significant milestone. Safety evaluations ongoing.

Read announcement โ†—
๐Ÿ’™ Googleโœจ Feature๐ŸŸก medium impact
2025-12-18

Veo 2 โ€” state-of-the-art video generation

Google announces Veo 2, a video generation model that produces 4K video with unprecedented quality and camera control. Available through VideoFX and coming to Gemini.

Read announcement โ†—
๐Ÿ’™ GoogleGemini 2.0๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-12-11

Gemini 2.0 โ€” agentic era begins

Google announces the Gemini 2.0 family, designed for an "agentic era" with native tool calling, streaming, and a new Live API for real-time multimodal interactions.

Read announcement โ†—
๐Ÿงก MetaLlama 3.3 70B๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-12-06

Llama 3.3 70B โ€” matches 405B performance

Meta releases Llama 3.3 70B, a fine-tuned instruction model that matches Llama 3.1 405B performance on most benchmarks at 6x less compute. Best open-source model per parameter.

Read announcement โ†—

November 2025

4
๐Ÿ’š OpenAIโœจ Feature๐ŸŸก medium impact
2025-11-20

ChatGPT Canvas โ€” collaborative writing & coding

Canvas is a new ChatGPT interface for working on long documents and code side-by-side with Claude. Supports inline edits, comments, and version history.

Read announcement โ†—
๐Ÿ’™ Googleโœจ Feature๐ŸŸก medium impact
2025-11-19

Deep Research launched in Gemini

Deep Research uses Gemini 1.5 Pro to autonomously search the web, synthesize findings, and produce detailed research reports. Available to Gemini Advanced subscribers.

Read announcement โ†—
๐Ÿงก Anthropicโœจ Feature๐Ÿ”ด high impact
2025-11-10

Computer Use (beta) โ€” Claude controls desktops

Anthropic releases Computer Use, allowing Claude to view screenshots and control mouse/keyboard on real computers. First LLM with native computer control. Available via API.

Read announcement โ†—
๐Ÿ–ค xAIGrok 2๐Ÿ”ง API Change๐ŸŸก medium impact
2025-11-04

xAI API launch โ€” OpenAI-compatible

xAI launches its public API with OpenAI-compatible endpoints. Drop-in replacement for OpenAI in many apps. Priced at $2/M input, $10/M output for grok-2-1212.

Read announcement โ†—

October 2025

3
๐Ÿงก AnthropicClaude 3.5 Sonnet๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-10-22

Claude 3.5 Sonnet (updated oct-2024)

Significant upgrade to Claude 3.5 Sonnet with major improvements to coding, especially tool use, multi-step tasks, and code editing. Computer Use public beta launched alongside.

Read announcement โ†—
โค๏ธ Mistralโœจ Feature๐ŸŸก medium impact
2025-10-14

Le Chat Pro launches โ€” European Claude alternative

Mistral launches Le Chat Pro, a premium subscription to their AI assistant with web search, image generation, and document analysis. Positioned as a privacy-first alternative for European users.

Read announcement โ†—
๐Ÿ’š OpenAIโœจ Feature๐ŸŸก medium impact
2025-10-01

Realtime API โ€” 300ms voice latency

The Realtime API enables sub-300ms voice conversations with GPT-4o, with function calling and VAD (voice activity detection) built in. Built for always-on voice agents.

Read announcement โ†—

September 2025

4
๐Ÿงก MetaLlama 3.2๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-09-25

Llama 3.2 โ€” multimodal + small on-device models

Meta releases Llama 3.2 with vision capabilities (11B and 90B) and lightweight models (1B and 3B) optimized for mobile and edge deployment. First multimodal open-source Llama.

Read announcement โ†—
โค๏ธ MistralPixtral๐Ÿš€ Model Release๐ŸŸก medium impact
2025-09-17

Pixtral 12B โ€” first Mistral vision model

Mistral releases Pixtral 12B, a multimodal model combining a 12B language model with a 400M vision encoder. Open-source, supports variable image resolutions.

Read announcement โ†—
๐Ÿงก Anthropic๐Ÿ’ฐ Pricing๐Ÿ”ด high impact
2025-09-12

Prompt caching launched โ€” 90% cost reduction

Prompt caching allows reusing prompt prefixes across API calls. Up to 90% cost reduction and 85% latency improvement for repeated prompts. Available for Claude 3 family.

Read announcement โ†—
๐Ÿ’š OpenAIo1๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-09-12

OpenAI o1 โ€” chain-of-thought reasoning model

o1 is OpenAI's first model trained with reinforcement learning to reason before responding. Excels at complex math, science, and coding. Available to ChatGPT Plus and API users.

Read announcement โ†—

August 2025

2
๐Ÿ–ค xAIGrok 2๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-08-13

Grok 2 and Grok 2 mini released

xAI releases Grok 2 and Grok 2 mini with significant improvements over Grok 1.5. Real-time X.com data access and new image generation via Aurora model. API available.

Read announcement โ†—
๐Ÿ’š OpenAIโœจ Feature๐Ÿ”ด high impact
2025-08-06

Structured Outputs โ€” 100% schema conformance

OpenAI launches Structured Outputs, guaranteeing model outputs match any JSON schema exactly. Eliminates parsing errors in production. Supported across GPT-4o and newer.

Read announcement โ†—

July 2025

4
โค๏ธ MistralMistral Large 2๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-07-24

Mistral Large 2 โ€” 123B parameter model

Mistral releases Large 2 with 123B parameters, achieving GPT-4 level performance on coding and math. Supports 128K context, function calling, and 80+ languages.

Read announcement โ†—
๐Ÿงก MetaLlama 3.1๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-07-23

Llama 3.1 405B โ€” first frontier open-source

Meta releases Llama 3.1, including a 405B parameter model that competes with GPT-4o and Claude 3.5 Sonnet. Fully open weights. Supports 128K context, 8 languages.

Read announcement โ†—
๐Ÿ’š OpenAIGPT-4o mini๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-07-18

GPT-4o mini โ€” cheapest capable model

GPT-4o mini replaces GPT-3.5 Turbo as the go-to cheap model. Scores 82% on MMLU (higher than GPT-4 at launch). Priced at $0.15/M input, $0.60/M output.

Read announcement โ†—
๐Ÿงก Anthropicโœจ Feature๐Ÿ”ด high impact
2025-07-15

Model Context Protocol (MCP) open-sourced

Anthropic open-sources the Model Context Protocol, a universal standard for connecting LLMs to external tools, data, and services. Already has 40+ official server implementations.

Read announcement โ†—

June 2025

1
๐Ÿงก AnthropicClaude 3.5 Sonnet๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-06-20

Claude 3.5 Sonnet released

First model in the Claude 3.5 family. Sets new benchmark records on coding (HumanEval), reasoning (GPQA), and vision tasks. 2x faster than Claude 3 Opus at lower cost.

Read announcement โ†—

May 2025

2
๐Ÿ’™ GoogleGemini 1.5 Pro๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-05-14

Gemini 1.5 Pro โ€” 1M context window

Gemini 1.5 Pro launches with a 1M token context window (2M in private preview). Processes entire codebases, hour-long videos, or thousands of documents in one call.

Read announcement โ†—
๐Ÿ’š OpenAIGPT-4o๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-05-13

GPT-4o โ€” omni model with native voice/vision

GPT-4o ("omni") natively processes text, audio, and images in a single model. Real-time voice conversations with emotion detection. 2x faster than GPT-4 Turbo, 50% cheaper.

Read announcement โ†—

April 2025

2
๐Ÿงก MetaLlama 3๐Ÿš€ Model Release๐Ÿ”ด high impact
2025-04-18

Llama 3 released โ€” 8B and 70B

Meta releases Llama 3 with 8B and 70B models, significantly outperforming Llama 2. Trained on 15T tokens, showing major improvements in reasoning, code, and instruction following.

Read announcement โ†—
๐Ÿ’š OpenAIโš ๏ธ Deprecation๐ŸŸก medium impact
2025-04-11

GPT-4 Turbo deprecated

OpenAI deprecates the original GPT-4 Turbo (gpt-4-turbo-preview and gpt-4-0125-preview). Customers redirected to GPT-4o, which outperforms at lower cost. Shutdown date: June 2025.

Read announcement โ†—

March 2025

1
๐Ÿงก Anthropic๐Ÿ”ง API Change๐ŸŸก medium impact
2025-03-04

Files API and Citations released

New Files API allows storing documents server-side for repeated use across conversations. Citations feature returns precise document references with exact quotes for RAG workflows.

Read announcement โ†—