Releases, benchmarks, open-source drops, and industry shifts — refreshed regularly so you can see what’s newest at a glance.
Updated June 2026 · refreshed regularly
Anthropic's Claude Opus 4.8 debuted at the top of the Artificial Analysis Intelligence Index and leads aggregate leaderboards.
Artificial Analysis →
Claude reached a $30B annualized revenue run rate by the end of Q1 2026, roughly 80x growth in a single quarter.
LLM-Stats →
Google launched Gemini 3.5 at I/O 2026 and committed to shipping Gemini 3.5 Pro in June, leaning hard into agentic capabilities.
WaveSpeed →
Gemini 3.5 Flash rivals large flagships while running ~4x faster, priced around $1.50 / 1M input tokens.
LLM-Stats →
GPT-5.5 is OpenAI's current frontier model, with Pro and Instant variants spanning quality and latency needs.
WaveSpeed →
Agentic coding model with a 256K context window under a Modified MIT license, using ~30% fewer reasoning tokens than K2.6.
LLM-Stats →
A diffusion-based reasoning model that generates tokens in parallel, targeting agentic loops and real-time voice.
LLM-Stats →
A frontier model beats Gemini 3.1 Pro on Terminal-Bench 2.1 (76.2%) and MCP Atlas (83.6%) with ~4x faster output.
LLM-Stats →
A customizable multimodal safety model for enterprise AI, with guardrails across text, image and more.
NVIDIA →
GitHub shifted Copilot to metered billing as inference costs from agentic coding sessions rose.
GitHub →
13+ hosted frontier models ship context windows of 1M+ tokens; Grok 4 Fast exposes ~2.0M and Llama 4 Scout reaches 10M tokens.
Morph →
Llama, Mistral and Qwen now meet or exceed GPT-4-class scores on several public benchmarks.
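ComputingForGeeks →
The global and Chinese LLM ecosystems are, to some extent, decoupled, so this section (originally presented in Chinese) covers the latest landscape and developments among Chinese models separately.
DeepSeek, GLM (Zhipu), Qwen (Alibaba Tongyi), and Kimi (Moonshot AI) compete head-on with international vendors on open source and cost-effectiveness.
人人都是产品经理 →
MiniMax released M2.5, which uses 10B active parameters for efficient inference in agent scenarios; with 3.07T tokens of weekly calls, it briefly topped OpenRouter.
知乎 →
The above is compiled from public reports and is for reference only; official information from each vendor takes precedence.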