NewsroomModels & Releases

Models & Releases

New models, versions, and capabilities shipping across the frontier and open-model labs.

114 stories

Today

FeatureJun 19, 2026 · 2 min read

AWS launches Web Search on Amazon Bedrock AgentCore with proprietary index at $7 per 1,000 queries

Amazon Web Services has launched Web Search on Amazon Bedrock AgentCore as a generally available feature, giving AI agents access to a continuously updated proprietary web index without routing…

FeatureJun 19, 2026 · 3 min read

OpenAI's GPT-5.5 Instant outperforms physicians on 3,500 health questions, cutting factuality errors by 71%

OpenAI announced on June 18 that GPT-5.5 Instant — the model now powering health responses for all free ChatGPT users — outperformed physician-written answers on a set of 3,500 real-world health…

OpenAI

ReleaseJun 19, 2026 · 2 min read

Google Gemini API deprecates all Imagen 4 and Veo 2.0/3.0 models, Veo shutdown set for June 30

Google announced on June 15 the deprecation of six media generation models across its Gemini API — all three Imagen 4 image generation variants and all three Veo video generation models in the 2.0…

Google

Yesterday

FeatureJun 18, 2026 · 2 min read

xAI's Grok 4.3 arrives on Amazon Bedrock with 1M-token context and configurable reasoning

xAI's Grok 4.3 became available on Amazon Bedrock on June 15, 2026, making the reasoning-first model accessible to enterprise developers through AWS's managed AI infrastructure. The launch gives Grok…

xAI

ReleaseJun 18, 2026 · 2 min read

Z.ai releases GLM-5.2 with 1M-token lossless context and top open-source coding benchmark scores

Z.ai released GLM-5.2 on June 16, 2026, an open-source model with a 1M-token lossless context window and benchmark scores that place it at the front of available open-source coding models. The model…

Z.ai (Zhipu AI)

Earlier this week

FeatureJun 17, 2026 · 2 min read

Amazon SageMaker Async Inference now supports inline request payloads, eliminating mandatory S3 uploads

Amazon has updated SageMaker AI Async Inference to accept raw inference payloads directly in the request body, removing the long-standing requirement to stage input data in Amazon S3 before…

Earlier

FeatureJun 17, 2026 · 2 min read

Google adds real-time streaming to Gemini API speech generation with gemini-3.1-flash-tts-preview

Google updated the Gemini API on June 17, 2026 to support streaming for text-to-speech output via the gemini-3.1-flash-tts-preview model. Developers can now receive audio chunks as they are generated…

Google

FeatureJun 17, 2026 · 2 min read

Google deprecates Imagen 4 and Veo 2/3 generation models from the Gemini API, Veo shutdown on June 30

Google announced on June 15, 2026 the deprecation of its Imagen 4 image generation family and Veo 2/3 video generation models from the Gemini API. Veo users face a June 30 deadline — less than two…

Google

FeatureJun 16, 2026 · 2 min read

AWS introduces container caching in SageMaker AI, cutting model startup latency by 51%

Amazon Web Services has added container caching to Amazon SageMaker AI, removing the container image download step from the scale-out critical path and reducing model startup latency by 51% in tested…

FeatureJun 16, 2026 · 2 min read

AWS brings P-EAGLE parallelized speculative decoding to SageMaker, delivering up to 1.69x throughput speedup over EAGLE frameworks

Amazon Web Services has made P-EAGLE, a parallelized speculative decoding technique, available through Amazon SageMaker JumpStart. The method accelerates large language model inference by generating…

FeatureJun 16, 2026 · 2 min read

Meta launches AI Mode on Facebook, surfacing AI-synthesized answers from public posts across Groups and Reels

Meta introduced AI Mode on Facebook on June 16, 2026, a new search capability that replaces traditional link-based results with AI-generated answers drawn from public conversations happening across…

FeatureJun 16, 2026 · 2 min read

Cursor's Bugbot is now over 3x faster and 22% cheaper, powered by Composer 2.5

Cursor shipped a significant update to Bugbot on June 10, 2026, making its automated code review agent more than three times faster, 22% cheaper to run, and 10% more effective at catching bugs. The…

FeatureJun 16, 2026 · 3 min read

xAI expands Grok Imagine API with Files integration, persistent storage, and video generation preview

xAI has shipped a series of updates to its Grok Imagine API in June 2026, adding deep integration with the Files API, public URL sharing for stored assets, and a new image-to-video model — Grok…

xAI

FeatureJun 16, 2026 · 3 min read

Databricks makes ai_extract, ai_classify, and ai_query generally available, bringing AI into SQL pipelines without code

Databricks completed its native AI SQL function suite in June 2026, promoting aiextract, aiclassify, and aiquery to general availability in a series of releases between June 11 and June 15 — giving…

ReleaseJun 15, 2026 · 2 min read

Google deprecates Imagen 4 and Veo 2/3 models from the Gemini API, Veo shutdown in 15 days

Google announced on June 15 that it is deprecating all three Imagen 4 image generation models and three Veo video generation models from the Gemini API. The Imagen 4 family shuts down August 17,…

Google

ReleaseJun 15, 2026 · 2 min read

Anthropic retires Claude Sonnet 4 and Claude Opus 4, all requests to legacy model IDs now return errors

Anthropic retired two original Claude 4 models — Claude Sonnet 4 and Claude Opus 4 — effective June 15, 2026. All API requests to the model IDs claude-sonnet-4-20250514 and claude-opus-4-20250514 now…

Anthropic

FeatureJun 14, 2026 · 2 min read

Google launches Gemini 3.5 Flash GA and Managed Agents in the Gemini API in public preview on May 19

Google released Gemini 3.5 Flash as a generally available model on May 19, 2026, and simultaneously launched Managed Agents in the Gemini API in public preview — a hosted agent execution service that…

Google

FeatureJun 13, 2026 · 4 min read

Claude Code ships /fork command, letting developers branch AI sessions with full context inheritance starting in v2.1.161

Anthropic has shipped a /fork command to Claude Code, enabling developers to branch an AI coding session into a parallel subagent that inherits the full context of the existing conversation — a…

Anthropic

ReleaseJun 13, 2026 · 2 min read

Midjourney V8.1 becomes the default model, delivering 4x resolution and 4-second image generation

Midjourney set V8.1 as its default model on June 11, 2026, replacing V7 after user testing and feedback. The transition marks V8.1's general availability as the standard experience for all Midjourney…

ReleaseJun 13, 2026 · 2 min read

Google shuts down Gemini 2.0 Flash family on June 1, directing developers to Gemini 3.5 Flash

Google formally ended API access to all four Gemini 2.0 Flash models on June 1, 2026. Requests to gemini-2.0-flash, gemini-2.0-flash-001, gemini-2.0-flash-lite, and gemini-2.0-flash-lite-001 now…

Google

FeatureJun 11, 2026 · 2 min read

xAI integrates Files API with Imagine: public URLs, file-ID inputs, and persistent output storage

xAI updated its developer platform on June 10, 2026, closing the loop between the Files API and the Imagine image and video generation endpoints. Three interlocking changes let applications…

xAI

FeatureJun 10, 2026 · 3 min read

Anthropic ships fallbacks parameter for Claude Fable 5, automatically rerouting safety-refused requests to alternative models

Alongside the June 9, 2026, launch of Claude Fable 5 and Claude Mythos 5, Anthropic shipped a new beta API parameter called fallbacks that lets developers automatically reroute refused requests to a…

Anthropic

ReleaseJun 10, 2026 · 3 min read

xAI releases Grok Build: an agentic coding CLI and grok-build-0.1 API model with plan mode, parallel subagents, and MCP support

xAI launched Grok Build in May 2026, entering the agentic coding tool space alongside Anthropic's Claude Code, OpenAI's Codex CLI, and GitHub Copilot Workspace. The release packages two connected…

xAI

FeatureJun 10, 2026 · 2 min read

Google connects Gemini to Google Business Profile, giving small businesses AI powered by their own customer data

Google is integrating Gemini directly with Google Business Profile, giving small business owners an AI assistant that has access to their real operational data — customer reviews, search impressions,…

Google

ReleaseJun 10, 2026 · 2 min read

Anthropic retires Claude Sonnet 4 and Opus 4 on June 15, five days from now

Anthropic announced on June 5, 2026 that Claude Sonnet 4 (claude-sonnet-4-20250514) and Claude Opus 4 (claude-opus-4-20250514) will be retired from the Claude API on June 15, 2026—five days from…

Anthropic

FeatureJun 10, 2026 · 3 min read

Anthropic adds cron-scheduled agent runs and environment-variable secrets to Claude Managed Agents

Anthropic has shipped three developer-facing additions to its Claude Managed Agents platform alongside the June 9 Claude Fable 5 launch. The most significant is scheduled deployments: developers can…

Anthropic

ReleaseJun 10, 2026 · 4 min read

Anthropic launches Claude Fable 5 and Mythos 5: 1M-token context, always-on adaptive thinking, at $10/$50 per million tokens

Anthropic on June 9, 2026, launched Claude Fable 5 and Claude Mythos 5, its most capable publicly released models to date. Fable 5 is immediately available to all users across Claude.ai and the API;…

Anthropic

FeatureJun 10, 2026 · 2 min read

OpenAI adds image results to web search in the Responses API

OpenAI updated its Responses API on June 9, 2026, enabling web search to return image results alongside text. The change is live in the v1/responses endpoint and targets applications where a…

OpenAI

ReleaseJun 10, 2026 · 3 min read

Google releases Gemini 3.5 Live Translate, enabling real-time speech translation across 70+ languages in Meet, Google Translate, and developer APIs

Google has launched Gemini 3.5 Live Translate, a speech-to-speech translation model that handles more than 70 languages and 2,000 language pair combinations in real time. The model enters public…

Google

FeatureJun 8, 2026 · 3 min read

NVIDIA deepens UK AI commitment with £2 billion startup fund and 65 MW infrastructure push at London Tech Week

At London Tech Week on June 7, NVIDIA founder Jensen Huang and UK Prime Minister Keir Starmer declared that "the U.K. would be an AI maker, not an AI taker" — a commitment backed by a cluster of…

FeatureJun 8, 2026 · 2 min read

OpenAI adds inline moderation scoring to Responses API and Chat Completions, covering inputs and outputs in one call

OpenAI on June 4, 2026 added moderation scoring to both the Responses API and the Chat Completions API, enabling developers to check the safety of prompts and model outputs in a single generation…

OpenAI

FeatureJun 8, 2026 · 2 min read

NVIDIA and LG Group partner to build an AI factory spanning robotics, ADAS, and sovereign AI

NVIDIA and LG Group announced on June 7, 2026 a broad collaboration to build an AI factory designed to accelerate LG's businesses across robotics, autonomous driving, data center infrastructure, and…

FeatureJun 8, 2026 · 2 min read

NVIDIA and Doosan Group Expand Collaboration Across Physical AI, Robotics, and AI Factory Infrastructure

NVIDIA and Doosan Group announced an expanded collaboration on June 7, 2026, covering physical AI systems, industrial robotics, power infrastructure for AI factories, and substrate materials for AI…

FeatureJun 8, 2026 · 2 min read

Anthropic API: Refusals Without Output Now Free, Advisor Tool Gains Output Cap

Anthropic shipped two API changes on June 2, 2026 that reduce billing costs and improve control for developers running inference at scale. What's New No billing for zero-output refusals: On the…

Anthropic

ReleaseJun 7, 2026 · 2 min read

Google releases Gemini native image models to general availability, adds video-to-image generation

Google has moved two of its native Gemini image generation models from preview to general availability on the Gemini API: gemini-3.1-flash-image and gemini-3-pro-image. The GA release also introduces…

Google

ReleaseJun 7, 2026 · 2 min read

ElevenLabs releases Dubbing v2, an AI dubbing model that preserves speaker emotion across 90+ languages

ElevenLabs on May 28, 2026, released Dubbing v2, an AI dubbing model designed to preserve the emotional performance and vocal characteristics of the original speaker across more than 90 languages.…

ReleaseJun 7, 2026 · 3 min read

ElevenLabs releases Music v2 with section-by-section composition, sound-effects integration, and up to 50% price cuts

ElevenLabs on May 26, 2026, released Music v2, the next version of its AI music generation model. The update adds genre-transition support, embedded sound effects, an improved multilingual engine,…

FeatureJun 7, 2026 · 2 min read

OpenAI introduces Trusted Contact: a human-reviewed safety feature for ChatGPT users at risk of self-harm

OpenAI on June 7, 2026 introduced Trusted Contact — an optional, opt-in safety feature in ChatGPT that can notify a designated contact if automated systems and trained human reviewers determine a…

OpenAI

ReleaseJun 7, 2026 · 2 min read

OpenAI launches three new realtime voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper

OpenAI on June 7, 2026 published a detailed overview of three new realtime voice models now available in the API: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. The models advance…

OpenAI

ReleaseJun 7, 2026 · 2 min read

Google promotes Gemini 3.1 Flash Image and 3 Pro Image to general availability, adds video-to-image generation

Google moved two native visual generation models to general availability on May 28, 2026: Gemini 3.1 Flash Image (internally branded "Nano Banana 2") and Gemini 3 Pro Image ("Nano Banana Pro"). The…

Google

ReleaseJun 7, 2026 · 2 min read

Google shuts down Gemini 2.0 Flash and Lite models, urges migration to 3.x series

Google retired four Gemini 2.0 API models on June 1, 2026, ending support for the Flash and Flash Lite variants that anchored the 2.0 generation. Developers still hitting these endpoints receive…

Google

ReleaseJun 7, 2026 · 2 min read

NVIDIA Nemotron 3 Ultra Lands on Amazon SageMaker JumpStart with 1M-Token Context and 5x Agentic Inference Speed

Amazon Web Services on June 4, 2026 added NVIDIA Nemotron 3 Ultra to SageMaker JumpStart, giving cloud developers one-click access to a 550-billion-parameter reasoning model designed specifically for…

FeatureJun 7, 2026 · 2 min read

Anthropic API: Advisor Tool Gets Output Cap, Refusal-Only Requests Now Free

Anthropic on June 2 shipped two developer-facing improvements to the Claude API: a new parameter for capping the advisor tool's output per call, and a billing change that eliminates charges on API…

Anthropic

ReleaseJun 7, 2026 · 3 min read

Google I/O 2026: Gemini 3.5 Flash and Gemini Spark Debut as Gemini App Hits 900 Million Users

Google used its annual I/O developer conference on May 19, 2026 to announce Gemini 3.5 Flash — a frontier-class model designed for fast, low-cost agentic inference — and Gemini Spark, a 24/7 personal…

Google

FeatureJun 7, 2026 · 2 min read

OpenAI GPT-5.4 and GPT-5.5 arrive on Amazon Bedrock through an OpenAI-compatible Responses API endpoint

OpenAI's GPT-5.4 and GPT-5.5 became available on Amazon Bedrock on June 1, 2026, through an OpenAI-compatible Responses API endpoint. The move lets AWS customers access frontier OpenAI models within…

OpenAI

ReleaseJun 6, 2026 · 2 min read

Qwen3.6-27B: Alibaba open-sources a 27B dense model hitting 77% on SWE-bench Verified

Alibaba's Qwen team released Qwen3.6-27B on April 21, 2026, an open-weight vision-language model optimized for agentic coding. Despite its 27B parameter count — dense, not mixture-of-experts — it…

ReleaseJun 6, 2026 · 2 min read

DeepSeek launches V4-Pro and V4-Flash, retiring legacy API model names in July 2026

DeepSeek released V4-Pro and V4-Flash via its API on April 24, 2026, marking a new generation of its language model lineup. The update also sets a hard deprecation deadline: the long-standing…

FeatureJun 6, 2026 · 2 min read

OpenAI launches Lockdown Mode to block data exfiltration via prompt injection

OpenAI has launched Lockdown Mode, a new security feature for ChatGPT designed to reduce the risk of sensitive data being exfiltrated through prompt injection attacks. The feature is now rolling out…

OpenAI

FeatureJun 6, 2026 · 2 min read

OpenAI sets November 30 shutdown for Evals Platform and Agent Builder, deprecates three image models

OpenAI announced a round of platform deprecations in early June 2026, setting a November 30 deadline for the Evals Platform, Agent Builder, and reusable prompts, while also retiring three older image…

OpenAI

ReleaseJun 6, 2026 · 2 min read

Mistral launches Vibe, a unified agent platform with Work, Code, and Chat modes

Mistral launched Vibe on May 28, 2026, a unified agent platform that consolidates productivity, developer, and conversational AI tools under a single interface at chat.mistral.ai. What's new Vibe…

Mistral AI

ReleaseJun 6, 2026 · 2 min read

NVIDIA unveils RTX Spark superchip, bringing 1 petaflop of local AI performance to slim Windows PCs

NVIDIA unveiled RTX Spark on May 31, 2026, a purpose-built superchip that combines a Blackwell RTX GPU with a custom 20-core Grace CPU to deliver 1 petaflop of AI performance in slim laptops and…

FeatureJun 6, 2026 · 3 min read

Google launches Gemini for Science at I/O 2026 with multi-agent hypothesis generation and 30+ life science database integrations

Google unveiled Gemini for Science at Google I/O 2026 on May 19, 2026, announcing a suite of AI tools designed to accelerate scientific research workflows. Rather than a single purpose-trained model,…

Google

ReleaseJun 6, 2026 · 3 min read

xAI launches Grok Build, an agentic coding agent built on a dedicated 256k-context model

xAI launched Grok Build in May 2026, entering the coding agent market with a purpose-trained model — grok-build-0.1 — built specifically for software engineering workflows. The release positions xAI…

xAI

FeatureJun 6, 2026 · 3 min read

xAI adds Context Compaction to the Grok API, reducing cost and latency for long agent sessions

xAI released Context Compaction for the Grok API in late May 2026, providing developers with a server-side mechanism for trimming accumulated conversation histories without losing the context that…

xAI

ReleaseJun 6, 2026 · 2 min read

Google introduces Gemini Omni: multimodal video generation from any input type

Google introduced Gemini Omni at Google I/O 2026, a new multimodal model that accepts images, audio, video, and text in any combination and generates video as its primary output. It is Google's first…

Google

FeatureJun 6, 2026 · 2 min read

Anthropic releases MCP Tunnels and self-hosted sandboxes for private network deployments

Anthropic added two enterprise-oriented features on May 19, 2026: MCP Tunnels as a Research Preview, enabling connections to Model Context Protocol servers running inside private networks, and…

Anthropic

FeatureJun 6, 2026 · 2 min read

Anthropic removes billing for refusals and adds advisor tool output cap

Anthropic shipped two developer-facing improvements to the Claude API on June 2, 2026: a billing policy change that eliminates charges for zero-output refusals, and a new maxtokens parameter on the…

Anthropic

FeatureJun 6, 2026 · 2 min read

OpenAI extends default prompt cache retention to 24 hours for most API customers

OpenAI updated its default prompt caching behavior on May 29, 2026, changing the default promptcacheretention setting from inmemory to 24h for all organizations that do not have Zero Data Retention…

OpenAI

FeatureJun 6, 2026 · 2 min read

OpenAI launches Workload Identity Federation for keyless API authentication

OpenAI introduced Workload Identity Federation on May 26, 2026, a security capability that lets enterprise workloads authenticate to the OpenAI API without storing long-lived API keys. Trusted…

OpenAI

ReleaseJun 6, 2026 · 3 min read

Google launches Gemini 3.5 Flash and Gemini Omni at Google I/O 2026

Google launched Gemini 3.5 Flash on May 19, 2026 at Google I/O 2026, the first model in a new series it describes as combining frontier-level intelligence with Flash-tier speed. Alongside it, Google…

Google

FeatureJun 6, 2026 · 2 min read

Anthropic's web search tool now returns richer SEC filing data for financial and research agents

On May 18, 2026, Anthropic updated the web search tool in the Claude API to return richer SEC filing data. The change improves the tool's usefulness for financial research agents, earnings analysis…

Anthropic

FeatureJun 6, 2026 · 2 min read

Anthropic adds thinking_tokens breakdown to Messages API, exposing extended thinking costs per call

On May 27, 2026, Anthropic shipped a small but practically useful addition to the Claude API: responses now include a usage.outputtokensdetails.thinkingtokens field that reports exactly how many…

Anthropic

FeatureJun 6, 2026 · 3 min read

Anthropic Adds Advisor Tool Cost Controls and Drops Billing for Refused Requests

Anthropic on June 2, 2026, shipped two developer-facing API updates: a new cost-control parameter for the advisor tool, and a billing change that eliminates charges for requests the model refuses…

Anthropic

FeatureJun 6, 2026 · 3 min read

Anthropic ends billing for zero-output refusals and adds advisor tool token caps in June API update

Anthropic shipped two developer-facing Claude API changes on June 2, 2026: a maxtokens parameter for the advisor tool that caps advisor model output per call, and a billing policy change that…

Anthropic

FeatureJun 5, 2026 · 2 min read

OpenAI launches Rosalind Biodefense, giving vetted developers and U.S. government partners access to GPT-Rosalind for biodefense work

OpenAI on May 29, 2026 launched Rosalind Biodefense, a program that expands trusted access to GPT-Rosalind for vetted developers and U.S. government partners focused on biodefense, public health, and…

OpenAI

FeatureJun 5, 2026 · 3 min read

xAI adds image search to Grok's web search API, returning images as Markdown embeds in responses

xAI updated its web search capability on May 27, 2026 to support explicit image search. The addition allows Grok-powered applications to retrieve and embed relevant images as part of model responses,…

xAI

ReleaseJun 5, 2026 · 2 min read

xAI releases Grok Build 0.1, a fast model trained specifically for agentic coding, with an Agent Client Protocol for orchestration

xAI released Grok Build 0.1 on May 19, 2026, a model trained specifically for agentic coding tasks rather than general assistance. The release followed a May 14 beta launch of the Grok Build product,…

xAI

FeatureJun 5, 2026 · 2 min read

Anthropic deprecates Claude Opus 4.1, retiring it from the API on August 5, 2026

Anthropic announced on June 5, 2026 that Claude Opus 4.1 (claude-opus-4-1-20250805) is deprecated, with retirement from the Claude API scheduled for August 5, 2026. Teams still using Opus 4.1 have…

Anthropic

FeatureJun 5, 2026 · 3 min read

xAI ships Context Compaction API, WebSocket Responses mode, and Smart Turn STT detection in a single May 29 update

xAI shipped three API features on May 29, 2026 targeting production agentic workloads: a Context Compaction API for compressing long conversations into reusable shorter contexts, a WebSocket…

xAI

ReleaseJun 5, 2026 · 3 min read

Google releases Gemini 3.5 Flash to GA and launches Managed Agents in the Gemini API with the Antigravity Agent

Google made three connected announcements on May 19, 2026: Gemini 3.5 Flash reached general availability, Managed Agents launched in the Gemini API as a public preview, and Google shipped Antigravity…

Google

ReleaseJun 5, 2026 · 2 min read

NVIDIA releases Nemotron 3.5 Content Safety, unifying multimodal safety, multilingual coverage, and custom policy enforcement in a single 4B model

NVIDIA has released Nemotron 3.5 Content Safety, a 4-billion-parameter model that brings together multimodal input evaluation, support for 12 languages, custom enterprise policy enforcement, and…

ReleaseJun 5, 2026 · 2 min read

Google releases Gemini 3.1 Flash Image and 3 Pro Image as GA with exclusive video-to-image generation

Google moved gemini-3.1-flash-image and gemini-3-pro-image to general availability on May 28, 2026, bringing native visual generation to the Gemini API's stable tier. The flash model gains an…

Google

ReleaseJun 5, 2026 · 2 min read

Mistral releases Medium 3.5, a 128B open-weight coding model, and launches cloud remote agents in Vibe

Mistral AI on May 22, 2026 released Mistral Medium 3.5, a 128B open-weight dense model designed for coding, reasoning, and instruction-following in a single set of weights, alongside a new…

ReleaseJun 5, 2026 · 2 min read

Google introduces Gemini Omni: a video generation model that accepts any input and edits through conversation

Google launched Gemini Omni on May 29, 2026 — a new model built specifically for video generation from any input type, including images, audio, existing video, and text. Its primary differentiator is…

Google

FeatureJun 5, 2026 · 2 min read

OpenAI adds inline moderation scores to the Responses and Chat Completions APIs

OpenAI updated its Responses API and Chat Completions API on June 4, 2026 to support inline moderation scoring — letting developers receive policy-violation checks on both their input prompt and the…

OpenAI

FeatureJun 5, 2026 · 2 min read

Anthropic expands Claude Platform on AWS with managed agent webhooks, multiagent orchestration, and self-hosted sandboxes

Anthropic on May 29, 2026 extended its Claude Platform on AWS with three significant capabilities: webhook support for Claude Managed Agents, multiagent orchestration, and self-hosted sandboxes. The…

Anthropic

FeatureJun 5, 2026 · 3 min read

Anthropic launches Claude Partner Network Services Track and Partner Hub for enterprise SI firms

Anthropic on June 3 formally structured the services side of its Claude Partner Network, introducing a tiered Services Track and a public-facing Partner Hub — two moves that shift the company's…

Anthropic

FeatureJun 5, 2026 · 2 min read

Anthropic launches self-hosted sandboxes and MCP tunnels for Claude Managed Agents

Anthropic on May 19, 2026 launched two infrastructure features for Claude Managed Agents: self-hosted sandboxes and MCP tunnels. Both are in research preview and address enterprise concerns about…

Anthropic

FeatureJun 5, 2026 · 2 min read

Anthropic raises $65B Series H at $965B valuation as run-rate revenue tops $47B

Anthropic closed a $65 billion Series H funding round on May 28, 2026, at a $965 billion post-money valuation — the highest valuation ever recorded for a private AI company. The round was led by…

Anthropic

FeatureJun 4, 2026 · 2 min read

Google shuts down the Gemini 2.0 Flash family on the Gemini API

Google on June 1 shut down the entire Gemini 2.0 Flash family on the Gemini API, retiring gemini-2.0-flash, gemini-2.0-flash-001, gemini-2.0-flash-lite, and gemini-2.0-flash-lite-001 in a single…

Google

FeatureJun 4, 2026 · 3 min read

OpenAI starts rolling out Dreaming V3, a new memory synthesis system for ChatGPT

OpenAI began rolling out a new long-term memory architecture for ChatGPT on June 4, 2026. The system, which the company calls Dreaming V3, replaces the saved-memories plus Dreaming V0 combination…

OpenAI

FeatureJun 4, 2026 · 3 min read

Google launches AI Edge Gallery and Eloquent dictation on macOS, powered by Gemma 4 12B

Google quietly opened a new front in the on-device-AI race today, bringing two Mac-native apps out of the lab and onto everyday laptops. The Google Developers Blog announced that Google AI Edge…

Google

FeatureJun 4, 2026 · 3 min read

OpenAI launches six role-specific Codex plugins, Sites, and annotations for non-developer teams

OpenAI on June 2, 2026 launched six role-specific Codex plugins, for data analytics, creative production, sales, product design, public equity investing, and investment banking, alongside Sites, a…

OpenAI

ReleaseJun 4, 2026 · 2 min read

OpenAI breaks ground on The Barn, a 1 GW Stargate data center campus in Saline, Michigan

OpenAI broke ground on June 1, 2026 on a 1 gigawatt data center campus called The Barn in Saline, Michigan, the first publicly disclosed Stargate site east of Texas. Governor Gretchen Whitmer…

OpenAI

FeatureJun 4, 2026 · 3 min read

Anthropic launches Dynamic Workflows for Claude Code, orchestrating hundreds of parallel subagents per session

Anthropic on May 28, 2026 introduced Dynamic Workflows in Claude Code, a research preview that lets Claude plan and execute long-horizon coding tasks by orchestrating tens to hundreds of subagents…

Anthropic

ReleaseJun 4, 2026 · 3 min read

OpenAI's GPT-5.5, GPT-5.4, and Codex hit general availability on Amazon Bedrock

OpenAI's two frontier chat models and its Codex coding agent are now generally available on Amazon Bedrock. AWS announced the move on June 1, 2026, one month after the two companies expanded a…

OpenAI

ReleaseJun 4, 2026 · 3 min read

NVIDIA ships JetPack 7.2 with NemoClaw support, turning Jetson Orin and Thor into a production agentic-AI edge platform

NVIDIA shipped JetPack 7.2 with NemoClaw agent-skill support on 2026-06-01, framing the release as Jetson's transition from a developer board into a production-grade platform for agentic AI at the…

ReleaseJun 4, 2026 · 3 min read

Fundamental's NEXUS tabular foundation model launches on Amazon SageMaker JumpStart

Fundamental's NEXUS tabular foundation model became available on Amazon SageMaker JumpStart on 2026-06-03, giving AWS customers a one-click deployment path to a pre-trained model designed…

ReleaseJun 4, 2026 · 2 min read

Mayo Clinic and Microsoft team up on a Mayo-owned healthcare frontier AI model, distributed through Azure Foundry

Mayo Clinic and Microsoft announced a strategic collaboration on 2026-06-02 to build and deploy a frontier AI model purpose-built for healthcare. The model will be owned by Mayo Clinic and made…

ReleaseJun 4, 2026 · 3 min read

NVIDIA unveils Vera, an 88-core Arm CPU designed for agentic AI workloads

On May 31, 2026, NVIDIA announced Vera, its first standalone CPU positioned for the agentic-AI era. Vera pairs 88 NVIDIA-designed Olympus cores with high-bandwidth LPDDR5X memory and is being…

ReleaseJun 4, 2026 · 3 min read

NVIDIA ships Nemotron 3 Ultra, a 550B open MoE frontier model, on June 4

NVIDIA's largest open Nemotron model, Nemotron 3 Ultra, is set to land on June 4, 2026 as a NIM microservice and on the major open-weights hubs. NVIDIA framed Ultra at its May 31 software-and-agents…

FeatureJun 4, 2026 · 2 min read

Anthropic adds max_tokens cap to advisor tool and stops billing for empty refusals on Claude API

Anthropic shipped two developer-facing ergonomics changes to the Claude API on June 2, 2026, both posted in the company's official Claude Platform release notes. The advisor tool — the…

Anthropic

ReleaseJun 4, 2026 · 3 min read

Microsoft launches Scout, an always-on autopilot agent for Microsoft 365 built on OpenClaw

Microsoft on June 2, 2026 launched Scout, an always-on personal AI agent that lives inside Microsoft 365 and acts on a user's behalf without being prompted each turn. Microsoft is positioning Scout…

FeatureJun 4, 2026 · 3 min read

OpenAI updates GPT-Rosalind with new life sciences benchmarks and Codex plugins

OpenAI on June 3, 2026 shipped a major update to GPT-Rosalind, its life-sciences reasoning model, alongside two production plugins for Codex that turn the model's intelligence into repeatable…

OpenAI

FeatureJun 4, 2026 · 3 min read

xAI adds Smart Turn end-of-turn detection to the streaming Speech-to-Text API

xAI has added Smart Turn end-of-turn detection to its streaming Speech-to-Text API, a feature aimed at one of the most stubborn problems in voice interfaces — knowing when a speaker has actually…

xAI

FeatureJun 3, 2026 · 2 min read

OpenAI deprecates Prompts API, Evals platform, and Agent Builder with November 30 shutdown

OpenAI announced on June 3, 2026 the deprecation of three developer-platform products — reusable prompt objects (the v1/prompts API), the Evals platform, and Agent Builder — with a coordinated…

OpenAI

ReleaseJun 3, 2026 · 2 min read

Google releases Gemma 4 12B, an encoder-free multimodal open model sized for laptops

Google introduced Gemma 4 12B on June 3, 2026, a new open-weights multimodal model from Google DeepMind that is designed to run on consumer hardware while keeping benchmark performance close to a…

Google

FeatureJun 3, 2026 · 3 min read

Together AI ships MiniMax M3 inference with sparse-attention speedups for 1M-token context

Together AI on June 2, 2026 published an engineering deep-dive on serving MiniMax's forthcoming M3 model on its inference cloud, claiming order-of-magnitude speedups on the model's signature…

ReleaseJun 3, 2026 · 3 min read

Microsoft AI launches seven in-house MAI models at Build 2026, led by MAI-Thinking-1

Microsoft on June 2, 2026 used its Build developer conference to unveil seven foundation models built from scratch by its in-house Microsoft AI division — the first time the company has rolled out a…

FeatureJun 3, 2026 · 3 min read

Mistral launches Search Toolkit, an open-source framework for production AI search pipelines

Mistral released Search Toolkit on May 28, 2026, an open-source framework for building production search pipelines for AI applications. The framework ships with BM25 sparse retrieval, dense…

ReleaseJun 3, 2026 · 3 min read

H-company releases Holo3.1, a local-first computer-use agent family with up to 35B parameters

H-company released Holo3.1 on June 2, 2026, a family of computer-use agents in four sizes — 0.8B, 4B, 9B, and 35B-A3B (a 35-billion-parameter Mixture-of-Experts with roughly 3 billion active per…

ReleaseJun 3, 2026 · 3 min read

JetBrains releases Mellum2, an open 12B Mixture-of-Experts model for code

JetBrains released Mellum2 on June 1, 2026, a 12-billion-parameter Mixture-of-Experts (MoE) model trained from scratch on natural language and code, with only 2.5 billion parameters active per token.…

FeatureJun 3, 2026 · 2 min read

Cadence, Dassault, Siemens, and Synopsys adopt NVIDIA NemoClaw to ship autonomous AI engineers

At its GTC Taipei keynote at COMPUTEX, NVIDIA showcased more than a dozen industrial-software companies — including Cadence, Dassault Systèmes, Siemens and Synopsys — building autonomous AI engineers…

ReleaseJun 3, 2026 · 2 min read

NVIDIA unveils Cosmos 3 physical AI agent skills, Alpamayo 2 Super, and Isaac Sim 6.0 at CVPR

At CVPR 2026, NVIDIA unveiled a sweeping set of physical AI agent skills — runtime tools that let researchers stitch together world models, simulators, training and evaluation into a single…

FeatureJun 3, 2026 · 2 min read

NVIDIA and Microsoft unveil unified agentic AI stack across Windows devices, Azure, and local deployments

NVIDIA and Microsoft used Microsoft Build on June 2, 2026 to announce a coordinated agentic-AI stack that spans Windows PCs, Azure cloud, and local enterprise deployment, plus a concrete hardware…

ReleaseJun 3, 2026 · 2 min read

Mistral launches Vibe agent with Work and Code modes plus a VS Code plugin

Mistral launched Vibe on May 28, 2026 as a unified agent for long-horizon productivity and coding, packaging two modes — Work and Code — alongside a new VS Code extension. Pricing starts free, with a…

ReleaseJun 3, 2026 · 2 min read

Mistral launches Physics AI for engineering simulation with ASML, Airbus, Safran, and Siemens Energy

Mistral introduced a new class of models on May 27, 2026 called Physics AI — data-driven models that predict physical behavior directly from geometry and boundary conditions, running on a single GPU…

FeatureJun 3, 2026 · 1 min read

Anthropic launches Claude Platform on AWS

Anthropic launched Claude Platform on AWS on 2026-05-11, a new AWS-native surface for the Claude API that runs on Anthropic-managed infrastructure with AWS billing and IAM authentication. The…

Anthropic

ReleaseMay 28, 2026 · 1 min read

Anthropic releases Claude Opus 4.8

Anthropic released Claude Opus 4.8 on May 28, 2026, 41 days after Opus 4.7. The company says it improves on Opus 4.7 across benchmarks and ships at the same API price: $5 per million input tokens and…

Anthropic

ReleaseApr 30, 2026 · 1 min read

xAI launches Grok 4.3 on the API

xAI opened Grok 4.3 on its public API on April 30, 2026, after a SuperGrok Heavy beta that began April 17. xAI emphasizes improved agentic performance and a lower cost to run benchmark suites. On…

xAI

ReleaseApr 23, 2026 · 1 min read

OpenAI releases GPT-5.5 and GPT-5.5 Pro

OpenAI released GPT-5.5 and GPT-5.5 Pro on April 23, 2026, calling GPT-5.5 its "smartest and most intuitive to use model yet," with gains in agentic coding, knowledge work, math, and scientific…

OpenAI

ReleaseApr 16, 2026 · 1 min read

Anthropic ships Claude Opus 4.7

Anthropic announced general availability of Claude Opus 4.7 on April 16, 2026, with stronger coding and agentic performance than Opus 4.6 and higher-resolution vision. It reached 64.3% on SWE-bench…

Anthropic

ReleaseFeb 19, 2026 · 1 min read

Google announces Gemini 3.1 Pro

Google released Gemini 3.1 Pro in preview on February 19, 2026 - its first ".1" mid-cycle Gemini update - built for complex problem-solving and advanced reasoning. It scored 77.1% on ARC-AGI-2, which…

Google

FeatureDec 17, 2025 · 1 min read

Gemini 3 Flash becomes the default model in the Gemini app

On December 17, 2025, Google launched Gemini 3 Flash and made it the default model in the Gemini app and AI Mode in Search globally, replacing Gemini 2.5 Flash. Google positions it as Pro-grade…

Google