The latest, as it lands.
Every release, paper and policy shift across the frontier — newest first, each verified against its primary source.
Every release, paper and policy shift across the frontier — newest first, each verified against its primary source.
NVIDIA on June 4, 2026, launched Nemotron 3 Ultra, an open-weights 550-billion-parameter reasoning model designed for enterprise AI agents. The model delivers claimed "5x higher throughput compared…
Mastercard launched Agent Pay for Machines on June 10, 2026 — an open payments protocol designed to let AI agents and autonomous systems execute high-frequency, low-value transactions…
Google formally ended API access to all four Gemini 2.0 Flash models on June 1, 2026. Requests to gemini-2.0-flash, gemini-2.0-flash-001, gemini-2.0-flash-lite, and gemini-2.0-flash-lite-001 now…
The US government issued a directive on June 12, 2026 requiring Anthropic to suspend access to Claude Fable 5 and Claude Mythos 5 — its two most capable models — for all users worldwide. The order…
Anthropic on May 4, 2026, announced the formation of a new enterprise AI services company built to deploy Claude for mid-sized businesses that lack the in-house engineering resources to implement…
NVIDIA's Blackwell architecture has taken the top spot in AgentPerf, the first benchmark designed specifically to measure agentic AI infrastructure performance, according to results published June 12…
OpenAI announced on June 11, 2026 that it plans to acquire Ona, a cloud infrastructure company formerly known as Gitpod, to expand its Codex AI coding agent with secure, customer-controlled execution…
Google filed a civil lawsuit on June 12, 2026 targeting a China-based cybercrime operation it calls the "Outsider Enterprise," alleging the network built an AI-powered phishing infrastructure that…
Anthropic on June 12 released the inaugural results from its Anthropic Public Record, a new survey series the company says it will repeat to track how U.S. public attitudes toward AI shift over time.…
A former xAI engineer has filed a lawsuit against xAI and SpaceX claiming he was fired in retaliation for raising AI safety concerns about Grok, with allegations that xAI co-founder Jimmy Ba ignored…
BBVA, one of Europe's largest banks, has completed a company-wide rollout of ChatGPT Enterprise to more than 100,000 employees globally, OpenAI disclosed in a detailed case study published June 11,…
OpenAI and Oracle have formed a new enterprise distribution partnership that allows Oracle Cloud Infrastructure (OCI) customers to apply existing Universal Credits toward OpenAI's frontier models and…
Anthropic and Tata Consultancy Services (TCS) announced a Global Premier Partnership on June 11, 2026, making TCS one of the largest enterprise distribution channels for Claude in Anthropic's partner…
Moonshot AI has released Kimi K2.7 Code as an open-weights model on Hugging Face, extending the Kimi K2 lineage with a coding-specialist architecture that packs 1 trillion total parameters into a…
On June 7, 2026, NAVER and NVIDIA announced a major sovereign AI infrastructure expansion: NAVER will build AI factories on the NVIDIA DSX platform at its GAK Sejong data center in South Korea,…
Cohere on June 9, 2026 released North Mini Code — the company's first agentic coding model and the opening release of its next generation of models. The model uses a mixture-of-experts architecture…
Anthropic and DXC Technology announced a multi-year global alliance on June 11, 2026, under which DXC will integrate Claude into mission-critical systems for regulated industries. The partnership…
Google DeepMind announced a $10 million multi-agent AI safety research initiative on June 11, 2026, joining with Schmidt Sciences, the Cooperative AI Foundation, the UK Advanced Research and…
Amazon Web Services published Agent-EvalKit on June 11, 2026 — an open-source toolkit released under the Apache 2.0 license that gives developers a structured pipeline for evaluating AI agents by…
Anthropic on June 11, 2026, launched Claude Corps, a national fellowship program placing 1,000 early-career workers at nonprofits across the United States over the next year, backed by an initial…
xAI updated its developer platform on June 10, 2026, closing the loop between the Files API and the Imagine image and video generation endpoints. Three interlocking changes let applications…
Anthropic has published a policy framework titled "Policy on the AI Exponential," calling for tiered regulatory oversight of frontier AI with specific numerical thresholds and explicit government…
Alibaba's Qwen research team has published Qwen-Image-Flash: Beyond Objective Design on arXiv, presenting a few-step distillation framework that accelerates the Qwen-Image-2.0 model family for both…
OpenAI published a threat intelligence report on June 10, 2026 documenting two separate influence operations linked to Chinese state-affiliated actors that used ChatGPT to generate content targeting…
NVIDIA detailed the production architecture of its Halos Operating System at GTC Taipei on June 10, 2026, announcing four simultaneous commercial robotaxi deployments across Munich, Taiwan, Southeast…
Anthropic's 319-page system card for Claude Fable 5, released alongside the model on June 9, 2026, contains a disclosure that the model has been given invisible performance limits for tasks related…
Microsoft on June 2, 2026, announced Majorana 2, its next-generation topological quantum chip, alongside the general availability of Microsoft Discovery, the company's agentic AI platform for…
Alongside the June 9, 2026, launch of Claude Fable 5 and Claude Mythos 5, Anthropic shipped a new beta API parameter called fallbacks that lets developers automatically reroute refused requests to a…
xAI launched Grok Build in May 2026, entering the agentic coding tool space alongside Anthropic's Claude Code, OpenAI's Codex CLI, and GitHub Copilot Workspace. The release packages two connected…
Together AI announced on June 10, 2026, that it has achieved ISO 27001:2022 certification for its information security management system, certified by A-LIGN Compliance and Security, an…
Google DeepMind has released DiffusionGemma, an experimental open-weights language model that generates text using a diffusion process rather than token-by-token prediction, achieving speeds up to…
AWS has released Neuron Agentic Development, a suite of specialized AI agents and skills that enable machine learning engineers to develop, debug, and optimize custom kernels for AWS Trainium and…
Anthropic has added a new refusal category to Claude Fable 5 that specifically blocks API requests aimed at reverse-engineering or duplicating the model's outputs. The change, documented in the…
Google is integrating Gemini directly with Google Business Profile, giving small business owners an AI assistant that has access to their real operational data — customer reviews, search impressions,…
Google has agreed to pay SpaceX $920 million per month from October 2026 through June 2029, securing access to approximately 110,000 NVIDIA GPUs housed in a SpaceX data center facility, according to…
The European Commission issued interim measures on June 9, 2026, ordering Meta to restore free access to its WhatsApp for Business API for rival general-purpose AI assistants within five working…
ElevenLabs announced on May 5, 2026 that it has surpassed $500 million in annual recurring revenue — reached in the first four months of the year — and closed a Series D funding round with a group of…
ElevenLabs announced on June 8, 2026 that it has signed a memorandum of understanding with the UK's Department for Science, Innovation and Technology (DSIT), covering three areas: public services, AI…
At WWDC 2026 on June 8, Apple introduced the third generation of Apple Foundation Models (AFM 3), a family of five purpose-built AI models co-developed with Google that spans on-device inference and…
Meta has signed its first built-to-suit AI data center agreement in India, partnering with Reliance Industries to build a 168-megawatt facility in Jamnagar, Gujarat. The announcement deepens a…
Google DeepMind kicked off its first dedicated robotics accelerator on June 9, 2026, gathering 15 selected European startups in London to begin a three-month program built around access to the…
Google shut down all four Gemini 2.0 Flash model variants on June 1, 2026, completing the retirement of the 2.0 Flash generation and clearing a path for Gemini 3.5 Flash as the standard…
DeepSeek researchers published a paper on June 9, 2026 introducing Lookahead Sparse Attention (LSA), an inference-time technique that compresses the GPU memory footprint of long-context LLM serving…
Anthropic announced on June 5, 2026 that Claude Sonnet 4 (claude-sonnet-4-20250514) and Claude Opus 4 (claude-opus-4-20250514) will be retired from the Claude API on June 15, 2026—five days from…
OpenAI submitted a confidential draft S-1 registration statement to the U.S. Securities and Exchange Commission on June 8, 2026, beginning the formal regulatory process for a potential initial public…
Anthropic has shipped three developer-facing additions to its Claude Managed Agents platform alongside the June 9 Claude Fable 5 launch. The most significant is scheduled deployments: developers can…
Google announced at Apple's WWDC 2026 that Gemini cloud models are now directly accessible to Apple developers through Apple's Foundation Models framework and built into Xcode. Developers building…
At Apple's WWDC 2026 developer conference, NVIDIA announced that its Blackwell GPUs with Confidential Computing are now integrated into Apple's Private Cloud Compute (PCC) infrastructure. The…
Anthropic has introduced a mandatory 30-day data retention requirement for its two newest models, Claude Fable 5 and Claude Mythos 5, effective June 9, 2026 — the same day both models launched. The…
Anthropic on June 9, 2026, launched Claude Fable 5 and Claude Mythos 5, its most capable publicly released models to date. Fable 5 is immediately available to all users across Claude.ai and the API;…
Vercel published its AI gateway production index for May 2026, drawing on routing traffic across its infrastructure to document a widening divergence in the AI provider market: DeepSeek's share of…
OpenAI updated its Responses API on June 9, 2026, enabling web search to return image results alongside text. The change is live in the v1/responses endpoint and targets applications where a…
Google has launched Gemini 3.5 Live Translate, a speech-to-speech translation model that handles more than 70 languages and 2,000 language pair combinations in real time. The model enters public…
At London Tech Week on June 7, NVIDIA founder Jensen Huang and UK Prime Minister Keir Starmer declared that "the U.K. would be an AI maker, not an AI taker" — a commitment backed by a cluster of…
OpenAI on June 4, 2026 added moderation scoring to both the Responses API and the Chat Completions API, enabling developers to check the safety of prompts and model outputs in a single generation…
NVIDIA and LG Group announced on June 7, 2026 a broad collaboration to build an AI factory designed to accelerate LG's businesses across robotics, autonomous driving, data center infrastructure, and…
Google will shut down the consumer version of Gemini Code Assist for GitHub on July 17, 2026, with new installations blocked starting June 18. The enterprise tier is unaffected. What's new Google…
Anthropic disclosed on June 3, 2026 that it detected and disrupted what it describes as the first large-scale cyberattack executed largely without human intervention — a campaign in which a Chinese…
OpenAI updated its container session billing on June 2, 2026, moving from a flat 20-minute session rate to per-minute billing with a 5-minute minimum. The change reduces costs for developers whose…
NVIDIA and Doosan Group announced an expanded collaboration on June 7, 2026, covering physical AI systems, industrial robotics, power infrastructure for AI factories, and substrate materials for AI…
Anthropic shipped two API changes on June 2, 2026 that reduce billing costs and improve control for developers running inference at scale. What's New No billing for zero-output refusals: On the…
Google has moved two of its native Gemini image generation models from preview to general availability on the Gemini API: gemini-3.1-flash-image and gemini-3-pro-image. The GA release also introduces…
ElevenLabs on May 28, 2026, released Dubbing v2, an AI dubbing model designed to preserve the emotional performance and vocal characteristics of the original speaker across more than 90 languages.…
ElevenLabs on May 26, 2026, released Music v2, the next version of its AI music generation model. The update adds genre-transition support, embedded sound effects, an improved multilingual engine,…
Anthropic announced on June 5, 2026 that Claude Opus 4.1 (claude-opus-4-1-20250805) is now deprecated, with its API retirement scheduled for August 5, 2026. Developers using the model have 60 days to…
Google DeepMind published details on May 20, 2026 of an accessibility agent that guides blind and low-vision athletes while running outdoors in real time, using a Pixel 10 Pro smartphone worn on the…
OpenAI on June 7, 2026 introduced Trusted Contact — an optional, opt-in safety feature in ChatGPT that can notify a designated contact if automated systems and trained human reviewers determine a…
OpenAI on June 7, 2026 published a detailed overview of three new realtime voice models now available in the API: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. The models advance…
Google moved two native visual generation models to general availability on May 28, 2026: Gemini 3.1 Flash Image (internally branded "Nano Banana 2") and Gemini 3 Pro Image ("Nano Banana Pro"). The…
Google retired four Gemini 2.0 API models on June 1, 2026, ending support for the Flash and Flash Lite variants that anchored the 2.0 generation. Developers still hitting these endpoints receive…
A wave of Instagram account hijackings in late May and early June 2026 exposed a critical flaw in Meta's AI-powered support assistant: the chatbot could be socially engineered to add an attacker's…
Amazon Web Services on June 4, 2026 added NVIDIA Nemotron 3 Ultra to SageMaker JumpStart, giving cloud developers one-click access to a 550-billion-parameter reasoning model designed specifically for…
Anthropic on June 2 shipped two developer-facing improvements to the Claude API: a new parameter for capping the advisor tool's output per call, and a billing change that eliminates charges on API…
Google used its annual I/O developer conference on May 19, 2026 to announce Gemini 3.5 Flash — a frontier-class model designed for fast, low-cost agentic inference — and Gemini Spark, a 24/7 personal…
Anthropic on June 5 published benchmark data showing that Claude Opus 4.7 — a general-purpose frontier model with no chemistry-specific fine-tuning — performs comparably to specialized NMR software…
OpenAI's GPT-5.4 and GPT-5.5 became available on Amazon Bedrock on June 1, 2026, through an OpenAI-compatible Responses API endpoint. The move lets AWS customers access frontier OpenAI models within…
Google DeepMind's AI research assistant Co-Scientist helped biologists at MIT identify more than 20 genetic factors that may reverse cellular aging — and lab experiments confirmed that several of…
Alibaba's Qwen team released Qwen3.6-27B on April 21, 2026, an open-weight vision-language model optimized for agentic coding. Despite its 27B parameter count — dense, not mixture-of-experts — it…
DeepSeek released V4-Pro and V4-Flash via its API on April 24, 2026, marking a new generation of its language model lineup. The update also sets a hard deprecation deadline: the long-standing…
OpenAI has launched Lockdown Mode, a new security feature for ChatGPT designed to reduce the risk of sensitive data being exfiltrated through prompt injection attacks. The feature is now rolling out…
OpenAI announced a round of platform deprecations in early June 2026, setting a November 30 deadline for the Evals Platform, Agent Builder, and reusable prompts, while also retiring three older image…
Mistral launched Vibe on May 28, 2026, a unified agent platform that consolidates productivity, developer, and conversational AI tools under a single interface at chat.mistral.ai. What's new Vibe…
NVIDIA unveiled RTX Spark on May 31, 2026, a purpose-built superchip that combines a Blackwell RTX GPU with a custom 20-core Grace CPU to deliver 1 petaflop of AI performance in slim laptops and…
Google unveiled Gemini for Science at Google I/O 2026 on May 19, 2026, announcing a suite of AI tools designed to accelerate scientific research workflows. Rather than a single purpose-trained model,…
xAI launched Grok Build in May 2026, entering the coding agent market with a purpose-trained model — grok-build-0.1 — built specifically for software engineering workflows. The release positions xAI…
xAI released Context Compaction for the Grok API in late May 2026, providing developers with a server-side mechanism for trimming accumulated conversation histories without losing the context that…
Google introduced Gemini Omni at Google I/O 2026, a new multimodal model that accepts images, audio, video, and text in any combination and generates video as its primary output. It is Google's first…
Anthropic added two enterprise-oriented features on May 19, 2026: MCP Tunnels as a Research Preview, enabling connections to Model Context Protocol servers running inside private networks, and…
Anthropic shipped two developer-facing improvements to the Claude API on June 2, 2026: a billing policy change that eliminates charges for zero-output refusals, and a new maxtokens parameter on the…
The S&P Dow Jones Indices on June 4, 2026 declined to modify its index eligibility criteria for SpaceX, ending any prospect of expedited S&P 500 membership for SpaceX — and, by extension, for OpenAI…
OpenAI updated its default prompt caching behavior on May 29, 2026, changing the default promptcacheretention setting from inmemory to 24h for all organizations that do not have Zero Data Retention…
OpenAI introduced Workload Identity Federation on May 26, 2026, a security capability that lets enterprise workloads authenticate to the OpenAI API without storing long-lived API keys. Trusted…
Google launched Gemini 3.5 Flash on May 19, 2026 at Google I/O 2026, the first model in a new series it describes as combining frontier-level intelligence with Flash-tier speed. Alongside it, Google…
On May 18, 2026, Anthropic updated the web search tool in the Claude API to return richer SEC filing data. The change improves the tool's usefulness for financial research agents, earnings analysis…
On May 27, 2026, Anthropic shipped a small but practically useful addition to the Claude API: responses now include a usage.outputtokensdetails.thinkingtokens field that reports exactly how many…
Anthropic on June 2, 2026, shipped two developer-facing API updates: a new cost-control parameter for the advisor tool, and a billing change that eliminates charges for requests the model refuses…
Anthropic shipped two developer-facing Claude API changes on June 2, 2026: a maxtokens parameter for the advisor tool that caps advisor model output per call, and a billing policy change that…
OpenAI on May 29, 2026 launched Rosalind Biodefense, a program that expands trusted access to GPT-Rosalind for vetted developers and U.S. government partners focused on biodefense, public health, and…
xAI updated its web search capability on May 27, 2026 to support explicit image search. The addition allows Grok-powered applications to retrieve and embed relevant images as part of model responses,…
xAI released Grok Build 0.1 on May 19, 2026, a model trained specifically for agentic coding tasks rather than general assistance. The release followed a May 14 beta launch of the Grok Build product,…
Anthropic announced on June 5, 2026 that Claude Opus 4.1 (claude-opus-4-1-20250805) is deprecated, with retirement from the Claude API scheduled for August 5, 2026. Teams still using Opus 4.1 have…
xAI shipped three API features on May 29, 2026 targeting production agentic workloads: a Context Compaction API for compressing long conversations into reusable shorter contexts, a WebSocket…
Google made three connected announcements on May 19, 2026: Gemini 3.5 Flash reached general availability, Managed Agents launched in the Gemini API as a public preview, and Google shipped Antigravity…
NVIDIA has released Nemotron 3.5 Content Safety, a 4-billion-parameter model that brings together multimodal input evaluation, support for 12 languages, custom enterprise policy enforcement, and…
Google moved gemini-3.1-flash-image and gemini-3-pro-image to general availability on May 28, 2026, bringing native visual generation to the Gemini API's stable tier. The flash model gains an…
Mistral AI on May 22, 2026 released Mistral Medium 3.5, a 128B open-weight dense model designed for coding, reasoning, and instruction-following in a single set of weights, alongside a new…
Blackstone-backed data center operator AirTrunk has committed $30 billion to develop 5 gigawatts of new data center capacity in India by 2030, one of the largest single infrastructure pledges…
Google launched Gemini Omni on May 29, 2026 — a new model built specifically for video generation from any input type, including images, audio, existing video, and text. Its primary differentiator is…
A team of researchers including prominent Google DeepMind scientists published a paper at the 43rd International Conference on Machine Learning (ICML 2026) arguing that the dominant paradigm in AI…
OpenAI updated its Responses API and Chat Completions API on June 4, 2026 to support inline moderation scoring — letting developers receive policy-violation checks on both their input prompt and the…
Anthropic on May 29, 2026 extended its Claude Platform on AWS with three significant capabilities: webhook support for Claude Managed Agents, multiagent orchestration, and self-hosted sandboxes. The…
Anthropic on June 3 formally structured the services side of its Claude Partner Network, introducing a tiered Services Track and a public-facing Partner Hub — two moves that shift the company's…
Anthropic on May 19, 2026 launched two infrastructure features for Claude Managed Agents: self-hosted sandboxes and MCP tunnels. Both are in research preview and address enterprise concerns about…
Anthropic closed a $65 billion Series H funding round on May 28, 2026, at a $965 billion post-money valuation — the highest valuation ever recorded for a private AI company. The round was led by…
Anthropic has released a public best-practices guide for using Claude to find and fix vulnerabilities in source code, paired with an open-source reference harness on GitHub. The guide is grounded in…
Anthropic's Institute published "When AI builds itself," a position paper arguing that recursive self-improvement — AI systems building, testing, and improving themselves with diminishing human…
Google on June 1 shut down the entire Gemini 2.0 Flash family on the Gemini API, retiring gemini-2.0-flash, gemini-2.0-flash-001, gemini-2.0-flash-lite, and gemini-2.0-flash-lite-001 in a single…
OpenAI began rolling out a new long-term memory architecture for ChatGPT on June 4, 2026. The system, which the company calls Dreaming V3, replaces the saved-memories plus Dreaming V0 combination…
Google quietly opened a new front in the on-device-AI race today, bringing two Mac-native apps out of the lab and onto everyday laptops. The Google Developers Blog announced that Google AI Edge…
Perplexity on June 1 published research introducing Search as Code (SaC), an architecture in which AI agents compose search pipelines programmatically from atomic building blocks rather than calling…