Products & Tooling
Apps, platforms, developer tools, open-source releases, and pricing changes.
Apps, platforms, developer tools, open-source releases, and pricing changes.
Z.AI has released GLM-5.2, its latest open-weight flagship model for long-horizon tasks. The 753B-parameter model is available under an MIT license with no regional restrictions and ships with a…
OpenAI on June 14, 2026 launched its first formal partner program — the OpenAI Partner Network — backed by a $150 million investment designed to build a global ecosystem of certified AI…
Amazon Web Services launched the AWS FinOps Agent in public preview on June 9, 2026, an AI-powered agent designed to help engineering and finance teams investigate cloud cost anomalies, answer cost…
The City of Rio de Janeiro has released Rio 3.5 Open 397B, a 403-billion-parameter open-weights AI model built by IplanRIO, the municipal technology company that manages the city's digital…
Shanghai-based AI lab MiniMax released MiniMax-M3 on June 1, 2026 — a 427-billion parameter open-weight model with a 1-million-token context window, native multimodal understanding, and a new sparse…
DeepSeek has released DeepSeek-V4-Pro and DeepSeek-V4-Flash on Hugging Face, its most capable open-weight models to date. The V4-Pro variant is a 1.6 trillion total parameter mixture-of-experts model…
NVIDIA unveiled Cosmos 3 on May 31, 2026 at GTC Taipei — an open world foundation model for physical AI capable of natively understanding and generating text, images, video, ambient sound, and…
NVIDIA on June 4, 2026, launched Nemotron 3 Ultra, an open-weights 550-billion-parameter reasoning model designed for enterprise AI agents. The model delivers claimed "5x higher throughput compared…
Mastercard launched Agent Pay for Machines on June 10, 2026 — an open payments protocol designed to let AI agents and autonomous systems execute high-frequency, low-value transactions…
Moonshot AI has released Kimi K2.7 Code as an open-weights model on Hugging Face, extending the Kimi K2 lineage with a coding-specialist architecture that packs 1 trillion total parameters into a…
Cohere on June 9, 2026 released North Mini Code — the company's first agentic coding model and the opening release of its next generation of models. The model uses a mixture-of-experts architecture…
Amazon Web Services published Agent-EvalKit on June 11, 2026 — an open-source toolkit released under the Apache 2.0 license that gives developers a structured pipeline for evaluating AI agents by…
Anthropic on June 11, 2026, launched Claude Corps, a national fellowship program placing 1,000 early-career workers at nonprofits across the United States over the next year, backed by an initial…
NVIDIA detailed the production architecture of its Halos Operating System at GTC Taipei on June 10, 2026, announcing four simultaneous commercial robotaxi deployments across Munich, Taiwan, Southeast…
Google DeepMind has released DiffusionGemma, an experimental open-weights language model that generates text using a diffusion process rather than token-by-token prediction, achieving speeds up to…
At WWDC 2026 on June 8, Apple introduced the third generation of Apple Foundation Models (AFM 3), a family of five purpose-built AI models co-developed with Google that spans on-device inference and…
Google DeepMind kicked off its first dedicated robotics accelerator on June 9, 2026, gathering 15 selected European startups in London to begin a three-month program built around access to the…
Google shut down all four Gemini 2.0 Flash model variants on June 1, 2026, completing the retirement of the 2.0 Flash generation and clearing a path for Gemini 3.5 Flash as the standard…
OpenAI updated its container session billing on June 2, 2026, moving from a flat 20-minute session rate to per-minute billing with a 5-minute minimum. The change reduces costs for developers whose…
OpenAI changed how container sessions are billed on the API, swapping the flat 20-minute session charge for per-minute billing with a 5-minute minimum. The shift trims cost for short-running…
Gemini 3.5 Flash is priced at $1.50 per million input tokens and $9 per million output tokens - 3x the $0.50 / $3 of Gemini 3 Flash, but still roughly 40% cheaper than Gemini 3.1 Pro. Google has…
GPT-5.5 became available to API developers on April 24, 2026 at $5 per million input tokens and $30 per million output tokens with a 1M-token context window. GPT-5.5 Pro is priced at $30 per million…