NewsroomProducts & Tooling

Products & Tooling

Apps, platforms, developer tools, open-source releases, and pricing changes.

22 stories

Earlier

Open sourceJun 16, 2026 · 2 min read

Z.AI releases GLM-5.2: a 753B open-weight model with 1M context and 2.9x FLOPs reduction via IndexShare

Z.AI has released GLM-5.2, its latest open-weight flagship model for long-horizon tasks. The 753B-parameter model is available under an MIT license with no regional restrictions and ships with a…

ProductJun 15, 2026 · 2 min read

OpenAI launches Partner Network with $150M investment to certify 300,000 enterprise consultants

OpenAI on June 14, 2026 launched its first formal partner program — the OpenAI Partner Network — backed by a $150 million investment designed to build a global ecosystem of certified AI…

OpenAI

ProductJun 15, 2026 · 3 min read

AWS launches FinOps Agent in public preview, bringing AI-driven cost investigation to cloud teams

Amazon Web Services launched the AWS FinOps Agent in public preview on June 9, 2026, an AI-powered agent designed to help engineering and finance teams investigate cloud cost anomalies, answer cost…

Open sourceJun 15, 2026 · 2 min read

Rio de Janeiro's city government releases Rio 3.5 Open 397B: a 403-billion-parameter open model built on Qwen

The City of Rio de Janeiro has released Rio 3.5 Open 397B, a 403-billion-parameter open-weights AI model built by IplanRIO, the municipal technology company that manages the city's digital…

Open sourceJun 15, 2026 · 2 min read

MiniMax releases M3: open-weight 427B model with 1M context, native multimodality, and 9x long-context speedup

Shanghai-based AI lab MiniMax released MiniMax-M3 on June 1, 2026 — a 427-billion parameter open-weight model with a 1-million-token context window, native multimodal understanding, and a new sparse…

Open sourceJun 14, 2026 · 2 min read

DeepSeek releases V4-Pro and V4-Flash on Hugging Face: 1.6T-parameter open-weight models with 1M context and MIT license

DeepSeek has released DeepSeek-V4-Pro and DeepSeek-V4-Flash on Hugging Face, its most capable open-weight models to date. The V4-Pro variant is a 1.6 trillion total parameter mixture-of-experts model…

Open sourceJun 13, 2026 · 3 min read

NVIDIA launches Cosmos 3, the first fully open omnimodel for physical AI, at GTC Taipei

NVIDIA unveiled Cosmos 3 on May 31, 2026 at GTC Taipei — an open world foundation model for physical AI capable of natively understanding and generating text, images, video, ambient sound, and…

Open sourceJun 13, 2026 · 2 min read

NVIDIA releases Nemotron 3 Ultra 550B: open reasoning model with 5x throughput advantage, targeting long-running agents

NVIDIA on June 4, 2026, launched Nemotron 3 Ultra, an open-weights 550-billion-parameter reasoning model designed for enterprise AI agents. The model delivers claimed "5x higher throughput compared…

ProductJun 13, 2026 · 3 min read

Mastercard launches Agent Pay for Machines, an open payments protocol for AI agents with microtransaction support and 30+ partners including Coinbase, Stripe, and Adyen

Mastercard launched Agent Pay for Machines on June 10, 2026 — an open payments protocol designed to let AI agents and autonomous systems execute high-frequency, low-value transactions…

Open sourceJun 12, 2026 · 3 min read

Moonshot AI releases Kimi K2.7 Code on Hugging Face: a 1-trillion-parameter MoE coding model with 32B active weights

Moonshot AI has released Kimi K2.7 Code as an open-weights model on Hugging Face, extending the Kimi K2 lineage with a coding-specialist architecture that packs 1 trillion total parameters into a…

Open sourceJun 12, 2026 · 3 min read

Cohere releases North Mini Code, an open-source agentic coding model with 30B parameters and 3B active on a mixture-of-experts architecture

Cohere on June 9, 2026 released North Mini Code — the company's first agentic coding model and the opening release of its next generation of models. The model uses a mixture-of-experts architecture…

Open sourceJun 11, 2026 · 3 min read

AWS releases Agent-EvalKit, an open-source toolkit for systematic AI agent evaluation

Amazon Web Services published Agent-EvalKit on June 11, 2026 — an open-source toolkit released under the Apache 2.0 license that gives developers a structured pipeline for evaluating AI agents by…

ProductJun 11, 2026 · 2 min read

Anthropic launches Claude Corps with $150M commitment, placing 1,000 AI fellows at US nonprofits

Anthropic on June 11, 2026, launched Claude Corps, a national fellowship program placing 1,000 early-career workers at nonprofits across the United States over the next year, backed by an initial…

Anthropic

ProductJun 10, 2026 · 3 min read

NVIDIA Halos OS brings ISO 26262-certified safety stack to commercial robotaxis, with four active global deployments

NVIDIA detailed the production architecture of its Halos Operating System at GTC Taipei on June 10, 2026, announcing four simultaneous commercial robotaxi deployments across Munich, Taiwan, Southeast…

Open sourceJun 10, 2026 · 2 min read

Google DeepMind's DiffusionGemma arrives with NVIDIA day-zero RTX support, generating text 4x faster than autoregressive models

Google DeepMind has released DiffusionGemma, an experimental open-weights language model that generates text using a diffusion process rather than token-by-token prediction, achieving speeds up to…

Google

ProductJun 10, 2026 · 3 min read

Apple launches third-generation Foundation Models at WWDC 2026: five models from 3B on-device sparse to Pro cloud, co-built with Google

At WWDC 2026 on June 8, Apple introduced the third generation of Apple Foundation Models (AFM 3), a family of five purpose-built AI models co-developed with Google that spans on-device inference and…

ProductJun 10, 2026 · 3 min read

Google DeepMind launches robotics accelerator for 15 European startups, offering Gemini Robotics model access

Google DeepMind kicked off its first dedicated robotics accelerator on June 9, 2026, gathering 15 selected European startups in London to begin a three-month program built around access to the…

Google

ProductJun 10, 2026 · 2 min read

Google Retires All Gemini 2.0 Flash Models, Directs Developers to Gemini 3.5 Flash

Google shut down all four Gemini 2.0 Flash model variants on June 1, 2026, completing the retirement of the 2.0 Flash generation and clearing a path for Gemini 3.5 Flash as the standard…

Google

PricingJun 8, 2026 · 2 min read

OpenAI switches container sessions to per-minute billing with a 5-minute minimum, cutting costs for short runs

OpenAI updated its container session billing on June 2, 2026, moving from a flat 20-minute session rate to per-minute billing with a 5-minute minimum. The change reduces costs for developers whose…

OpenAI

PricingJun 3, 2026 · 1 min read

OpenAI moves API container sessions to per-minute billing with a 5-minute minimum

OpenAI changed how container sessions are billed on the API, swapping the flat 20-minute session charge for per-minute billing with a 5-minute minimum. The shift trims cost for short-running…

OpenAI

PricingMay 19, 2026 · 1 min read

Gemini 3.5 Flash triples Flash pricing to $1.50 / $9

Gemini 3.5 Flash is priced at $1.50 per million input tokens and $9 per million output tokens - 3x the $0.50 / $3 of Gemini 3 Flash, but still roughly 40% cheaper than Gemini 3.1 Pro. Google has…

Google

PricingApr 24, 2026 · 1 min read

GPT-5.5 lands in the API at $5 / $30 per million tokens

GPT-5.5 became available to API developers on April 24, 2026 at $5 per million input tokens and $30 per million output tokens with a 1M-token context window. GPT-5.5 Pro is priced at $30 per million…

OpenAI