Latest Baseten News & Updates

See the latest news and media coverage for Baseten. We track all announcements, press releases, and industry mentions in real time, all in one place.

Baseten

AI model inference platform

baseten.co

Headquarters: San Francisco, United States
Company type: Private company
Number of employees: 250–500

Last updated Today

Latest news about Baseten

In short: Baseten expanded its inference platform with frontier model support and technical optimizations while entering reported talks for a $1 billion raise.

Company announcements

June 22 Baseten

Baseten built the world's fastest API for GLM-5.2

It achieves over 280 tokens per second through optimizations like NVFP4 quantization, KV-aware routing, PD disaggregation, and Multi-Token Prediction.

June 22 Baseten

Baseten raised $1.5B in Series F at $13B valuation

Revenue grew 20x and inference volume 40x, with participation from Altimeter, Conviction, and Spark.

June 12 Baseten

Baseten introduces rolling deployments for zero-downtime model updates

The feature updates models incrementally without doubling GPU spend and offers pause, resume, and rollback controls.

June 11 Baseten

Baseten now hosts Inception's Mercury 2 diffusion LLM

Mercury 2 runs over 1000 tokens per second on NVIDIA GPUs, at half the cost of comparable models, with 90% cost reduction for Augment Code.

Media coverage

Yesterday Zawya

Nvidia Invests in AI Start-up Baseten. It Shows a Shift in the Market.

Nvidia invested $150 million in Baseten, which helps companies deploy and run large AI models.

Never miss news about Baseten

Track Baseten and your other target companies to get real-time alerts and weekly summaries delivered straight to your inbox.

Baseten competitors & trending companies

Browse news for competitors to Baseten and other trending companies.