CoreWeave, the AI Hyperscaler™, today announced its MLPerf v5.0 results, setting a new industry benchmark in AI inference with NVIDIA GB200 Grace Blackwell Superchips. Using a CoreWeave instance with NVIDIA GB200, featuring two NVIDIA Grace CPUs and four NVIDIA Blackwell GPUs, CoreWeave delivered 800 tokens per second (TPS) on Llama 3.1 405B, one of the largest open-source models.
"CoreWeave is committed to delivering cutting-edge infrastructure optimized for large-model inference through our purpose-built cloud platform," said Peter Salanki, Chief Technology Officer at CoreWeave. "These benchmark MLPerf results reinforce CoreWeave's position as a preferred cloud provider for leading AI labs and enterprises."
CoreWeave also submitted new results for NVIDIA H200 GPU instances, achieving 33,000 TPS on the Llama 2 70B model, a 40 percent improvement in throughput over NVIDIA H100 instances.