Tech
Apr 24, 2026
Meta Signs Deal for Millions of Amazon Graviton CPUs to Power AI Agents
Meta announced a multi‑year agreement to run its AI workloads on millions of Amazon Graviton ARM‑ba…
Meta announced on April 24, 2026 that it will run its AI workloads on millions of AWS Graviton ARM‑based CPUs, marking a strategic shift from GPU‑centric training to CPU‑optimized inference for AI agents.Meta Chooses AWS Graviton CPUs for AI Agent WorkloadsThe agreement leverages the latest generation of Graviton, which Amazon says is tuned for “real‑time reasoning, code generation, search and multi‑step task coordination.” Unlike traditional GPUs, these CPUs handle the compute‑intensive inference phase that follows model training.Scale of the Deal and Financial ImplicationsMillions of Graviton chips will be provisioned for Meta’s AI services.The partnership redirects a portion of Meta’s cloud spend back to AWS, contrasting with its prior $10 billion six‑year contract with Google Cloud.Earlier in 2026, Anthropic committed $100 billion over ten years to run on AWS Trainium, with Amazon investing an additional $5 billion (total $13 billion) in Anthropic.Shifting Competitive Landscape Among Cloud ProvidersThe timing of the announcement—immediately after Google Cloud Next—signals Amazon’s intent to challenge Google’s AI‑chip narrative. Nvidia’s new ARM‑based Vera CPU also targets the same agentic workloads, but Nvidia sells directly to enterprises, whereas AWS offers the chips only through its cloud platform.What This Means for Future AI Chip StrategiesAmazon CEO Andy Jassy has pledged to win on price‑performance, pressuring the internal chip team to accelerate Graviton and Trainium roadmaps. If Meta’s deployment proves successful, other AI‑heavy firms may follow, accelerating the migration from GPU‑only training pipelines to hybrid CPU‑GPU inference architectures.
#Meta
#Amazon
#AWS Graviton
Read More