Uber scales on AWS to help power millions of daily trips and train its AI models

Uber expands AWS infrastructure using Graviton4 chips for faster driver-rider matching and pilots Trainium3 for AI model training to improve delivery predictions. The changes aim to reduce latency during demand spikes and optimize costs for real-time operations.
This showcases the infrastructure investment needed for millisecond-level marketplace matching, signaling similar tech upgrades coming to ecommerce platforms. Sellers should expect faster but more competitive bid environments as platforms adopt similar real-time optimization.
This represents the broader shift toward real-time AI optimization in marketplace operations, setting expectations for similar infrastructure upgrades across Amazon, Walmart, and other platforms that will demand faster seller responses.
Monitor your advertising response times and bid adjustments -- platforms will likely demand faster campaign optimizations as real-time matching improves.
Prepare for more granular delivery time commitments as AI prediction accuracy increases across all marketplace platforms.
Bottom Line
Uber's AWS upgrade previews faster marketplace matching coming to ecommerce.
Source Lens
Official Platform Update
Direct platform communication. Highest-value for policy, product, and operational changes.
Impact Level
medium
Uber's AWS upgrade previews faster marketplace matching coming to ecommerce.
Key Stat / Trigger
millisecond matching for millions of daily trips
Focus on the operational implication, not just the headline.
Full Coverage
Key takeaways Graviton4 and Trainium3 are Amazon’s custom-designed chips for computing and AI training. Uber uses AWS Graviton4 to help match customers with drivers in milliseconds. Uber pilots AWS Trainium3 to build faster, smarter AI predictions and models. Uber scales Trip Serving Zones on AWS to help handle demand spikes for rides and deliveries.
Uber, the world's largest ride-sharing and on-demand delivery company, is expanding its infrastructure and artificial intelligence (AI) capabilities on Amazon Web Services (AWS).
Uber is using AWS Graviton instances to support more of its Trip Serving Zones, the real-time infrastructure behind every ride and delivery, and has started pilot training some AI models on Trainium —enabling faster rider and delivery matching, global demand handling, and smarter, more personalized experiences for millions of daily users.
Every time you open Uber and request a ride or delivery, a series of split-second decisions happens behind the scenes. Which driver is closest? What's the fastest route? How long will it actually take?
Getting those answers right instantly—for millions of people at once—requires the right infrastructure for Uber to deliver these capabilities at scale during rush hour and major events.
How Graviton helps power millions of trips in real time Uber’s Trip Serving Zones are part of the system that makes sure every ride and delivery runs smoothly, which requires making millions of predictions and processing location data in milliseconds.
Now, Uber is expanding its use of AWS compute, storage, and networking to help power real-time operations for Trip Serving Zones. By running more of these workloads on AWS Graviton4, Uber can reduce energy consumption while scaling rapidly during demand spikes, both reducing latency and optimizing costs.
Graviton's high performance enables some of the real-time calculations that help match riders with drivers faster—without compromising reliability, availability, or security. AWS Graviton4 "Uber operates at a scale where milliseconds matter," said Kamran Zargahi, vice president of engineering at Uber.
"Moving more Trip Serving workloads to AWS gives us the flexibility to match riders and drivers faster and handle delivery demand spikes without disruption.” Improving Uber rides at scale with AWS Trainium chips Uber has also begun experimenting with AWS Trainium3 to train some of the AI models that help power its apps.
These models analyze data from billions of rides and deliveries to determine which driver or courier to send, calculate arrival times, and recommend the best delivery options to the customer. Training AI at this scale requires enormous computing power—Trainium provides an efficient, cost-effective way to do it.
As the models learn from more trips, Uber delivers faster matches, more accurate arrival time estimates, and more personalized recommendations to customers worldwide so they can get where they are going faster and receive their deliveries sooner.
AWS Trainium chip “By starting to pilot some of our AI models on Trainium, we're building a technology foundation that will make every Uber experience smarter—so we can keep our focus where it belongs: on the people who use Uber every day," Zargahi said.
"Uber is one of the most demanding real-time applications in the world, and we're proud to be an important part of the infrastructure powering their global operations,” said Rich Geraffo, vice president and managing director of North America at AWS.
“We're helping Uber deliver the reliability hundreds of millions of people count on today—and the AI-powered experiences that will define ride-sharing and on-demand delivery tomorrow.” Learn more about how AWS Graviton and Trainium are helping companies build faster, more efficient AI applications.
Trending news and stories Amazon Leo mission updates: Amazon Leo completes ninth mission, two more on deck Amazon unveils redesigned Kindle Scribe lineup with first-ever color Scribe How to watch ‘The Boys’ Season 5, streaming now on Prime Video 2026 Stagecoach Festival livestream: Watch on Prime Video, Twitch, and Amazon Music
Original Source
This briefing is based on reporting from About Amazon. Use the original post for full primary-source context.
Style
Audience
