machine learning machine learning deployment How Rufus doubled their inference speed and handled Prime Day traffic with AWS AI chips and parallel decoding – Amazon Web Services (AWS) Google Inc. May 28, 2025 May 28, 2025 How Rufus doubled their inference speed and handled Prime Day traffic with AWS AI chips and parallel decoding Amazon Web Services (AWS)