machine learning machine learning deployment Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI – Amazon Web Services (AWS) Google Inc. January 9, 2026 January 9, 2026 Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI Amazon Web Services (AWS)