Faster LLMs with speculative decoding and AWS Inferentia2 – AWS Blog