Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI – AWS Blog
Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI – AWS Blog