Hello everyone,
I have recently started working on a project where I need to animate an image of a face in real time so that it speaks sentences. Essentially, I am trying to build a face for my own large language model. I know of Nvidia's Audio2Face and MetaHuman, but both are 3D and take a long time to render the lip and eye animations. I need something that works with only a little latency.
Does anyone know of a service or a repo I could use to animate a 2D picture so that it speaks text?