What tools do they use to Create hyper-realistic AI Human Characters, and then use those characters in videos speaking realistic non-English text-to-speech models?
What tools do they use to Create hyper-realistic AI Human Characters, and then use those characters in videos speaking realistic non-English text-to-speech models?

What tools do they use to Create hyper-realistic AI Human Characters, and then use those characters in videos speaking realistic non-English text-to-speech models?

Note: This is obviously for my own research purposes and NOT FOR THINGS LIKE ONLYFANS.

I finished my Bachelor's in CS majoring in AI 3 years back. I've been working in different fields since but I'm now in a position where I can finally study and research what I am passionate about, AI.

I live outside the US and in a country where AI isn't that prominent, widely used, or taught that much. So I am hoping for some help here.

A few days back, I was talking to a friend about those OnlyFans guys who used an AI model and wondered what kind of sets of tools they could've used. And similarly, how people are using AI characters for their businesses both in Social Media pictures and videos.

Here's how I have segmented the whole process.

  • Create a hyper-realistic character Image on a platform that can account for the right ethnicity, race and age. That platform can remember the final character and produce various images in various postures and backgrounds.
  • Platforms to create videos with an image of the character, if there's a platform that does both Non-English text-to-speech and transposes that on my custom character realistically with facial and body movement, I would use that
  • If there isn't any platform that does both, perhaps a platform could be used to integrate the text-to-speech and the image to create a video
  • If it's a better solution to run some AI models on my PC, what are those AI Models?

Now my question and topic of help to this community is that, is there any all-in-one solution platform for this? If not, what's the next best solution for control and precision?

Please keep in mind the following example parameter:

Ethnicity: Bangladeshi/South East Asian

Text-to-speech: Bangla, English

submitted by /u/HK_OG
[link] [comments]