/u/Mr-Barack-Obama

SOTA models at 2K tps

I need SOTA AI at around 2K TPS with tiny latency so that I can get time to first answer token under 3 seconds, for real-time replies with full CoT for maximum intelligence. I don't need this consistently, only maybe for an hour at a time for real-tim…
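
For a rough sense of what those numbers allow, here is a minimal sketch of the arithmetic in Python. The 2K TPS and 3 second figures come from the post; the 0.5 second latency and the function name `cot_token_budget` are assumptions for illustration only, not measurements of any provider.

```python
def cot_token_budget(tokens_per_second: float,
                     target_seconds: float,
                     latency_seconds: float) -> int:
    """Estimate how many reasoning tokens fit before the first answer token is due.

    latency_seconds is an assumed fixed overhead (network plus prompt
    processing) that elapses before any token is generated.
    """
    # Time left for generating chain-of-thought tokens; never negative.
    generation_window = max(target_seconds - latency_seconds, 0.0)
    return int(tokens_per_second * generation_window)


# 2K TPS and a 3 s target are from the post; 0.5 s latency is an assumption.
budget = cot_token_budget(2000, 3.0, 0.5)
print(budget)  # 5000
```

Under those assumptions the model gets about 5,000 reasoning tokens before the answer has to start, so every extra half second of latency costs roughly 1,000 tokens of CoT.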

Best model for transcribing videos?

I have a screen recording of a Zoom meeting. When someone speaks, you can see on screen who is speaking. I'd like to give the video to an AI model that can transcribe it and note who says what by visually paying attention to who is speakin…

Best small models for survival situations?

What are the current smartest models that take up less than 4GB as a GGUF file? I'm going camping and won't have an internet connection. I can run models under 4GB on my iPhone. It's so hard to keep track of what models are the smartest becaus…

Share your favorite benchmarks, here are mine.

My favorite overall benchmark is LiveBench. If you click "show subcategories" for the language average, you can rank by plot_unscrambling, which to me is the most important benchmark for writing: https://livebench.ai/ Vals is useful for tax and law…

Everyone share their favorite chain of thought prompts!

Here’s my favorite CoT prompt. I DID NOT MAKE IT. I’ve found these chain-of-thought prompts can be very powerful for making jailbreaks. I’m trying to find more so that I can make a good one to release! This one is good for both logic and creativity, pl…