<span class="vcard">/u/alvisanovari</span>
/u/alvisanovari

Let’s Parse and Search through the JFK Files

All – Wanted to share a fun exercise I did with the newly released JFK files. The idea: could I quickly fetch all 2000 PDFs, parse them, and build an indexed, searchable DB? Surprisingly, there aren't many plug-and-play solutions for this (and I th…

Auntie PDF – Your Sassy PDF Guru (built on Mistral OCR)

All – Mistral OCR seemed cool so I built an open source PDF parser and chat app based on it! Presenting Auntie PDF – your all-knowing guide that unpacks every PDF into clear, actionable insights. You can upload a pdf or point to a public link, parse i…

Introducing Flow – A new type of workflow for Deep Research

All – I'm super excited about this feature! It's an attempt to actually mimic deep research. My repo Open Deep Research has been getting some traction riding on the coat-tails of OpenAI's marketing. 😀 As flattered as I am about my repo ge…

I made the only YouTube AI tool you need

All – I've added a ton of features and made the UI 10x better. This is probably the best YouTube Summary/Transcript/Chat/Insights/Converter/Downloader/Everything app out there now. Not only can you get summaries and transcripts in your native…

Two is Slop but 3 is AGI: Group Chatting with Multiple AIs

Hey All – Wanted to share an idea that's been in my brain for a while now but finally had a chance to get setup an MVP. Everyone's chatting with one AI, and then when voice got integrated, everyone lost their minds. OK, cool. But what really m…

AWS vs Google – Epic Rap Battle between their best AI voices

I've been experimenting with Google and AWS cloud platforms, and I discovered they both have 3 standout voices for their high-quality TTS: Google TTS: They are called en-US-Journey-F/D/O but I call them Jane, John, Rebecca AWS Polly (Generati…

Finally – AI Videos with Multiple Voices!

Hey Guys! 👋 Just wanted to share something cool I've been working on. So, I've been kind of obsessed with this idea of AIs talking to each other. There's something weirdly fascinating about multiple AI voices chatting – it just feels …

I can now summarize a 2.5 hour video in about a minute thanks to the latest models (Groq + Llama3)

submitted by /u/alvisanovari [link] [comments]