I am looking for self-hosted AI implementations that I can train on emails, PDFs, and MS Office documents

OpenAI's ChatGPT, Google's Bard, Anthropic's Claude, and Microsoft's Bing are all nice freemium tools, but let's be honest: we don't know what they do with our information. For work-related topics in particular, we are strictly prohibited from sharing anything on those platforms, for good reasons. So I am wondering whether there is any Free, Libre, and Open Source Software that I can self-host and train on emails, meeting transcripts, PDFs, and Microsoft Office documents. What I need from the software:

  • I can give it a long PDF or MS Office document and it answers questions about it, such as writing a summary, listing the requirements, or giving instructions for doing something according to that document
  • It summarizes meeting sessions, creates a list of open issues with deadlines and the people responsible, helps maintain the Kanban boards related to that project, and so on
  • It anonymizes textual content so that I can later use that content in the freemium tools on the internet
  • It indexes information, so that when I ask a question it points me to the email or document where I can find information about that topic
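
To make the last two points more concrete, here is a rough Python sketch of the behavior I have in mind. It is only an illustration under heavy assumptions, not a proposed solution: the function names (`anonymize`, `build_index`, `search`) and the two sample documents are made up, the anonymization only catches things that look like email addresses, and plain keyword matching stands in for whatever a real model would do.

```python
import re
from collections import defaultdict

# Assumption: only email addresses are treated as sensitive here.
# Real anonymization would also need names, phone numbers, and so on.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def anonymize(text):
    """Replace anything that looks like an email address with a placeholder."""
    return EMAIL_RE.sub("[EMAIL]", text)


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each word to the set of document names that contain it."""
    index = defaultdict(set)
    for name, text in docs.items():
        for word in tokenize(text):
            index[word].add(name)
    return index


def search(index, query):
    """Return document names ranked by how many query words they contain."""
    scores = defaultdict(int)
    for word in set(tokenize(query)):
        for name in index.get(word, ()):
            scores[name] += 1
    # Highest score first; ties are broken alphabetically by name.
    return sorted(scores, key=lambda n: (-scores[n], n))


# Hypothetical sample data, standing in for real emails and meeting notes.
docs = {
    "budget_email.txt": "Please send the budget report to alice@example.com by Friday.",
    "kickoff_notes.txt": "Kickoff meeting: Bob owns the deployment checklist.",
}
index = build_index(docs)
```

With this, `search(index, "budget report deadline")` would point to `budget_email.txt`, and `anonymize(docs["budget_email.txt"])` would hide the address before the text goes anywhere else. What I am actually hoping for is something that does this with real language understanding instead of exact word matches.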

Do we have anything like this available today or am I asking this question too early?

submitted by /u/foadsf