This week in AI, in partnership with aibrews.com. Feel free to follow their newsletter.
News & Insights
- Microsoft Research presents Composable Diffusion (CoDi), a novel generative model capable of generating any combination of output modalities, such as language, image, video, or audio, from any combination of input modalities. Unlike existing generative AI systems, CoDi can generate multiple modalities in parallel and its input is not limited to a subset of modalities like text or image [Details].
- MoonlanderAI announced the alpha release of its generative AI platform for building immersive 3D games using text descriptions [Details].
- Bark, a text-to-audio model, is now live on Discord. Bark can generate highly realistic, multilingual speech as well as other audio, including music, background noise, and laughing, sighing, and crying sounds [Details | GitHub].
- OpenAI's Code Interpreter plugin, allowing ChatGPT to execute code and access uploaded files, will roll out to all ChatGPT Plus users within a week. It enables data analysis, chart creation, file editing, math calculations, and more [Twitter Link].
- OpenAI announces general availability of GPT-4 API. Current API developers who have made successful payments can use it now, and new developers will have access by month's end [Details].
- Microsoft AI presents LONGNET, a Transformer variant that can scale the sequence length to 1 billion+ tokens without sacrificing performance on shorter sequences [Details].
- Researchers present a neural machine translation model that instantly translates the ancient language Akkadian, written on 5,000-year-old cuneiform tablets, into English [Details | Paper].
- A set of open-source LLMs, OpenLLMs, fine-tuned on only ~6K GPT-4 conversations, has achieved remarkable performance. Of these, OpenChat-13B, built upon LLAMA-13B, ranks #1 among open-source models on the AlpacaEval Leaderboard [GitHub | Huggingface | AlpacaEval].
- Researchers have developed an AI tool named CognoSpeak that uses a virtual character for patient interaction and speech analysis to identify early indicators of dementia and Alzheimer's disease [Link].
- Secretive hardware startup Humane shares details about its first product: ‘Ai Pin’. It is a wearable, AI-powered device that performs smartphone-like tasks, including summarizing emails, translating languages, and making calls. It also recognizes objects using a camera and computer vision, and it can project an interactive interface onto nearby surfaces, like the palm of a hand or the surface of a table [Details].
- Nvidia acquired OmniML, an AI startup whose software helped shrink machine-learning models so they could run on devices rather than in the cloud [Details].
- Cal Fire, the firefighting agency in California, is using AI to fight wildfires [Details].
- Over 150 executives from top European companies have signed an open letter urging the EU to rethink its plans to regulate AI [Details].
- Google updated its privacy policy: the company reserves the right to use just about everything users post online for developing its AI models and tools [Details].
- OpenAI believes superintelligence could arrive this decade and has announced a new project, Superalignment, focused on aligning superintelligent AI systems with human intent [Details].
🔦 Open Source Projects
- Embedchain: a framework to easily create LLM-powered bots over any dataset [Link].
- GPT-author: uses a chain of GPT-4 and Stable Diffusion API calls to generate an entire novel, outputting an EPUB file [Link].
- GPT-Migrate: Easily migrate your codebase from one framework or language to another [Link].
--------
Welcome to the r/artificial weekly megathread. This is where you can discuss Artificial Intelligence: talk about new models, recent news, ask questions, make predictions, and chat about other related topics.
Click here for discussion starters for this thread or for a separate post.
Self-promo is allowed in these weekly discussions. If you want to make a separate post, please read and go by the rules or you will be banned.