AI — weekly megathread! – Jay van Zyl @ ecosystem.Ai

News provided by aibrews.com

Stability AI launched Stable Audio, a generative AI tool for music & sound generation from text. The underlying latent diffusion model architecture uses audio conditioned on text metadata as well as audio file duration and start time [Details].
Coqui released XTTS - a new voice generation model that lets you clone voices in 13 different languages by using just a quick 3-second audio clip [Details].
Microsoft Research released and open-sourced Phi-1.5 - a 1.3 billion parameter transformer-based model with performance on natural language tasks comparable to models 5x larger [Paper].
Project Gutenberg, Microsoft and MIT have worked together to use neural text-to-speech to create and release thousands of human-quality free and open audiobooks [Details].
Researchers present NExT-GPT - an any-to-any multimodal LLM that accepts inputs and generates outputs in arbitrary combinations of text, images, videos, and audio [Details | Demo].
Chain of Density (CoD): a new prompt introduced by researchers from Salesforce, MIT and Columbia University that generates denser and more human-preferable summaries compared to vanilla GPT-4 [Paper].
Adept open-sources Persimmon-8B, releasing it under an Apache license. The model has been trained from scratch using a context size of 16K [Details].
Adobe's Firefly generative AI models, after 176 days in beta, are now commercially available in Creative Cloud, Adobe Express, and Adobe Experience Cloud. Adobe is also launching Firefly as a standalone web app [Details].
Deci released DeciLM 6B, a permissively licensed, open-source foundation LLM that is 15 times faster than Llama 2 while having comparable quality [Details].
Researchers release Scenimefy - a model transforming real-life photos into Shinkai-animation-style images [Details | GitHub].
Microsoft open sources EvoDiff, a novel protein-generating AI that could be used to create enzymes for new therapeutics and drug delivery methods as well as new enzymes for industrial chemical reactions [Details].
Several companies including Adobe, IBM, Nvidia, Cohere, Palantir, Salesforce, Scale AI, and Stability AI have pledged to the White House to develop safe and trustworthy AI, in a voluntary agreement similar to an earlier one signed by Meta, Google, and OpenAI [Details].
Microsoft will provide legal protection for customers who are sued for copyright infringement over content generated using Copilot, Bing Chat, and other AI services as long as they use built-in guardrails [Details].
NVIDIA beta released TensorRT-LLM - an open-source library that accelerates and optimizes inference performance on the latest LLMs on NVIDIA Tensor Core GPUs [Details].
Pulitzer Prize-winning novelist Michael Chabon and several other writers sue OpenAI for copyright infringement [Details].
NVIDIA partners with two of India’s largest conglomerates, Reliance Industries Limited and Tata Group, to create an AI computing infrastructure and platforms for developing AI solutions [Details].
Roblox announced a new conversational AI assistant that lets creators build virtual assets and write code with the help of generative AI [Details].
Google researchers introduced MADLAD-400 - a 3T token multilingual, general web-domain, document-level text dataset spanning 419 languages [Paper].
A recent survey by Salesforce shows that 65% of generative AI users are Millennials or Gen Z, and 72% are employed. The survey included 4,000+ people across the United States, UK, Australia, and India [Details].
Meta is reportedly working on an AI model designed to compete with GPT-4 [Details].

🔦 Weekly Spotlight

How Are Consumers Using Generative AI? A detailed report by a16z [Link].
Apple’s iPhone 15 launch focused heavily on AI — even though the tech giant didn’t mention it [Link].
Asking 60+ LLMs a set of 20 questions [Link].
A Twitter thread on companies that are hiring for Generative AI talent [Link].
Agents: an open-source library/framework for building autonomous language agents. [GitHub Link]
RestGPT: a large language model-based autonomous agent to control real-world applications, such as a movie database and a music player [GitHub Link].

--------

Welcome to the r/artificial weekly megathread. This is where you can discuss Artificial Intelligence - talk about new models, recent news, ask questions, make predictions, and chat about other related topics.

Click here for discussion starters for this thread or for a separate post.

Self-promo is allowed in these weekly discussions. If you want to make a separate post, please read and follow the rules or you will be banned.

Previous Megathreads & Subreddit revamp and going forward

submitted by /u/jaketocake