/u/jaketocake

AI — weekly megathread!

/u/jaketocake December 8, 2023 December 8, 2023

News provided by aibrews.com Google introduced Gemini – a family of multimodal models built from the ground up for multimodality, capable of reasoning seamlessly across text, images, video, audio, and code. It comes in Ultra, Pro, and Nano sizes, suit…

artificial

AI — weekly megathread!

/u/jaketocake December 1, 2023 December 1, 2023

News provided by aibrews.com Meta AI introduced a suite of AI language translation models that preserve expression and improve streaming [Details | GitHub]: SeamlessExpressive enables the transfer of tones, emotional expression and vocal styles in sp…

artificial

AI — weekly megathread!

/u/jaketocake November 24, 2023 November 24, 2023

News provided by aibrews.com Stability AI released Stable Video Diffusion, a latent video diffusion model for high-resolution text-to-video and image-to-video generation. [Details | Paper]. Microsoft Research released Orca 2 (7 billion and 13 billion…

artificial

AI — weekly megathread!

/u/jaketocake November 17, 2023 November 17, 2023

News provided by aibrews.com Meta AI introduces: Emu Video: new text-to-video model that leverages Meta’s Emu image generation model and can respond to text-only, image-only or combined text & image inputs to generate high quality video [Details]…

artificial

AI — weekly megathread!

/u/jaketocake November 10, 2023 November 10, 2023

News provided by aibrews.com Luma AI introduced Genie, a generative 3D foundation model in research preview. It’s free during research preview via Discord [Details]. Nous Research released Obsidian, the world's first 3B multi-modal model family pr…

artificial

AI — weekly megathread!

/u/jaketocake November 3, 2023 November 3, 2023

News provided by aibrews.com Luma AI introduced Genie, a generative 3D foundation model in research preview. It’s free during research preview via Discord [Details]. Nous Research released Obsidian, the world's first 3B multi-modal model family pr…

artificial

AI — weekly megathread!

/u/jaketocake October 27, 2023 October 27, 2023

News provided by aibrews.com Twelve Labs announced video-language foundation model Pegasus-1 (80B) along with a new suite of Video-to-Text APIs. Pegasus-1 integrates visual, audio, and speech information to generate more holistic text from vi…

artificial

AI — weekly megathread!

/u/jaketocake October 20, 2023 October 20, 2023

News provided by aibrews.com Adept open-sources Fuyu-8B – a multimodal model designed from the ground up for digital agents, so it can support arbitrary image resolutions, answer questions about graphs and diagrams, answer UI-based questions …

artificial

AI — weekly megathread!

/u/jaketocake October 13, 2023 October 13, 2023

News provided by aibrews.com Google DeepMind introduced 𝗥𝗧-𝗫: a generalist AI model to help advance how robots can learn new skills. To train it, DeepMind together with 33 academic labs developed Open X-Embodiment, a massive open dataset that…

artificial

AI — weekly megathread!

/u/jaketocake October 6, 2023 October 6, 2023

News provided by aibrews.com Google DeepMind introduced 𝗥𝗧-𝗫: a generalist AI model to help advance how robots can learn new skills. To train it, DeepMind together with 33 academic labs developed Open X-Embodiment, a massive open dataset that…

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: