/u/wyem

This week in AI – all the Major AI developments in a nutshell

Microsoft introduced Copilot+ PCs, a new category of Windows PCs designed for AI, with built-in AI hardware and support for AI features across the operating system. Users can easily locate and recall previously viewed content using Recall, generate a…

This week in AI – all the Major AI developments in a nutshell

Alibaba Research released Qwen1.5-110B, the largest model in the Qwen1.5 series, with over 100 billion parameters. It demonstrates competitive performance against Llama-3-70B. The model supports a context length of 32K tokens and is mult…

This week in AI – all the Major AI developments in a nutshell

Meta released the Meta Llama 3 family of large language models in 8B and 70B sizes. The training dataset (15T+ tokens) is seven times larger than that used for Llama 2. Meta believes these to be the best open source models of their class. The models ach…

This week in AI – all the Major AI developments in a nutshell

Cohere introduced Rerank 3, a new foundation model purpose-built for efficient enterprise search and Retrieval-Augmented Generation (RAG) systems. It enables search over multi-aspect and semi-structured data like emails, invoices, JSON documents, code…

This week in AI – all the Major AI developments in a nutshell

AI21 Labs introduced Jamba, a production-grade Mamba-based model. By enhancing Mamba Structured State Space model (SSM) technology with elements of the traditional Transformer architecture, Jamba compensates for the inherent limitations of a pure SSM …

This week in AI – all the Major AI developments in a nutshell

Meta AI introduced SceneScript, a novel method of generating scene layouts and representing scenes using language. SceneScript allows AR & AI devices to understand the geometry of physical spaces. It uses next token prediction like an LLM, but ins…

This week in AI – all the Major AI developments in a nutshell

DeepSeek released DeepSeek-VL, an open-source Vision-Language (VL) model designed for real-world vision and language understanding applications. The DeepSeek-VL family includes 7B and 1.3B base and chat models and achieves state-of-the-art or competit…

This week in AI – all the Major AI developments in a nutshell

Mistral introduced a new model, Mistral Large. It reaches top-tier reasoning capabilities, is multilingual by design, has native function-calling capabilities, and has a 32K-token context window. The pre-trained model has 81.2% accuracy on MMLU. Alongside…

This week in AI – all the Major AI developments in a nutshell

Meta AI introduces V-JEPA (Video Joint Embedding Predictive Architecture), a method for teaching machines to understand and model the physical world by watching videos. Meta AI releases a collection of V-JEPA vision models trained with a feature predi…

This week in AI – all the Major AI developments in a nutshell

Google launches Ultra 1.0, its largest and most capable AI model, in its ChatGPT-like assistant, which has now been rebranded as Gemini (earlier called Bard). Gemini Advanced is available in 150 countries as a premium plan for $19.99/month, starting …