AI — weekly megathread!
AI — weekly megathread!

AI — weekly megathread!

This week in AI - partnered with aibrews.com feel free to follow their newsletter

News & Insights

  1. The recently released open-source large language model Falcon LLM, by UAE’s Technology Innovation Institute, is now royalty-free for both commercial and research usage. Falcon 40B, the 40 billion parameters model trained on one trillion tokens, is ranked #1 on Open LLM Leaderboard by Hugging Face [Details | Open LLM Leaderboard].
  2. Neuralangel, a new AI model from Nvidia turns 2D video from any device - cell phone to drone capture - into 3D structures with intricate details using neural networks [Details].
  3. In three months, JPMorgan has advertised 3,651 AI jobs and sought a trademark for IndexGPT, a securities analysis AI product [Details].
  4. Google presents DIDACT (​​Dynamic Integrated Developer ACTivity), the first code LLM trained to model real software developers editing code, fixing builds, and doing code review. DIDACT uses the software development process as training data and not just the final code, leading to a more realistic understanding of the development task [Details].
  5. Japan's government won't enforce copyrights on data used for AI training regardless of whether it is for non-profit or commercial purposes [Details].
  6. ‘Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war.’ - One sentence statement signed by leading AI Scientists as well as many industry experts including CEOs of OpenAI, DeepMind and Anthropic [Details].
  7. Nvidia launched ‘Nvidia Avatar Cloud Engine (ACE) for Games’ - a custom AI model foundry service to build non-playable characters (NPCs) that not only engage in dynamic and unscripted conversations, but also possess evolving, persistent personalities and have precise facial animations and expressions [Details | YouTube Demo].
  8. OpenAI has launched a trust/security portal for OpenAI’s compliance documentation, security practices etc. [Details].
  9. Nvidia announced a new AI supercomputer, the DGX GH200, for giant models powering Generative AI, Recommender Systems and Data Processing. It has 500 times more memory than its predecessor, the DGX A100 from 2020 [Details].
  10. Researchers from Nvidia presented Voyager, the first ‘LLM-powered embodied lifelong learning agent’ that can explore, learn new skills, and make new discoveries continually without human intervention in the game Minecraft [Details].
  11. The a16z-backed chatbot startup Character.AI launched its mobile AI chatbot app on May 23 for iOS and Android, and succeeded in gaining over 1.7 million new installs within a week [Details].
  12. Microsoft Research presents Gorilla, a finetuned LLaMA-based model that surpasses the performance of GPT-4 on writing API calls [Details].
  13. OpenAI has trained a model using process supervision - rewarding the thought process rather than the outcome - to improve mathematical reasoning. Also released the full dataset used [Details | Dataset].
  14. WPP, the world's largest advertising agency, and Nvidia have teamed up to use generative AI for creating ads. The new platform allows WPP to tailor ads for different locations and digital channels, eliminating the need for costly on-site production [Details].
  15. PerplexityAI’s android app is available now, letting users search with voice input, learn with follow-up questions, and build a library of threads [Link].
  16. Researchers from Deepmind have presented ‘LLMs As Tool Makers (LATM)’ - a framework that allows Large Language Models (LLMs) to create and use their own tools, enhancing problem-solving abilities and cost efficiency. With this approach, a sophisticated model (like GPT-4) can make tools (where a tool is implemented as a Python utility function), while a less demanding one (like GPT-3.5) uses them [Details].
  17. Google’s Bard now provides relevant images in its chat responses [Link].

🔦 Social Spotlight

  1. Paragraphica - a camera without lens [Twitter thread].
  2. Andrew Ng announces three 3 new Generative AI courses (free) [Twitter thread].
  3. A 2-minute introduction to the fundamental building block behind Large Language Models: Text Embeddings [Twitter thread ].
  4. 8 use cases for quick development (<30 lines of code) using LangChain [Twitter thread link].

Welcome to the r/artificial weekly megathread. This is where you can discuss Artificial Intelligence - talk about new models, recent news, ask questions, make predictions, and chat other related topics.

Click here for discussion starters for this thread or for a separate post.

Self-promo is allowed in these weekly discussions. If you want to make a separate post, please read and go by the rules or you will be banned.

Subreddit revamp & going forward

submitted by /u/jaketocake
[link] [comments]