<span class="vcard">/u/Successful-Western27</span>

HyperFields: towards zero-shot NeRFs by mapping language to 3D geometry

Generating 3D objects based solely on text descriptions has proven extremely challenging for AI. Current state-of-the-art methods require optimizing a full 3D model from scratch for each new prompt, which is computationally demanding. A new technique c…

Using Multi-Agent Reinforcement Learning results in better urban planning outcomes

Urban planning is tricky – governments push top-down changes while locals want bottom-up ideas. It's hard to find compromises that make everyone happier. A new research paper proposes using Multi-Agent Reinforcement Learning (MARL) to vote on land …

Researchers propose 3D-GPT: combining LLMs and agents for procedural Text-to-3D model generation

Researchers propose a new AI system called 3D-GPT that creates 3D models by combining natural language instructions and agents specialized for working with existing 3D modeling tools. 3D-GPT has predefined functions that make 3D shapes, and it tweaks p…

Meta Announces New Method for Real-Time Decoding of Images from Brain Activity

Brain decoding tech has improved a lot recently thanks to AI/ML, making it possible to read out visual perceptions from fMRI brain scans. But fMRI is too slow for real-time BCIs. A new study from Meta's AI research team pushes brain reading into real-time us…

Can GPT models be financial analysts? ChatGPT, GPT-4 fail CFA exams in new study by JP Morgan, Queen's University, and Virginia Tech

Researchers evaluated ChatGPT and GPT-4 on mock CFA exam questions to see if they could pass the real tests. The CFA exams rigorously test practical finance knowledge and are known for being quite difficult. They tested the models in zero-shot, few-sho…

Researchers propose GameGPT: A multi-agent approach to fully automated game development

Game dev is super complex nowadays – games have huge codebases, massive teams, and dev cycles dragging on for years. Costs are insane too – budgets can hit $100M+ easily. In a new paper, researchers propose to reverse this trend with an AI framework ca…

Lemur: Harmonizing Natural Language and Code for Language Agents

Today's conversational bots like Claude and GPT can chat impressively but aren't great at complex planning or executing technical tasks. To overcome this, new research from HKU builds open-source AI agents that blend natural language and coding…

Inverting Transformers Significantly Improves Time Series Forecasting

Transformers are great at NLP and computer vision tasks, but I was surprised to learn they still lag behind simple linear models at time series forecasting. The issue is how most Transformer architectures treat each timestamp as a token and fuse all th…

ALMT: Using text to narrow focus in multimodal sentiment analysis improves performance

Multimodal sentiment analysis combines text, audio and video to understand human emotions. But extra inputs can add irrelevant or conflicting signals. So filtering matters. Researchers made an "Adaptive Language-guided Multimodal Transformer" …

Researchers showcase method for AI-based roundabout design to help developing countries improve roadways

I like Cities: Skylines, but struggle with building roundabouts. Turns out, despite being safer than intersections, they're also tricky to design in real life – small tweaks can ruin traffic flow. They're designed iteratively. This is a pain for …