Algorithms
Algorithms

Method prevents an AI model from being overconfident about wrong answers

More efficient than other approaches, the “Thermometer” technique could help someone know when they should trust a large language model.

Study: When allocating scarce resources with AI, randomization can improve fairness

Introducing structured randomization into decisions based on machine-learning model predictions can address inherent uncertainties while maintaining efficiency.

MIT researchers advance automated interpretability in AI models

MAIA is a multimodal agent that can iteratively design experiments to better understand various components of AI systems.

Large language models don’t behave like people, even though we may expect them to

A new study shows someone’s beliefs about an LLM play a significant role in the model’s performance and are important for how it is deployed.

Creating and verifying stable AI-controlled systems in a rigorous and flexible way

Neural network controllers provide complex robots with stability guarantees, paving the way for the safer deployment of autonomous vehicles and industrial machines.

How to assess a general-purpose AI model’s reliability before it’s deployed

A new technique enables users to compare several large models and choose the one that works best for their task.

Reasoning skills of large language models are often overestimated

New CSAIL research highlights how LLMs excel in familiar scenarios but struggle in novel ones, questioning their true reasoning abilities versus reliance on memorization.

“They can see themselves shaping the world they live in”

Developed by MIT RAISE, the Day of AI curriculum empowers K-12 students to collaborate on local and global challenges using AI.

MIT researchers introduce generative AI for databases

This new tool offers an easier way for people to analyze complex tabular data.

Helping nonexperts build advanced generative AI models

MosaicML, co-founded by an MIT alumnus and a professor, made deep-learning models faster and more efficient. Its acquisition by Databricks broadened that mission.