Data
Data

New method efficiently safeguards sensitive AI training data

The approach maintains an AI model’s accuracy while ensuring attackers can’t extract secret information.

Could LLMs help design our next medicines and materials?

A new method lets users ask, in plain language, for a new molecule with certain properties, and receive a detailed description of how to synthesize it.

Vana is letting users own a piece of the AI models trained on their data

More than 1 million people are contributing their data to Vana’s decentralized network, which started as an MIT class project.

At the core of problem-solving

Stuart Levine ’97, director of MIT’s BioMicro Center, keeps departmental researchers at the forefront of systems biology.

Like human brains, large language models reason about diverse data in a general way

A new study shows LLMs represent different data types based on their underlying meaning and reason about data in their dominant language.

Validation technique could help scientists make more accurate forecasts

MIT researchers developed a new approach for assessing predictions with a spatial dimension, like forecasting weather or mapping air pollution.

Streamlining data collection for improved salmon population management

Assistant Professor Sara Beery is using automation to improve monitoring of migrating salmon in the Pacific Northwest.

User-friendly system can help developers build more efficient simulations and AI models

By automatically generating code that leverages two types of data redundancy, the system saves bandwidth, memory, and computation.

The multifaceted challenge of powering AI

Providing electricity to power-hungry data centers is stressing grids, raising prices for consumers, and slowing the transition to clean energy.

Explained: Generative AI’s environmental impact

Rapid development and deployment of powerful generative AI models comes with environmental consequences, including increased electricity demand and water consumption.