Augmenting citizen science with computer vision for fish monitoring
MIT Sea Grant works with the Woodwell Climate Research Center and other collaborators to demonstrate a deep learning-based system for fish monitoring.
After being trained with this technique, vision-language models can better identify a unique item in a new scene.
This new machine-learning model can match corresponding audio and visual data, which could someday help robots interact in the real world.
The CausVid generative AI tool uses a diffusion model to teach an autoregressive (frame-by-frame) system to rapidly produce stable, high-resolution videos.
A new method can train a neural network to sift through corrupted data while anticipating next steps. It can make flexible plans for robots, generate high-quality video, and help AI agents navigate digital environments.
Researchers find large language models make inconsistent decisions about whether to call the police when analyzing surveillance videos.
Multimedia artist Jackson 2bears reimagines the Haudenosaunee longhouse and creation story.
A new approach could streamline virtual training processes or aid clinicians in reviewing diagnostic videos.
Associate Professor Jonathan Ragan-Kelley optimizes how computer graphics and images are processed for the hardware of today and tomorrow.