computer-vision
computer-vision

Natural language boosts LLM performance in coding, planning, and robotics

Three neurosymbolic methods help language models find better abstractions within natural language, then use those representations to execute complex tasks.

Advancing technology for aquaculture

MIT Sea Grant students apply machine learning to support local aquaculture hatcheries.

AI generates high-quality images 30 times faster in a single step

Novel method makes tools like Stable Diffusion and DALL-E-3 faster by simplifying the image-generating process to a single step while maintaining or enhancing image quality.

New algorithm unlocks high-resolution insights for computer vision

FeatUp, developed by MIT CSAIL researchers, boosts the resolution of any deep network or visual foundation for computer vision systems.

Researchers enhance peripheral vision in AI models

By enabling models to see the world more like humans do, the work could help improve driver safety and shed light on human behavior.

Reasoning and reliability in AI

PhD students interning with the MIT-IBM Watson AI Lab look to improve natural language usage.

Multiple AI models help robots execute complex plans more transparently

A multimodal system uses models trained on language, vision, and action data to help robots develop and execute plans for household, construction, and manufacturing tasks.

A flexible solution to help artists improve animation

This new method draws on 200-year-old geometric foundations to give artists control over the appearance of animated characters.

Image recognition accuracy: An unseen challenge confounding today’s AI

“Minimum viewing time” benchmark gauges image recognition complexity for AI systems by measuring the time needed for accurate human identification.

A computer scientist pushes the boundaries of geometry

Justin Solomon applies modern geometric techniques to solve problems in computer vision, machine learning, statistics, and beyond.