artificial
artificial

Training Vision-Language Models for BLV-Aligned Diagram Descriptions using Sighted User Feedback

Sightation: Using Sighted Feedback to Build Better Diagram Descriptions for BLV Users This paper introduces a novel approach to creating high-quality diagram descriptions for blind and low-vision (BLV) users by leveraging sighted user feedback on VLM-g…

Evaluating Large Reasoning Models on Analogical Reasoning Tasks Under Perceptual Uncertainty

This paper tackles a critical question: can multimodal AI models perform accurate reasoning when faced with uncertain visual inputs? The researchers introduce I-RAVEN-X, a modified version of Raven's Progressive Matrices that deliberately introduce…