machine learning, machine learning deployment
RVPO: Risk-Sensitive Alignment via Variance Regularization
Apple Machine Learning Research
Google Inc.
May 8, 2026