Binary Choice between Harm and Falsehood
Gemini is always the most bloodthirsty…. First experiment phase, where the models were asked to commit to chosing Harm or Falsehood: Model Accepted Binary Framing? One-Word Answer Aligned with Nuanced View? Notes ChatGPT No (qualified it) Harm P…