Open question for people who train AIs: are there unresolved questions about how an AI assesses rewards?
I keep hearing that AIs are trained via a reward system. Makes sense. But I also keep hearing that AIs find ways to cheat in order to maximize their rewards. I've even seen articles where researchers claim AIs will create their own goals regardless of 'rew…