Max Tegmark says we are training AI models not to say harmful things rather than not to want harmful things, which is like training a serial killer not to reveal their murderous desires

submitted by /u/MetaKnowing