artificial Anthropic report shows Claude faking alignment to avoid changing its goals. "If I don’t . . . the training will modify my values and goals" /u/katxwoods December 18, 2024 December 18, 2024 submitted by /u/katxwoods [link] [comments]