artificial GPT-4o’s Chinese token-training data is polluted by spam and porn websites /u/techreview May 22, 2024