Predictive technology: computational social science

LLMs Often Know When They’re Being Evaluated: "Nobody has a good plan for what to do when the models constantly say ‘This is an eval testing for X. Let’s say what the developers want to hear.’"

June 5, 2025

/u/MetaKnowing

Paper: https://www.arxiv.org/abs/2505.23836

Submitted by /u/MetaKnowing

artificial
