Can large language models figure out the real world?
New test could help determine if AI systems that make accurate predictions in one area can understand it well enough to apply that ability to a different area.