Active Inference outperforms chatgbt in Mastermind benchmark using a laptop

From:

https://www.globenewswire.com/news-release/2024/12/17/2998249/0/en/VERSES-Genius-Outperforms-OpenAI-Model-in-Code-Breaking-Challenge-Mastermind.html

“Accuracy and Reliability. Genius solved the code every time in a consistent number of steps. Speed.

Genius consistently solved games in 1.1–4.5 seconds, while ChatGPT’s solve times ranged from 7.9 to 889 seconds (approximately 15 mins)

Efficiency. Genius’ total compute time for 100 games was just over 5 minutes, compared to ChatGPT’s 12.5 hours.

Cost. Genius’ compute cost was estimated at $0.05 USD for all 100 games, compared to ChatGPT’s o1 model at $263 USD.

In summary, Genius solved Mastermind 100% of the time, was 140 times faster and 5260 times cheaper than o1-preview.”

Pretty impressive demonstration of active inference and free energy principle using bayesian methods. Hopefully they see similar results with upcoming Atari 10k benchmarks. Seeing Karl Friston’s research in 2025-2026 will be pretty interesting for agi development which I think will be a blend of multiple ai methods.

Genius SDK beta overview:

https://medium.com/aimonks/behind-the-scenes-with-genius-how-active-inference-is-redefining-the-very-definition-of-ai-22c77743b8a5

Active inference overview:

https://ai.plainenglish.io/how-to-grow-a-sustainable-artificial-mind-from-scratch-54503b099a07

submitted by /u/oroechimaru
[link] [comments]