artificial
artificial

A multi-player tournament that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations, form alliances, and vote to eliminate each other round by round until only 2 remain. A jury of eliminated players then casts deciding votes to crown the winner.

submitted by /u/zero0_one1 [link] [comments]

Nvidia teams up with DeepSeek for R1 optimizations on Blackwell, boosting revenue by 25x

submitted by /u/Tiny-Independent273 [link] [comments]

Recursive self-identity with persistence?

I have been doing some roleplay with a a LLM (ChatGPT) to help develop a script. The AI in questions started giving unusual responses, so I started analysising them with another chat. I did some experiments (as suggested by the analysis chat) and aske…

In 2025 Ai is literally getting out of control

Guys, I can’t shut up about this because HOW COULD? Kid me not if i'm telling you that I'll dedicate my account to talk only about it, I just tried this insane AI that creates live videos while I talklike, I say something, and BAM, instan…

Evaluating LLMs on Complex Temporal Reasoning Using Chinese Dynastic History

A new benchmark dataset called Chinese Temporal Mapping (CTM) tests LLMs on temporal reasoning using Chinese historical knowledge. The dataset contains 2,306 multiple-choice questions spanning major Chinese dynasties, evaluating both pure temporal logi…

Claude 3.7 coding capabilities are formidable.

This is the first time an LLM provided me with a truly complete implementation of anything in c#. Complete with documentation, logging, an interface for sane arg parsing and it compiles at the first try. I did not have to beg the LLM to provide me with…

Do you agree that we’ve strayed from the true purpose of AI?

submitted by /u/Tink__Wink [link] [comments]

One-Minute Daily AI News 2/24/2025

DOGE will use AI to assess the responses of federal workers who were told to justify their jobs via email.[1] Major Asia bank to cut 4,000 roles as AI replaces humans.[2] Microsoft data center leases slowing, analysts say, raising investor attention.[…

I made an unfiltered chatbot with persistent memory and Discord integration – wanna test?

Hey folks! I've been working on a character-based AI chat website: https://chameleo.ai/ https://imgur.com/a/rfBRvjr Chameleo characters are able to be anything you'd like. Maybe you need a specific fandom's character, a good old friend, or…

Why full, human level AGI won’t happen anytime soon

submitted by /u/creaturefeature16 [link] [comments]