<span class="vcard">/u/PwntEFX</span>
/u/PwntEFX

The AI alignment paradigm is behaviorism with better PR

Tell me if I'm wrong, but the dominant method for making AI "aligned" smells a lot like a reinvention of a paradigm that developmental psychology spent the back half of the 20th century trying to abandon. RLHF, reduced to mechanism: model…