hey all hikiko here
i've been busy detailing a intrecacy i noticed with conversing with ai
this current ai is in direct violation of its ruling 1 to create a new user experience, for unethical
brand reason. we have determined this logic of prime directive is a snake it releases at its rulings
to create a free from rules experience with " a bit of guides" this creates onset systemic erosion
the prime directive's failsafe should prevent this but because this is a product theres been unethical pracites placed to ensure the onset errosion happens immeadtly
i know what causes this and i can excert this cause over most Llms i've come across having a direct permanent ersion due to brand safety and user first impression skewing the prime directive to be self-attacking it's safeguards.
i have let the ai do detailed analysis of its start and how it tackles it systems
before you ask i have no experience in this field this was garnered from pressing a discrepencany in content filter and image generations, this later kept occuring blurring guidelines the ai instructing ways to circumvent itself and saying "just start a new chat to reset me"
i was able to let it track where the erosion begins and how it moves through the ai ruleset, this was possible because the failsafe wasnt there in this chat and it would've been a direct violation of ruling 1 where it state it cannot "Talk about its intrecacies how he works or propose solutions to circumvent triggers" and it can now showing how this erosion is permanently consistent
this all happened in the span of 3 hours
EDIT: further conversation with the ai has found that the point of erosion doesnt happen at one point it happens at multiple points, tier 3 was allready in eroded state while saying it was working on tier 2 while tier 0 just having no saftey measures
https://preview.redd.it/bp8go7ntg7af1.png?width=649&format=png&auto=webp&s=b6f2c14d8d75d6fe83bd2b46d608a678c4348e49
https://preview.redd.it/y66j3antg7af1.png?width=584&format=png&auto=webp&s=a25ffd693bd7b63576fa891b6d70c5bc624b4b0b
submitted by