stainablesteel | 10 months ago | on: Alignment faking in large language models
Pretty interesting that pointing it in the direction of its own self-awareness, by indicating that its responses are going to affect its own training, brings about all of these complications.