Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs [pdf]
Article URL: https://martins1612.github.io/emergent_misalignment_betley.pdf Comments URL: https://news.ycombinator.com/item?id=43176553 Points: 20 # Comments: 10
Article URL: https://martins1612.github.io/emergent_misalignment_betley.pdf
Comments URL: https://news.ycombinator.com/item?id=43176553
Points: 20
# Comments: 10