Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements
A summary of “Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements” by Jiawen Deng, Jiale Cheng, Hao Sun, Zhexin Zhang and Minlie Huang.
September 18, 2024