WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models
BibTex
Copy
@Article{Gupta2024WalledEvalAC,
author = {Prannaya Gupta and Le Qi Yau and Hao Han Low and I-Shiang Lee and Hugo Maximus Lim and Yu Xin Teoh and Jia Hng Koh and Dar Win Liew and Rishabh Bhardwaj and Rajat Bhardwaj and Soujanya Poria},
booktitle = {Conference on Empirical Methods in Natural Language Processing},
journal = {ArXiv},
title = {WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models},
volume = {abs/2408.03837},
year = {2024}
}
Transform this paper into an audio lecture
Get an engaging lecture and Q&A format to quickly understand the paper in minutes, perfect for learning on the go.