alphaXiv

Explore

State of the Art

Sign In

Labs

Feedback

Browser Extension

We're hiring
PaperBlogResources

Eliciting Secret Knowledge from Language Models

BibTex
Copy
@misc{cywińskiFri Oct 31 2025 12:55:04 GMT+0000 (Coordinated Universal Time)elicitingsecretknowledge,
      title={Eliciting Secret Knowledge from Language Models},
      author={Bartosz Cywiński and Emil Ryd and Rowan Wang and Senthooran Rajamanoharan and Neel Nanda and Arthur Conmy and Samuel Marks},
      year={Fri Oct 31 2025 12:55:04 GMT+0000 (Coordinated Universal Time)},
      eprint={2510.01070},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2510.01070},
}
GitHub
eliciting-secret-knowledge
0
HTTPS
https://github.com/cywinski/eliciting-secret-knowledge
SSH
git@github.com:cywinski/eliciting-secret-knowledge.git
CLI
gh repo clone cywinski/eliciting-secret-knowledge
Transform this paper into an audio lecture
Get an engaging lecture and Q&A format to quickly understand the paper in minutes, perfect for learning on the go.
Audio lecture
Q&A format