Yong Zheng-Xin
I am a final-year PhD student at Brown University advised by Stephen Bach. I am fortunate to be supported by the Open Philanthropy (now Coefficient Giving) grant for technical AI safety.
I am currently an Astra Safety Research Fellow with OpenAI, mentored by Miles Wang and Olivia Watkins.
Research
I currently work on chain-of-thought monitorability as well as agentic safety evaluation.
My other relevant AI safety research includes safety for reasoning models (ICLR 2026) and for multilingual models (Best Paper, NeurIPS 2023 SoLaR; Best Paper, ACL 2024; NAACL 2025; EMNLP 2025).
Personal site inspired by Tianyu Gao and Gregory Gundersen.