Yong Zheng-Xin
Current: Computer Science Ph.D. @ Brown University
Past: Research Scientist Intern @ Meta AI, Research Collaborator @ Cohere For AI

I am a fourth-year Ph.D. student in Computer Science at Brown University, advised by Prof. Stephen Bach. I am fortunate to have worked with researchers at Meta GenAI Safety Alignment (with Jianfeng Chi), Meta AI FAIR (with Jean Maillard and Michael Auli), and Cohere Labs (with Julia Kreutzer, Beyza Ermis, Marzieh Fadaee, and Sara Hooker).
These days, my focus is on scaling reasoning compute. My most recent work shows that test-time scaling of English models can unlock crosslingual reasoning through a “quote-and-think” pattern (preprint). More work to come soon :)
I also work on safety alignment. I was the first to discover that low-resource languages can jailbreak GPT-4 (⭑Best Paper Award, NeurIPS 2023 Socially Responsible Language Modeling Workshop). This work pioneered multilingual red-teaming and was highlighted in the first International Scientific Report on the Safety of Advanced AI (2024). It was also featured in New Scientist, Montreal AI Ethics Institute, ZDNET, etc. Besides, I have worked on interpretability studies that explain the crosslingual transfer of alignment training (EMNLP 2024 Findings) and on fine-tuning attacks (NAACL 2025 Findings). I also have experience red-teaming frontier LLMs such as the Aya model (Cohere Labs).
I’ve dedicated a considerable amount of time to multilingual NLP, with the goal of creating LLMs that are helpful and safe for everyone in the world. I am a major contributor to frontier open-source multilingual LLMs such as the Aya models (⭑Best Paper Award, ACL 2024 – co-first author) and mT0/BLOOMZ (ACL 2023). I have also worked on low-resource NLP, such as language adaptation (ACL 2023) and synthetic data generation (EMNLP 2024 Findings), and on speech technology, particularly on understanding accent bias in ASR (to appear). Besides, as a Malaysian, I have contributed to NLP for Southeast Asian (SEA) languages. I’ve co-hosted an *ACL tutorial (2023), helped curate the SEACrowd data hub (EMNLP 2024), and studied how well LLMs can handle SEA linguistic phenomena, such as code-switching (EMNLP 2023 CALCS Workshop), and understand culture in the SEA region (NeurIPS 2024).
Other Misc Stuff:
- I went to Minerva University for my undergrad so I had the opportunity to travel and live in six different cities (for at least 4 months in each city) around the world: 🇺🇸 San Francisco, 🇰🇷 Seoul, 🇮🇳 Hyderabad, 🇩🇪 Berlin, 🇦🇷 Buenos Aires and 🇬🇧 London.
- My favorite hobby is dancing 🕺, especially salsa and bachata. I also dance a bit of Lindy Hop, Argentine Tango and K-pop.
I usually check out the local dance scene when I travel to conferences. If you also enjoy dancing, hmu and we can check it out together.
selected publications (see all)
- NAACL Findings, 2025
- EMNLP Findings, 2024
- ACL, 2024 (Best Paper Award)
- NeurIPS Workshop: Socially Responsible Language Modelling Research (SoLaR), 2023 (Best Paper Award)
news
02 / 2025 | 1 paper accepted! Work on cross-lingual fine-tuning attacks is accepted to Findings of NAACL 2025. |
---|---|
09 / 2024 | 4 papers accepted! LexC-Gen and explanations of cross-lingual LLM toxicity reduction are accepted to Findings of EMNLP 2024. SEACrowd is also accepted to EMNLP 2024. CVQA is accepted to NeurIPS 2024 Datasets & Benchmarks. |
08 / 2024 | The Aya Model paper received the ⭑Best Paper Award at ACL 2024. |
07 / 2024 | Gave a talk about multilingual AI safety at London Data Week (organized by The Alan Turing Institute and supported by the Mayor of London). |
06 / 2024 | Meta AI: Started my research scientist internship at Meta AI (FAIR), working on Massively Multilingual Speech (MMS) models. Also collaborated with the GenAI Trust Team on a multilingual safety project. |
05 / 2024 | 1 paper accepted! A Safe Harbor for AI Evaluation and Red Teaming is accepted to ICML 2024, accompanied by an open letter signed by 300+ researchers calling for legal and technical protections for AI red-teaming by independent researchers. |
02 / 2024 | Aya model and dataset papers are released! I presented Aya multilingual safety research at the Aya Grand Finale. |
11 / 2023 | Co-organized the tutorial “Current Status of NLP in South East Asia” at AACL 2023. |
10 / 2023 | “Low-Resource Languages Jailbreak GPT-4” received the ⭑Best Paper Award at NeurIPS 2023 Socially Responsible Language Modeling (SoLaR) workshop. |
09 / 2023 | Cohere For AI: Joined the Responsible Deployment Team for Aya red-teaming. |
05 / 2023 | Interviewed by Wired on our code-switching paper and grassroots research initiative for Southeast Asian (SEA) languages. |
05 / 2023 | 3 papers accepted! BLOOM+1, BLOOMZ, and a code-switching survey are accepted to ACL 2023. |
03 / 2022 | 2 papers accepted! T0 is accepted to ICLR 2022 (Spotlight) and its blog post is out! PromptSource is also accepted to ACL 2022 Demo track. |
06 / 2021 | Started PhD at Brown University. |