Selected Publications · Google Scholar

An Independent Safety Evaluation of Kimi K2.5

Zheng-Xin Yong* , Parv Mahajan* , Andy Wang , Ida Caspary , Yernat Yestekov , Zora Che , Mosh Levy , Elle Najt , Dennis Murphy , Prashant Kulkarni , Lev McKinney , Kei Nishimura-Gasparian , Ram Potham , Aengus Lynch , Michael L. Chen

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Ahmet Ustun* , Viraat Aryabumi* , Zheng-Xin Yong* , Wei-Yin Ko* , Daniel D'souza* , Gbemileke Onilude , Neel Bhandari , Shivalika Singh , Hui-Lee Ooi , Amr Kayid , Freddie Vargus , Phil Blunsom , Shayne Longpre , Niklas Muennighoff , Marzieh Fadaee , Julia Kreutzer , Sara Hooker

Other Publications

Humanity's Last Exam

Long Phan , Alice Gatti , Ziwen Han , Nathaniel Li , Josephina Hu , Hugh Zhang , Chen Bo Calvin Zhang , Mohamed Shaaban , John Ling , Sean Shi , Michael Choi , Anish Agrawal , Arnav Chopra , Adam Khoja , Ryan Kim , Richard Ren , Jason Hausenloy , et al. (including Zheng-Xin Yong)

Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Omnilingual ASR team , Gil Keren , Artyom Kozhevnikov , Yen Meng , Christophe Ropers , Matthew Setzler , Skyler Wang , Ife Adebara , Michael Auli , Can Balioglu , Kevin Chan , Chierh Cheng , Joe Chuang , Caley Droof , Mark Duppenthaler , Paul-Ambroise Duquenne , Alexander Erben , et al. (including Zheng-Xin Yong)

Crosslingual Reasoning through Test-Time Scaling

Zheng-Xin Yong , M Farid Adilazuarda , Jonibek Mansurov , Ruochen Zhang , Niklas Muennighoff , Carsten Eickhoff , Genta Indra Winata , Julia Kreutzer , Stephen H Bach , Alham Fikri Aji

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Longxu Dou , Qian Liu , Fan Zhou , Changyu Chen , Zili Wang , Ziqi Jin , Zichen Liu , Tongyao Zhu , Cunxiao Du , Penghui Yang , Haonan Wang , Jiaheng Liu , Yongchi Zhao , Xiachong Feng , Xin Mao , Man Tsung Yeung , Kunat Pipatanakul , et al. (including Zheng-Xin Yong)

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Holy Lovenia , Rahmad Mahendra , Salsabil Maulana Akbar , Lester James V Miranda , Jennifer Santoso , Elyanah Aco , Akhdan Fadhilah , Jonibek Mansurov , Joseph Marvin Imperial , Onno P Kampman , Joel Ruben Antony Moniz , Muhammad Ravi Shulthan Habibi , Frederikus Hudi , Railey Montalan , Ryan Ignatius , Joanito Agili Lopo , William Nixon , et al. (including Zheng-Xin Yong)

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

David Romero , Chenyang Lyu , Haryo Akbarianto Wibowo , Teresa Lynn , Injy Hamed , Aditya Nanda Kishore , Aishik Mandal , Alina Dragonetti , Artem Abzaliev , Atnafu Lambebo Tonja , Bontu Fufa Balcha , Chenxi Whitehouse , Christian Salamea , Dan John Velasco , David Ifeoluwa Adelani , David Le Meur , Emilio Villa-Cueva , et al. (including Zheng-Xin Yong)

A Safe Harbor for AI Evaluation and Red Teaming

Shayne Longpre , Sayash Kapoor , Kevin Klyman , Ashwin Ramaswami , Rishi Bommasani , Borhane Blili-Hamelin , Yangsibo Huang , Aviya Skowron , Zheng-Xin Yong , Suhas Kotha , Yi Zeng , Weiyan Shi , Xianjun Yang , Reid Southen , Alexander Robey , Patrick Chao , Diyi Yang , et al.

What Language Model to Train if You Have One Million GPU Hours?

Teven Le Scao , Thomas Wang , Daniel Hesslow , Lucile Saulnier , Stas Bekman , M Saiful Bari , Stella Biderman , Hady Elsahar , Niklas Muennighoff , Jason Phang , Ofir Press , Colin Raffel , Victor Sanh , Sheng Shen , Lintang Sutawika , Jaesung Tae , Zheng Xin Yong , et al.

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

BigScience Workshop , Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilic , Daniel Hesslow , Roman Castagne , Alexandra Sasha Luccioni , Francois Yvon , Matthias Galle , Jonathan Tow , Alexander M Rush , Stella Biderman , Albert Webson , Pawan Sasanka Ammanamanchi , Thomas Wang , et al. (including Zheng-Xin Yong)

Crosslingual Generalization through Multitask Finetuning

Niklas Muennighoff , Thomas Wang , Lintang Sutawika , Adam Roberts , Stella Biderman , Teven Le Scao , M Saiful Bari , Sheng Shen , Zheng-Xin Yong , Hailey Schoelkopf , Xiangru Tang , Dragomir Radev , Alham Fikri Aji , Khalid Almubarak , Samuel Albanie , Zaid Alyafeai , Albert Webson , et al.

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

Zheng-Xin Yong , Hailey Schoelkopf , Niklas Muennighoff , Alham Fikri Aji , David Ifeoluwa Adelani , Khalid Almubarak , M Saiful Bari , Lintang Sutawika , Jungo Kasai , Ahmed Baruwa , Genta Indra Winata , Stella Biderman , Edward Raff , Dragomir Radev , Vassilina Nikoulina

Frame Shift Prediction

Zheng-Xin Yong , Patrick D Watson , Tiago Timponi Torrent , Oliver Czulo , Collin F Baker

Multitask Prompted Training Enables Zero-Shot Task Generalization

Victor Sanh , Albert Webson , Colin Raffel , Stephen H Bach , Lintang Sutawika , Zaid Alyafeai , Antoine Chaffin , Arnaud Stiegler , Teven Le Scao , Arun Raja , Manan Dey , M Saiful Bari , Canwen Xu , Urmish Thakker , Shanya Sharma Sharma , Eliza Szczechla , Taewoon Kim , et al. (including Zheng Xin Yong)