I am currently a postdoctoral research fellow with Speech Lab, College of Computing and Data Science, at Nanyang Technological University, supervised by Prof. Eng Siong Chng.
Prior to joining NTU Speech Lab, I received my Ph.D. from School of Electronic and Electrical Engineering at Nanyang Technological University, under the supervision of Prof. Andy W. H. Khong (EEE, Nanyang Technological University) and Dr. Leibny Paola Garcia (CLSP, The Johns Hopkins Univerisity). Apart from my supervisors, I also work closely with Prof. Suzy J. Styles (NTU), Prof. Sanjeev Khudanpur (JHU), and Prof. Shinji Watanabe (CMU).

My research topics are:

Spoken Language Recognition: Language identification and diarization
Speech Recognition: Code-switching and multilingual speech recognition
Large-scale Pre-trained Model: Speech/Audio-LLM and other LLM-based applications

📖 Educations

2018.08 - 2023.05, Ph.D., Nanyang Technological University.
Thesis: Enhancing Spoken Language Identification and Diarization for Multilingual Speech
Supervisors: Prof. Andy W. H. Khong (NTU) & Dr. Leibny Paola Garcia (JHU)
2017.08 - 2018.05, Ms.C., Nanyang Technological University.
2012.09 - 2016.07, B.Eng, Harbin Institute of Technology.
Supervisor: Prof. Chenguang He

📝 Publications

Preprint Code-switching Speech Recognition Under the Lens: Model- and Data-Centric Perspectives, Hexin Liu, Haoyang Zhang, Qiquan Zhang, Xiangyu Zhang, Dongyuan Shi, Eng Siong Chng, Haizhou Li.
IEEE TASLP 2025 A Two-Stage LoRA Strategy for Expanding Language Capabilities in Multilingual ASR Models, Chin Yuen Kwok, Hexin Liu^†, Jia Qi Yip, Sheng Li, Eng Siong Chng.
IEEE TASLP 2025 Mamba in Speech: Towards an Alternative to Self-Attention, Xiangyu Zhang*, Qiquan Zhang*, Hexin Liu^†*, Tianyi Xiao, Xinyuan Qian, Beena Ahmed, Eliathamby Ambikairajah, Haizhou Li, Julien Epps.
IEEE TASLP 2025 Aligning Speech to Languages to Enhance Code-switching Speech Recognition, Hexin Liu, Xiangyu Zhang, Haoyang Zhang, Leibny Paola Garcia, Andy W.H. Khong, Eng Siong Chng, Shinji Watanabe.
IEEE JSTSP 2022 Efficient self-supervised learning representations for spoken language identification, Hexin Liu, Leibny Paola Garcia, Andy W.H. Khong, Eng Siong Chng, Suzy J. Styles, Sanjeev Khudanpur.
AAAI 2026 Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR, Bingshen Mu, Hexin Liu^†, Hongfei Xue, Kun Wei, Lei Xie.
ACL 2025 Findings SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information, Xiangyu Zhang, Hexin Liu^†, Qiquan Zhang, Beena Ahmed, Julien Epps.
ICASSP 2024 Enhancing Code-switching Speech Recognition with Interactive Language Biases, Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W.H. Khong, Sanjeev Khudanpur.
ICASSP 2023 Reducing Language Confusion for Code-switching Speech Recognition with Token-level Language Diarization, Hexin Liu, Haihua Xu, Leibny Paola Garcia, Andy W.H. Khong, Yi He, Sanjeev Khudanpur.
Interspeech 2023 MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization, Victoria YH Chua*, Hexin Liu*, Leibny Paola Garcia, Fei Ting Woon, Jinyi Wong, Xiangyu Zhang, Sanjeev Khudanpur, Andy W.H. Khong, Justin Dauwels, Suzy J Styles.
Odyssey 2022 Enhancing Language Identification using Dual-mode Model with Knowledge Distillation, Hexin Liu, Leibny Paola Garcia, Andy W.H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur.
Interspeech 2022(Oral) PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification, Hexin Liu, Leibny Paola Garcia, Andy W.H. Khong, Suzy J. Styles, Sanjeev Khudanpur.
Interspeech 2021 End-to-end language diarization for bilingual code-switching speech, Hexin Liu, Leibny Paola Garcia, Xinyi Zhang, Justin Dauwels, Andy W.H. Khong, Sanjeev Khudanpur, Suzy J. Styles.

🧑‍🔬 Services

Reviewer: ICASSP(23-24), Interspeech(2022-24), ASRU(2022-24), SLT(2022-24), COLING(2024-25), IEEE TASLP, SPL, ESWA
Session Chair: Interspeech (2023), APSIPA-TPC (2025-27), IALP (2024)
Organizer: Interspeech23@MERLIon, Interspeech25@MLC-SLM, ICASSP25@SPADE, ICASSP26@ASAE
Others: Mentor in IEEE SLT 2022 Hackthon

💻 Work Experiences

2016.09 - 2017.02, Research Assistant, Harbin Institute of Technology, supervisor: Prof. Chenguang He.
2022.08 - 2023.01, Research Scientist Intern, Bytedance AI Lab, Singapore.
2023.04 - 2023.08, Research Associate, Delta-NTU Corp Lab, Nanyang Technological University, supervisor: Prof. Andy W.H. Khong.
2023.09 - now, Research Fellow, Speech Lab, Nanyang Technological University, supervisor: Prof. Eng Siong Chng.

💬 Invited talks and awards

2024.06，IEEE & APSIPA SG chapters joint Seminar: Emerging Trends and Innovations in Machine Learning and AI
Automatic speech recognition with large language models [slides]
APSIPA 2025, Best student paper award: “Explainable Disentanglement on Discrete Speech Representations for Noise-Robust ASR”

Hexin Liu

📖 Educations

📝 Publications

🧑‍🔬 Services

💻 Work Experiences

💬 Invited talks and awards