


Hexin Liu
2020 – today
- 2026
[c23]Bingshen Mu, Hexin Liu, Hongfei Xue, Kun Wei, Lei Xie:
Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR. AAAI 2026: 32519-32527
[i31]Yue Heng Yeo, Yuchen Hu, Shreyas Gopal, Yizhou Peng, Hexin Liu, Eng Siong Chng:
Improving Code-Switching Speech Recognition with TTS Data Augmentation. CoRR abs/2601.00935 (2026)
[i30]Zhixian Zhao, Shuiyuan Wang, Guojian Li, Hongfei Xue, Chengyou Wang, Shuai Wang, Longshuai Xiao, Zihan Zhang, Hui Bu, Xin Xu, Xinsheng Wang, Hexin Liu, Eng Siong Chng, Hung-yi Lee, Haizhou Li, Lei Xie:
The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era. CoRR abs/2601.05564 (2026)
[i29]Guobin Ma, Yuxuan Xia, Jixun Yao, Huixin Xue, Hexin Liu, Shuai Wang, Hao Liu, Lei Xie:
The ICASSP 2026 Automatic Song Aesthetics Evaluation Challenge. CoRR abs/2601.07237 (2026)
[i28]Bingshen Mu, Xian Shi, Xiong Wang, Hexin Liu, Jin Xu, Lei Xie:
LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech. CoRR abs/2601.18220 (2026)
[i27]Shreyas Gopal, Donghang Wu, Ashutosh Anshul, Yue Heng Yeo, Yizhou Peng, Haoyang Li, Hexin Liu, Eng Siong Chng:
Language-Aware Distillation for Multilingual Instruction-Following Speech LLMs with ASR-Only Supervision. CoRR abs/2603.07025 (2026)
[i26]Donghang Wu, Tianyu Zhang, Yuxin Li, Hexin Liu, Chen Chen, Eng Siong Chng, Yoshua Bengio:
The Silent Thought: Modeling Internal Cognition in Full-Duplex Spoken Dialogue Models via Latent Reasoning. CoRR abs/2603.17837 (2026)
- 2025
[j4]Xinyuan Qian, Jiaran Gao, Yaodan Zhang, Qiquan Zhang, Hexin Liu, Leibny Paola García-Perera, Haizhou Li:
SAV-SE: Scene-Aware Audio-Visual Speech Enhancement With Selective State Space Model. IEEE J. Sel. Top. Signal Process. 19(4): 623-634 (2025)
[j3]Moran Chen, Qiquan Zhang, Mingjiang Wang, Xiangyu Zhang, Hexin Liu, Eliathamby Ambikairajah, Deying Chen:
Selective State Space Model for Monaural Speech Enhancement. IEEE Trans. Consumer Electron. 71(2): 5414-5424 (2025)
[c22]Xiangyu Zhang, Hexin Liu, Qiquan Zhang, Beena Ahmed, Julien Epps:
SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information. ACL (Findings) 2025: 10019-10030
[c21]Yue Heng Yeo, Yuchen Hu, Shreyas Gopal, Yizhou Peng, Hexin Liu, Eng Siong Chng:
Improving Code-Switching Speech Recognition with TTS Data Augmentation. APSIPA 2025: 992-997
[c20]Shreyas Gopal, Ashutosh Anshul, Haoyang Li, Yue Heng Yeo, Hexin Liu, Eng Siong Chng:
Explainable Disentanglement on Discrete Speech Representations for Noise-Robust ASR. APSIPA 2025: 2535-2540
[c19]Jiahui Zhao, Hao Shi, Chenrui Cui, Tianrui Wang, Hexin Liu, Zhaoheng Ni, Lingxuan Ye, Longbiao Wang:
Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding. ICASSP 2025: 1-5
[c18]Jixun Yao, Hexin Liu, Chen Chen, Yuchen Hu, Eng Siong Chng, Lei Xie:
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling. ICLR 2025
[c17]Jixun Yao, Hexin Liu, Eng Siong Chng, Lei Xie:
EASY: Emotion-aware Speaker Anonymization via Factorized Distillation. INTERSPEECH 2025
[c16]Hongfei Xue, Yufeng Tang, Hexin Liu, Jun Zhang, Xuelong Geng, Lei Xie:
Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning. ACM Multimedia 2025: 10984-10993
[c15]Qiquan Zhang, Moran Chen, Zeyang Song, Hexin Liu, Xiangyu Zhang, Haizhou Li:
Long-Context Modeling Networks for Monaural Speech Enhancement: A Comparative Study. WASPAA 2025: 1-5
[i25]Jixun Yao, Hexin Liu, Chen Chen, Yuchen Hu, Chng Eng Siong, Lei Xie:
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling. CoRR abs/2502.02942 (2025)
[i24]Hongfei Xue, Yufeng Tang, Hexin Liu, Jun Zhang, Xuelong Geng, Lei Xie:
Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning. CoRR abs/2504.20835 (2025)
[i23]Jixun Yao, Hexin Liu, Eng Siong Chng, Lei Xie:
EASY: Emotion-aware Speaker Anonymization via Factorized Distillation. CoRR abs/2505.15004 (2025)
[i22]Haoyang Zhang, Hexin Liu, Xiangyu Zhang, Qiquan Zhang, Yuchen Hu, Junqi Zhao, Fei Tian, Xuerui Yang, Eng Siong Chng:
Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English. CoRR abs/2505.17076 (2025)
[i21]Yizhou Peng, Bin Wang, Yi-Wen Chao, Ziyang Ma, Haoyang Zhang, Hexin Liu, Xie Chen, Eng Siong Chng:
NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025. CoRR abs/2506.13339 (2025)
[i20]Yizhou Peng, Hexin Liu, Eng Siong Chng:
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR. CoRR abs/2506.13396 (2025)
[i19]Bingshen Mu, Hexin Liu, Hongfei Xue, Kun Wei, Lei Xie:
Hearing More with Less: Multi-Modal Retrieval-and-Selection Augmented Conversational LLM-Based ASR. CoRR abs/2508.01166 (2025)
[i18]Bingshen Mu, Pengcheng Guo, Zhaokai Sun, Shuai Wang, Hexin Liu, Mingchen Shao, Lei Xie, Eng Siong Chng, Longshuai Xiao, Qiangze Feng, Daliang Wang:
Summary on The Multilingual Conversational Speech Language Model Challenge: Datasets, Tasks, Baselines, and Methods. CoRR abs/2509.13785 (2025)
[i17]Donghang Wu, Haoyang Zhang, Chen Chen, Tianyu Zhang, Fei Tian, Xuerui Yang, Gang Yu, Hexin Liu, Nana Hou, Yuchen Hu, Eng Siong Chng:
Chronological Thinking in Full-Duplex Spoken Dialogue Language Models. CoRR abs/2510.05150 (2025)
[i16]Donghang Wu, Haoyang Zhang, Jun Chen, Xiangyu Tony Zhang, Hexin Liu, Eng Siong Chng, Fei Tian, Xuerui Yang, Xiangyu Zhang, Daxin Jiang, Gang Yu:
Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models. CoRR abs/2510.09592 (2025)
[i15]Shreyas Gopal, Ashutosh Anshul, Haoyang Li, Yue Heng Yeo, Hexin Liu, Eng Siong Chng:
Explainable Disentanglement on Discrete Speech Representations for Noise-Robust ASR. CoRR abs/2510.25150 (2025)
[i14]Fei Tian, Xiangyu Tony Zhang, Yuxin Zhang, Haoyang Zhang, Yuxin Li, Daijiao Liu, Yayue Deng, Donghang Wu, Jun Chen, Liang Zhao, Chengyuan Yao, Hexin Liu, Eng Siong Chng, Xuerui Yang, Xiangyu Zhang, Daxin Jiang, Gang Yu:
Step-Audio-R1 Technical Report. CoRR abs/2511.15848 (2025)
- 2024
[c14]Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps:
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection. EMNLP 2024: 146-158
[c13]Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola García-Perera, Engsiong Chng, Lina Yao:
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model. EMNLP 2024: 159-171
[c12]Ruixing Liang, Xiangyu Zhang, Qiong Li, Lai Wei, Hexin Liu, Avisha Kumar, Kelley M. Kempski Leadingham, Joshua Punnoose, Leibny Paola García, Amir Manbachi:
Unidirectional Brain-Computer Interface: Artificial Neural Network Encoding Natural Images to FMRI Response in the Visual Cortex. ICASSP 2024: 1851-1855
[c11]Hexin Liu, Leibny Paola García, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur:
Enhancing Code-Switching Speech Recognition With Interactive Language Biases. ICASSP 2024: 10886-10890
[c10]Yujia Wang, Hexin Liu, Leibny Paola García:
Bridging Child-Centered Speech Language Identification and Language Diarization via Phonetics. INTERSPEECH 2024
[i13]Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola García, Eng Siong Chng, Lina Yao:
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model. CoRR abs/2402.10642 (2024)
[i12]Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps:
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection. CoRR abs/2402.13276 (2024)
[i11]Xiangyu Zhang, Qiquan Zhang, Hexin Liu, Tianyi Xiao, Xinyuan Qian, Beena Ahmed, Eliathamby Ambikairajah, Haizhou Li, Julien Epps:
Mamba in Speech: Towards an Alternative to Self-Attention. CoRR abs/2405.12609 (2024)
[i10]Xinyuan Qian, Jiaran Gao, Yaodan Zhang, Qiquan Zhang, Hexin Liu, Leibny Paola García, Haizhou Li:
SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model. CoRR abs/2411.07751 (2024)
[i9]Jiahui Zhao, Hao Shi, Chenrui Cui, Tianrui Wang, Hexin Liu, Zhaoheng Ni, Lingxuan Ye, Longbiao Wang:
Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding. CoRR abs/2412.16507 (2024)
- 2023
[c9]Shuyue Stella Li, Xiangyu Zhang, Shu Zhou, Hongchao Shu, Ruixing Liang, Hexin Liu, Leibny Paola García:
PQLM - Multilingual Decentralized Portable Quantum Language Model. ICASSP 2023: 1-5
[c8]Hexin Liu, Haihua Xu, Leibny Paola García, Andy W. H. Khong, Yi He, Sanjeev Khudanpur:
Reducing Language Confusion for Code-Switching Speech Recognition with Token-Level Language Diarization. ICASSP 2023: 1-5
[c7]Shuyue Stella Li, Beining Xu, Xiangyu Zhang, Hexin Liu, Wenhan Chao, Paola García:
A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extracters. ICNLSP 2023: 200-211
[c6]Rui Li, Zhiwei Xie, Haihua Xu, Yizhou Peng, Hexin Liu, Hao Huang, Eng Siong Chng:
Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory. INTERSPEECH 2023: 1968-1972
[c5]Yi Han Victoria Chua, Hexin Liu, Leibny Paola García, Fei Ting Woon, Jinyi Wong, Xiangyu Zhang, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles:
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization. INTERSPEECH 2023: 4109-4113
[c4]Suzy J. Styles, Yi Han Victoria Chua, Fei Ting Woon, Hexin Liu, Leibny Paola García, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels:
Investigating model performance in language identification: beyond simple error statistics. INTERSPEECH 2023: 4129-4133
[i8]Suzy J. Styles, Yi Han Victoria Chua, Fei Ting Woon, Hexin Liu, Leibny Paola García-Perera, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels:
Investigating model performance in language identification: beyond simple error statistics. CoRR abs/2305.18925 (2023)
[i7]Ruixing Liang, Xiangyu Zhang, Qiong Li, Lai Wei, Hexin Liu, Avisha Kumar, Kelley M. Kempski Leadingham, Joshua Punnoose, Leibny Paola García, Amir Manbachi:
Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex. CoRR abs/2309.15018 (2023)
[i6]Hexin Liu, Leibny Paola García, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur:
Enhancing Code-switching Speech Recognition with Interactive Language Biases. CoRR abs/2309.16953 (2023)
[i5]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng:
Generative error correction for code-switching speech recognition using large language models. CoRR abs/2310.13013 (2023)
[i4]Shuyue Stella Li, Beining Xu, Xiangyu Zhang, Hexin Liu, Wenhan Chao, Leibny Paola García:
A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors. CoRR abs/2311.15954 (2023)
- 2022
[j2]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Eng Siong Chng, Suzy J. Styles, Sanjeev Khudanpur:
Efficient Self-Supervised Learning Representations for Spoken Language Identification. IEEE J. Sel. Top. Signal Process. 16(6): 1296-1307 (2022)
[c3]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Suzy J. Styles, Sanjeev Khudanpur:
PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification. INTERSPEECH 2022: 2233-2237
[c2]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur:
Enhancing Language Identification Using Dual-Mode Model with Knowledge Distillation. Odyssey 2022: 248-254
[i3]Hexin Liu, Leibny Paola García-Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur:
Enhance Language Identification using Dual-mode Model with Knowledge Distillation. CoRR abs/2203.03218 (2022)
[i2]Shuyue Stella Li, Xiangyu Zhang, Shu Zhou, Hongchao Shu, Ruixing Liang, Hexin Liu, Leibny Paola García-Perera:
PQLM - Multilingual Decentralized Portable Quantum Language Model for Privacy Protection. CoRR abs/2210.03221 (2022)
[i1]Hexin Liu, Haihua Xu, Leibny Paola García, Andy W. H. Khong, Yi He, Sanjeev Khudanpur:
Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization. CoRR abs/2210.14567 (2022)
- 2021
[c1]Hexin Liu, Leibny Paola García-Perera, Xinyi Zhang, Justin Dauwels, Andy W. H. Khong, Sanjeev Khudanpur, Suzy J. Styles:
End-to-End Language Diarization for Bilingual Code-Switching Speech. Interspeech 2021: 1489-1493
2010 – 2019
- 2013
[j1]Trevor J. Hall, Ramón Maldonado-Basilio, Sawsan Abdul-Majid, Joe Seregelyi, Ran Li, Irene Antolín-Pérez, Hamdam Nikkhah, Frédéric Lucarz, Jean-Louis de Bougrenet de la Tocnaye, Bruno Fracasso, Patrice Pajusco, Camilla Kärnfelt, Daniel Bourreau, Michel M. Ney, Rabiaa Guemri, Yves Josse, Hexin Liu:
Radio-over-Fibre access for sustainable Digital Cities. Ann. des Télécommunications 68(1-2): 3-21 (2013)
last updated on 2026-04-19 22:08 CEST by the dblp team
all metadata released as open data under CC0 1.0 license