I am now working on robust and controllable end-to-end speech recognition and multimodal large language models. I am a Ph.D. candidate in Software Engineering at Soochow University (苏州大学), where I am advised by Prof. Zhenghua Li.
My research interests have evolved over time: I started with semantic role labeling and syntax-based language understanding, moved on to context-aware automatic speech recognition, and now focus on multimodal large language models. I have published papers at top international AI conferences such as ACL, AAAI, and COLING, and received the Best Long Paper Award at COLING 2022.
🔥 News
- 2025.07: 🎉🎉 Improving Contextual ASR via Multi-grained Fusion with Large Language Models is now available on arXiv (arXiv:2507.12252).
- 2025.07: 🎉🎉 Nexus: An Omni-Perceptive And-Interactive Model for Language, Audio, And Vision has been accepted by ACM MM 2025.
- 2025.04: 🎉🎉 Recording for eyes, not echoing to ears: Contextualized spoken-to-written conversion of ASR transcripts has been accepted by AAAI 2025.
- 2024.08: 🎉🎉 Two papers have been accepted by ACL 2024: (1) CopyNE: Better Contextual ASR by Copying Named Entities; (2) Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and Approaches.
- 2022.10: 🎉🎉 Fast and Accurate End-to-End Span-based Semantic Role Labeling as Word-based Graph Parsing has received the Best Long Paper Award at COLING 2022.
📝 Publications
- Improving Contextual ASR via Multi-grained Fusion with Large Language Models. arXiv:2507.12252. Shilin Zhou, Zhenghua Li
- Nexus: An Omni-Perceptive And-Interactive Model for Language, Audio, And Vision. ACM MM 2025. Che Liu, Yingji Zhang, Dong Zhang, Weijie Zhang, Chenggong Gong, Haohan Li, Yu Lu, Shilin Zhou, Yue Lu, Ziliang Gan, Ziao Wang, Junwei Liao, Haipang Wu, Ji Liu, André Freitas, Qifan Wang, Zenglin Xu, Rongjuncheng Zhang, Yong Dai
- Recording for eyes, not echoing to ears: Contextualized spoken-to-written conversion of ASR transcripts. AAAI 2025. Jiaqing Liu, Chong Deng, Qinglin Zhang, Shilin Zhou, Qian Chen, Hai Yu, Wen Wang
- CopyNE: Better Contextual ASR by Copying Named Entities. ACL 2024. Shilin Zhou, Zhenghua Li, Yu Hong, Min Zhang, Zhefeng Wang, Baoxing Huai
- Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and Approaches. ACL 2024. Shilin Zhou, Zhenghua Li, Chen Gong, Lei Zhang, Yu Hong, Min Zhang
- Fast and Accurate End-to-End Span-based Semantic Role Labeling as Word-based Graph Parsing. COLING 2022 (Best Long Paper Award). Shilin Zhou, Qingrong Xia, Zhenghua Li, Yu Zhang, Yu Hong, Min Zhang
- Semantic role labeling as dependency parsing: Exploring latent tree structures inside arguments. COLING 2022. Yu Zhang, Qingrong Xia, Shilin Zhou, Yong Jiang, Guohong Fu, Min Zhang
🎖 Honors and Awards
- 2022.10: Best Long Paper Award at COLING 2022
📖 Education
- 2016.09 - 2020.06: Undergraduate, Soochow University, China.
- 2020.09 - present: Ph.D., Soochow University, China.
💻 Internships
- 2024.08 - 2025.01: Intern, Shanghai AI Lab, China.