🧑‍🎨 About Me

I am a final-year PhD student in Computer Science in the ADaM Lab at Beijing Jiaotong University, where I am advised by Prof. Jitao Sang. Before starting my PhD, I obtained a Master’s degree in Computer Technology from Beijing Jiaotong University and a Bachelor’s degree in Software Engineering from Xiangtan University.

My research interests include multimodal large language models and reliable machine learning.

📆 News

2026/04 Two papers accepted to ACL 2026!

2025/05 One paper accepted to ACL 2025!

2025/05 One paper accepted to Information Sciences!

2025/01 One paper accepted to CVPR 2025!

2023/07 One paper accepted to ACM MM 2023 (Oral)!

📝 Publications

VAPO: End-to-end Slide-Enhanced Speech Recognition with Omni-modal Large Language Models

Rui Hu, Delai Qiu, Yining Wang, Shengping Liu, Jitao Sang*

Annual Meeting of the Association for Computational Linguistics (ACL), Main Conference, 2026.

[Conference] [Paper] [Code]

Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models

Rui Hu, Delai Qiu, Shuyu Wei, Jiaming Zhang, Yining Wang, Shengping Liu*, Jitao Sang*

Annual Meeting of the Association for Computational Linguistics (ACL), Findings, 2025.

[Conference] [Paper]

ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models

Yahan Tu, Rui Hu, Jitao Sang*

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025.

[Conference] [Paper]

Prescribing the Right Remedy: Mitigating Hallucinations in Large Vision-Language Models via Targeted Instruction Tuning

Rui Hu, Yahan Tu, Shuyu Wei, Dongyuan Lu*, Jitao Sang

Information Sciences (INS), 2025.

[Journal] [Paper]

Echoes: Unsupervised Debiasing via Pseudo-bias Labeling in an Echo Chamber

Rui Hu, Yahan Tu, Jitao Sang*

ACM International Conference on Multimedia (MM), 2023 (Oral).

[Conference] [Paper] [Code]