🧑‍🎨 About Me

I am a final-year PhD student in Computer Science at the Beijing Jiaotong University ADaM Lab, where I am advised by Prof. Jitao Sang </a>. Prior to starting my PhD, I obtained a Master’s degree with a major in Computer Technology from Beijing Jiaotong University and a Bachelor’s degree with a major in Software Engineering from Xiangtan University.

My research interest includes multimodal large language model and reliable machine learning.

📝 Publications

sym

Look before Transcription: End-to-End SlideASR with Visually-Anchored Policy Optimization

Rui Hu, Delai Qiu, Yining Wang, Shengping Liu, Jitao Sang

ArXiv Preprint, 2025.

[Preprint] [Paper] [Code]

sym

Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models

Rui Hu, Delai Qiu, Shuyu Wei, Jiaming Zhang, Yining Wang, Shengping Liu, Jitao Sang

Annual Meeting of the Association for Computational Linguistics (ACL), Findings, 2025.

[Conference] [Paper]

sym

ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models

Yahan Tu, Rui Hu, Jitao Sang

IEEE Conference on Computer Vision and Pattern Recognition Conference (CVPR), 2025.

[Conference] [Paper]

sym

Prescribing the Right Remedy: Mitigating Hallucinations in Large Vision-Language Models via Targeted Instruction Tuning

Rui Hu, Yahan Tu, Shuyu Wei, Dongyuan Lu, Jitao Sang

Information Sciences (INS), 2025.

[Journal] [Paper]

sym

Echoes: Unsupervised Debiasing via Pseudo-bias Labeling in an Echo Chamber

Rui Hu, Yahan Tu, Jitao Sang

ACM International Conference on Multimedia (MM), Oral, 2023.

[Conference] [Paper] [Code]