Hi there! I am Xinlong Chen (陈鑫龙), a Ph.D. student at NLPR, CASIA, where I am fortunate to be advised by Prof. Tieniu Tan and co-advised by Prof. Qiang Liu. My research focuses on the training and application of MLLMs, with particular interests in video understanding and hallucination mitigation.
Currently, I am a research intern at Kling Team, Kuaishou Technology, under the guidance of Yuanxing Zhang and Weihong Lin.
I am always open to research discussions and collaboration opportunities——feel free to reach out! 😁
📝 Selected Publications (Full List)
Video Understanding
-
[ECCV 2026] | DiaDem: Advancing Dialogue Descriptions in Audiovisual Video Captioning for Multimodal Large Language Models
-
[ICLR 2026] | AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration
-
[ICLR 2026] | VidBridge-R1: Bridging QA and Captioning for RL-based Video Understanding Models with Intermediate Proxy Tasks
-
[Findings of ACL 2025] | VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation
Hallucination Mitigation
-
[EMNLP 2025] | Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models
-
[Findings of ACL 2025] | Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language Models
📖 Education
- 2025.09 - 2030.06 (expected), Ph.D. Student in AI, New Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
- Supervisor: Prof. Tieniu Tan and Prof. Qiang Liu
- 2021.09 - 2025.06, B.Eng. in AI, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology
- Rank: 1 / 100 | Average Grade: 93.67 / 100
- National Scholarship (2022, 2023, 2024)
- Finalist in the Mathematical Contest in Modeling (Top 1%), serving as Team Leader
💻 Internships
- 2024.11 - Present, Kling Team, Kuaishou Technology
- Mentor: Yuanxing Zhang and Weihong Lin
- Focus: Multimodal understanding
