Qilang Ye (叶启朗)
Research AssistantSchool of Computing and Information Technology, Great Bay University Email: rikeilong[AT]gmail.com [Google Scholar] [GitHub] |
![]() |
Previously, I was a research assistant at YUV group, Great Bay University, supervised by Prof. Zitong Yu. My research interests include Multimodal Learning, and Multimodal Large Language Models. Recently, I am working on alignment learning to optimize LLMs outputs, e.g. audio-visual hallucinations, ambiguity , etc.
![]() |
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios Qilang Ye, Zitong Yu, Rui Shao, Xinyu Xie, Philip Torr, Xiaochun CaoEuropean Conference on Computer Vision (ECCV), 2024. [paper] [Code] |
| |
![]() |
Pose-promote: Progressive Visual Perception for Activities of Daily Living Qilang Ye, Zitong YuIEEE Signal Processing Letters (IEEE SPL) [paper] [Code] |
| |
![]() |
3sG: Three-stage Guidance for Indoor Human Action Recognition Hai Nan*, Qilang Ye*, Zitong Yu, Kang AnIET Image Processing [paper] [Code] |
| |
![]() |
Answering Diverse Questions via Text Attached with Key Audio-Visual Clues Qilang Ye, Zitong Yu, Xin LiuPreprint [arXiv] [Code] |
| |
![]() |
一种基于人体骨架的任意角度坐姿识别方法 Qilang Ye, Hai Nan, Daixin Li中文核心 [paper] |
|