Qilang Ye (叶启朗)  

Research Assistant


School of Computing and Information Technology,

Great Bay University

Email: rikeilong[AT]gmail.com

[Google Scholar] [GitHub]


About Me

Previously, I was a research assistant at YUV group, Great Bay University, supervised by Prof. Zitong Yu. My research interests include Multimodal Learning, and Multimodal Large Language Models. Recently, I am working on alignment learning to optimize LLMs outputs, e.g. audio-visual hallucinations, ambiguity , etc.

Publications (* co-first authors)

CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios

Qilang Ye, Zitong Yu, Rui Shao, Xinyu Xie, Philip Torr, Xiaochun Cao

European Conference on Computer Vision (ECCV), 2024.

[paper] [Code]

Pose-promote: Progressive Visual Perception for Activities of Daily Living

Qilang Ye, Zitong Yu

IEEE Signal Processing Letters (IEEE SPL)

[paper] [Code]

3sG: Three-stage Guidance for Indoor Human Action Recognition

Hai Nan*, Qilang Ye*, Zitong Yu, Kang An

IET Image Processing

[paper] [Code]

Answering Diverse Questions via Text Attached with Key Audio-Visual Clues

Qilang Ye, Zitong Yu, Xin Liu

Preprint

[arXiv] [Code]

一种基于人体骨架的任意角度坐姿识别方法

Qilang Ye, Hai Nan, Daixin Li

中文核心

[paper]