I am a second-year PhD stduent at the SEUOneCoLab in SouthEast University advised by Prof. Wankou
Yang. Currently, my research focuses on visual grounding and multi-modal large language model.
Publications
* denotes equal contribution and † corresponding author
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
AAAI 2025
Ming Dai, Jian Li, Jiedong Zhuang, Xian Zhang, Wankou Yang†