I am an Assistant Professor at the Department of Electrical and Electronic Engineering and the Institute of Data Science (IDS), The University of Hong Kong. Before joining HKU, I was a postdoctoral scholar in the EECS Department and BAIR at UC Berkeley, advised by Prof. Trevor Darrell. I obtained my Ph.D. degree from the Multimedia Lab (MMLab), the Chinese University of Hong Kong, supervised by Prof. Xiaogang Wang and Prof. Hongsheng Li. I received my bachelor's degree in Electronic Engineering from Tsinghua University.

My research interests cover computer vision, deep learning, and artificial intelligence, with special emphasis on generative models and multimodal AI. I am also interested in their applications in embodied AI and AI for Science. I was awarded the Adobe Research Fellowship 2020, Rising Stars in EECS 2021, and the WAIC Rising Stars Award 2022. I serve as an Area Chair for CVPR 2024, ACM MultiMedia 2024, ICLR 2025, CVPR 2025, ACM MultiMedia 2025, and NeurIPS 2025. I co-organized the ICML 2024 Workshop on Multimodal Foundation Model Meets Embodied AI and the CVPR 2025 Workshop WorldModelBench: The 1st Workshop on Benchmarking World Models.

I am actively looking for self-motivated Ph.D. students, postdoctoral scholars, research assistants, and visiting students to join my group. Please drop me an email if you are interested. Eligible students can apply for the Hong Kong PhD Fellowship Scheme (HKPFS) and the HKU Presidential PhD Scholar Programme (HKU-PS). Postgraduate Scholarships (PGS) are granted to other students without HKPFS or HKU-PS. Due to the large number of emails I receive, I cannot reply to all of them, but I do read every email and reply to those I am interested in. There is no need to send duplicate emails.


Highlighted Publications (Full List of Publications and Google Scholar)


Position: Interactive Generative Video as Next-Generation Game Engine
Jiwen Yu*, Yiran Qin*, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu
arXiv 2025
[Paper]

TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Yuqing Wang, Zhijie Lin, Yao Teng, Yuanzhi Zhu, Shuhuai Ren, Jiashi Feng, Xihui Liu
arXiv 2025
[Paper] [Project Page] [Code]

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation
Tianwei Xiong, Jun Hao Liew, Zilong Huang, Jiashi Feng, Xihui Liu
arXiv 2025
[Paper] [Project Page] [Code]

HoloPart: Generative 3D Part Amodal Segmentation
Yunhan Yang, Yuan-Chen Guo, Yukun Huang, Zi-Xin Zou, Zhipeng Yu, Yangguang Li, Yan-Pei Cao, Xihui Liu
arXiv 2025
[Paper] [Project Page] [Code] [Interactive Demo]

TPAMI
DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation
Yunhan Yang*, Shuo Chen*, Yukun Huang*, Xiaoyang Wu, Yuan-Chen Guo, Edmund Y. Lam, Hengshuang Zhao, Tong He, Xihui Liu
TPAMI 2025
[Paper]

TPAMI
T2I-CompBench++: An Enhanced and Comprehensive Benchmark for Compositional Text-to-Image Generation
Kaiyi Huang, Chengqi Duan, Kaiyue Sun, Enze Xie, Zhenguo Li, Xihui Liu
TPAMI 2025
[Paper] [Project Page] [Code]

CVPR
Parallelized Autoregressive Visual Generation
Yuqing Wang, Shuhuai Ren, Zhijie Lin, Yujin Han, Haoyuan Guo, Zhenheng Yang, Difan Zou, Jiashi Feng, Xihui Liu
CVPR 2025
[Paper] [Project Page] [Code]

CVPR
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
Kaiyue Sun, Kaiyi Huang, Xian Liu, Yue Wu, Zihan Xu, Zhenguo Li, Xihui Liu
CVPR 2025
[Paper] [Project Page] [Code] [LeaderBoard]

ICLR
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
Yao Teng, Han Shi, Xian Liu, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu
ICLR 2025
[Paper] [Code]

NeurIPS Spotlight
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang, Aoxue Li, Zhenguo Li, Xihui Liu
NeurIPS 2024 Spotlight
[Paper] [Project Page] [Code]

NeurIPS
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions
Tianwei Xiong, Yuqing Wang, Daquan Zhou, Zhijie Lin, Jiashi Feng, Xihui Liu
NeurIPS 2024
[Paper] [Project Page] [Code and Dataset]

NeurIPS
BEACON: Benchmark for Comprehensive RNA Tasks and Language Models
Yuchen Ren, Zhiyuan Chen, Lifeng Qiao, Hongtai Jing, Yuchen Cai, Sheng Xu, Peng Ye, Xinzhu Ma, Siqi Sun, Hongliang Yan, Dong Yuan, Wanli Ouyang, Xihui Liu
NeurIPS 2024
[Paper] [Project Page] [Code and Dataset]

ECCV
Empowering 3D Visual Grounding with Reasoning Capabilities
Chenming Zhu, Tai Wang, Wenwei Zhang, Kai Chen, Xihui Liu
ECCV 2024
[Paper] [Project Page] [Code] [Data]

CVPR
DreamComposer: Controllable 3D Object Generation via Multi-View Conditions
Yunhan Yang*, Yukun Huang*, Xiaoyang Wu, Yuan-Chen Guo, Song-Hai Zhang, Hengshuang Zhao, Tong He, Xihui Liu
CVPR 2024
[Paper] [Project Page] [Code] [Hugging Face Daily Papers]

NeurIPS
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
Kaiyi Huang, Kaiyue Sun, Enze Xie, Zhenguo Li, Xihui Liu
NeurIPS 2023
[Paper] [Project Page] [Code] [Data] [Hugging Face Daily Papers] [T2I-CompBench++]