I am an Assistant Professor in the Department of Electrical and Electronic Engineering and the Institute of Data Science (IDS) at The University of Hong Kong. Before joining HKU, I was a postdoctoral scholar in the EECS Department and BAIR at UC Berkeley, advised by Prof. Trevor Darrell. I obtained my Ph.D. degree from the Multimedia Lab (MMLab) at the Chinese University of Hong Kong, supervised by Prof. Xiaogang Wang and Prof. Hongsheng Li. I received my bachelor's degree in Electronic Engineering from Tsinghua University.

My research interests cover computer vision, deep learning, and artificial intelligence, with special emphasis on generative models and multimodal AI. I am also interested in their applications in embodied AI and AI for Science. I was awarded the Adobe Research Fellowship 2020, EECS Rising Stars 2021, and the WAIC Rising Stars Award 2022.

I am actively looking for self-motivated Ph.D. students, postdoctoral scholars, research assistants, and visiting students to join my group. Please drop me an email if you are interested. Eligible students can apply for the Hong Kong PhD Fellowship Scheme (HKPFS) and the HKU Presidential PhD Scholar Programme (HKU-PS). Postgraduate Scholarships (PGS) are granted to other students without HKPFS or HKU-PS. Due to the large number of emails I receive, I cannot reply to all of them, but I do read every email and reply to those I am interested in. There is no need to send duplicate emails.


News


Selected Publications

Full list here and Google Scholar


ECCV
Empowering 3D Visual Grounding with Reasoning Capabilities
Chenming Zhu, Tai Wang, Wenwei Zhang, Kai Chen, Xihui Liu
ECCV 2024
[Paper] [Project Page] [Code] [Data]

ECCV
TC4D: Trajectory-Conditioned Text-to-4D Generation
Sherwin Bahmani*, Xian Liu*, Yifan Wang*, Ivan Skorokhodov, Victor Rong, Ziwei Liu, Xihui Liu, Jeong Joon Park, Sergey Tulyakov, Gordon Wetzstein, Andrea Tagliasacchi, David B. Lindell
ECCV 2024
[Paper] [Project Page] [Code] [Hugging Face Daily Papers]

ECCV
PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
ZiDong Wang, Zeyu Lu, Di Huang, Tong He, Xihui Liu, Wanli Ouyang, Lei Bai
ECCV 2024
Coming soon!

EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
Yi Chen, Yuying Ge, Yixiao Ge, Mingyu Ding, Bohao Li, Rui Wang, Ruifeng Xu, Ying Shan, Xihui Liu
arXiv 2024
[Paper] [Project Page] [Code] [Challenge] [Data] [Leaderboard]

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
Yao Teng, Yue Wu, Han Shi, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu
arXiv 2024
[Paper] [Code] [Hugging Face Daily Papers]

4Diffusion: Multi-view Video Diffusion Model for 4D Generation
Haiyu Zhang, Xinyuan Chen, Yaohui Wang, Xihui Liu, Yunhong Wang, Yu Qiao
arXiv 2024
[Paper] [Project Page] [Code] [Hugging Face Daily Papers]

Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation
Zhenyu Wang, Enze Xie, Aoxue Li, Zhongdao Wang, Xihui Liu, Zhenguo Li
arXiv 2024
[Paper] [Code] [Hugging Face Daily Papers]

ICML
FiT: Flexible Vision Transformer for Diffusion Model
Zeyu Lu, Zidong Wang, Di Huang, Chengyue Wu, Xihui Liu, Wanli Ouyang, Lei Bai
ICML 2024
[Paper] [Code] [Hugging Face Daily Papers]

CVPR
DreamComposer: Controllable 3D Object Generation via Multi-View Conditions
Yunhan Yang*, Yukun Huang*, Xiaoyang Wu, Yuan-Chen Guo, Song-Hai Zhang, Hengshuang Zhao, Tong He, Xihui Liu
CVPR 2024
[Paper] [Project Page] [Code] [Hugging Face Daily Papers]

CVPR Highlight
HumanGaussian: Text-driven 3D Human Generation with Gaussian Splatting
Xian Liu, Xiaohang Zhan, Jiaxiang Tang, Ying Shan, Gang Zeng, Dahua Lin, Xihui Liu, Ziwei Liu
CVPR 2024 Highlight
[Paper] [Project Page] [Code] [Video]

CVPR
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
Tai Wang*, Xiaohan Mao*, Chenming Zhu*, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang
CVPR 2024
[Paper] [Project Page] [Code] [Data]

CVPR Oral
Point Transformer V3: Simpler, Faster, Stronger
Xiaoyang Wu, Li Jiang, Peng-Shuai Wang, Zhijian Liu, Xihui Liu, Yu Qiao, Wanli Ouyang, Tong He, Hengshuang Zhao
CVPR 2024 Oral
[Paper] [Code] [Hugging Face Daily Papers]

CVPR
Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training
Xiaoyang Wu, Zhuotao Tian, Xin Wen, Bohao Peng, Xihui Liu, Kaicheng Yu, Hengshuang Zhao
CVPR 2024
[Paper] [Code]

ICLR
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu, Jian Ren, Aliaksandr Siarohin, Ivan Skorokhodov, Yanyu Li, Dahua Lin, Xihui Liu, Ziwei Liu, Sergey Tulyakov
ICLR 2024
[Paper] [Project Page] [Code] [Hugging Face Daily Papers] [Short video] [Long video]

NeurIPS
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
Kaiyi Huang, Kaiyue Sun, Enze Xie, Zhenguo Li, Xihui Liu
NeurIPS 2023
[Paper] [Project Page] [Code] [Data] [Hugging Face Daily Papers] [T2I-CompBench++]

NeurIPS
OV-PARTS: Towards Open-Vocabulary Part Segmentation
Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang
NeurIPS 2023
[Paper] [Code] [Data] [Challenge]

NeurIPS
Seeing is not always believing: A Quantitative Study on Human Perception of AI-Generated Images
Zeyu Lu*, Di Huang*, Lei Bai*, Jingjing Qu, Chengyue Wu, Xihui Liu, Wanli Ouyang
NeurIPS 2023
[Paper] [Project Page] [Data]

NeurIPS
CorresNeRF: Image Correspondence Priors for Neural Radiance Fields
Yixing Lao, Xiaogang Xu, Zhipeng Cai, Xihui Liu, Hengshuang Zhao
NeurIPS 2023
[Paper] [Project Page] [Code]

ICCV
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji*, Zhe Chen*, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo
ICCV 2023
[Paper] [Code]

CVPR
Back to the Source: Diffusion-Driven Test-Time Adaptation
Jin Gao*, Jialing Zhang*, Xihui Liu, Trevor Darrell, Evan Shelhamer, Dequan Wang
CVPR 2023
[Paper] [Code]

CVPR
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge
Ziyun Zeng*, Yuying Ge*, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge
CVPR 2023
[Paper] [Code]

CVPR
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning
Xiaoyang Wu, Xin Wen, Xihui Liu, Hengshuang Zhao
CVPR 2023
[Paper] [Code]

CVPR
RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer
Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin
CVPR 2023
[Paper] [Project Page] [Code]

CVPR
GLeaD: Improving GANs with A Generator-Leading Task
Qingyan Bai, Ceyuan Yang, Yinghao Xu, Xihui Liu, Yujiu Yang, Yujun Shen
CVPR 2023
[Paper] [Project Page] [Code]

NeurIPS
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
Xiaoyang Wu, Yixing Lao, Li Jiang, Xihui Liu, Hengshuang Zhao
NeurIPS 2022
[Paper] [Code]

ECCV
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo
ECCV 2022
[Paper] [Code]

CVPR Oral
Bridging Video-text Retrieval with Multiple Choice Questions
Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo
CVPR 2022 Oral
[Paper] [Project Page] [Code]

NeurIPS
Benchmark for Compositional Text-to-Image Synthesis
Dong Huk Park, Samaneh Azadi, Xihui Liu, Trevor Darrell, Anna Rohrbach
NeurIPS Datasets and Benchmarks 2021
[Paper] [Code] [Data]

ECCV
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
Xihui Liu, Zhe Lin, Jianming Zhang, Handong Zhao, Quan Tran, Xiaogang Wang, Hongsheng Li
ECCV 2020
[Paper] [Code] [Video] [Slides]

NeurIPS
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis
Xihui Liu, Guojun Yin, Jing Shao, Xiaogang Wang, Hongsheng Li
NeurIPS 2019
[Paper] [Code] [Slides]

ICCV
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
Zihao Wang*, Xihui Liu*, Hongsheng Li, Lu Sheng, Junjie Yan, Xiaogang Wang, Jing Shao
ICCV 2019
[Paper] [Code]

CVPR
Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing
Xihui Liu, Zihao Wang, Jing Shao, Xiaogang Wang, Hongsheng Li
CVPR 2019
[Paper] [Code]

ECCV
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu, Hongsheng Li, Jing Shao, Dapeng Chen, Xiaogang Wang
ECCV 2018
[Paper]

ICCV
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
Xihui Liu*, Haiyu Zhao*, Maoqing Tian, Lu Sheng, Jing Shao, Shuai Yi, Junjie Yan, Xiaogang Wang
ICCV 2017
[Paper] [Project Page] [Code] [Data]