Xintong Han

I am a researcher at the Tencent Hunyuan 3D Generation Team. Previously, I was a Senior Tech Lead Manager at Huya Inc. (2019-2025) and a Research Scientist at Malong Technologies (2018-2019).

I obtained my Ph.D. from the University of Maryland, College Park, advised by Prof. Larry S. Davis. During my Ph.D., I completed two awesome internships at Google. Prior to that, I received my B.S. degree from Shanghai Jiao Tong University, advised by Prof. Weiyao Lin.

HIRING: Multimodal GenAI Researchers/Interns in Shenzhen/Beijing/Shanghai.

System Log

system_log.txt --lines=5
Oct 2025 Will serve as an Area Chair at CVPR 2026.
Jul 2025 Back to working on 3D stuff at Tencent Hunyuan.
Nov 2024 Code released: MotionFollower & StableAnimator.
Jun 2024 MotionEditor accepted by CVPR 2024.
Mar 2023 CLOTH4D accepted by CVPR 2023.
Jan 2023 MotionFormer accepted by ICLR 2023.
Oct 2022 FFCLIP accepted as a Spotlight by NeurIPS 2022.
Mar 2022 One paper accepted by CVPR 2022.
Oct 2021 One paper accepted by NeurIPS 2021.
Mar 2021 Four papers accepted by CVPR 2021.
Sep 2020 PGNet accepted by ACCV 2020 as an Oral.
Jul 2020 MakeItTalk accepted by SIGGRAPH Asia 2020.
Nov 2019 Three papers accepted by AAAI 2020.
Aug 2019 Joined Huya Inc. as a computer vision tech lead.
Jun 2019 One oral paper and one poster paper accepted by ICCV 2019.
Mar 2019 One paper accepted by CVPR 2019.
Jul 2018 One paper accepted by ECCV 2018.
Jun 2018 Defended my dissertation!
Feb 2018 Three papers accepted by CVPR 2018.

Selected Publications

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation
Hunyuan 3D Team
arXiv 2025
StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation
Shuyuan Tu, Yueming Pan, Yinming Huang, Xintong Han, Zhen Xing, Qi Dai, Chong Luo, Zuxuan Wu, Yu-Gang Jiang
arXiv 2025
StableAnimator: High-Quality Identity-Preserving Human Image Animation
Shuyuan Tu, Zhen Xing, Xintong Han, Zhi-Qi Cheng, Qi Dai, Chong Luo, Zuxuan Wu
CVPR 2025
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Shuyuan Tu, Qi Dai, Zihao Zhang, Sicheng Xie, Zhi-Qi Cheng, Chong Luo, Xintong Han, Zuxuan Wu, Yu-Gang Jiang
ICCV 2025
MotionEditor: Editing Video Motion via Content-aware Diffusion
Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang
CVPR 2024
CLOTH4D: A Dataset for Clothed Human Reconstruction
Xingxing Zou*, Xintong Han*, Waikeung Wong
CVPR 2023
XFormer: Fast and Accurate Monocular 3D Body Capture
Lihui Qian, Xintong Han, Faqiang Wang, Hongyu Liu, Haoye Dong, Zhiwen Li, Huawei Wei, Zhe Lin, Cheng-Bin Jin
IJCAI 2023
CoverHunter: Cover Song Identification with Refined Attention and Alignments
Feng Liu, Deyi Tuo, Yinan Xu, Xintong Han
ICME 2023
Human MotionFormer: Transferring Human Motions with Vision Transformers
Hongyu Liu, Xintong Han, Chenbin Jin, Lihui Qian, Huawei Wei, Zhe Lin, Faqiang Wang, Haoye Dong, Yibing Song, Jia Xu, Qifeng Chen
ICLR 2023
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Yiming Zhu, Hongyu Liu, Yibing Song, Ziyang Yuan, Xintong Han, Chun Yuan, Qifeng Chen, Jue Wang
NeurIPS 2022 Spotlight
ObjectFormer for Image Manipulation Detection and Localization
Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang
CVPR 2022
Action-guided 3D Human Motion Prediction
Jiangxin Sun, Zihang Lin, Xintong Han, Jian-Fang Hu, Jia Xu, Wei-Shi Zheng
NeurIPS 2021
Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling
Zhichao Huang, Xintong Han, Jia Xu, Tong Zhang
CVPR 2021
PD-GAN: Probabilistic Diverse GAN for Image Inpainting
Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao
CVPR 2021
DeFLOCNet: Deep Image Editing via Flexible Low-level Controls
Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bing Jiang, Wei Liu
CVPR 2021
Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification
Peixian Hong, Tao Wu, Ancong Wu, Xintong Han, Wei-Shi Zheng
CVPR 2021
Learning 3D Face Reconstruction with a Pose Guidance Network
Pengpeng Liu, Xintong Han, Michael Lyu, Irwin King, Jia Xu
ACCV 2020 Oral
MakeItTalk: Speaker-Aware Talking Head Animation
Yang Zhou, Dingzeyu Li, Xintong Han, Evangelos Kalogerakis, Eli Shechtman, Jose Echevarria
SIGGRAPH Asia 2020
iFAN: Image-Instance Full Alignment Networks for Adaptive Object Detection
Chenfan Zhuang, Xintong Han, Weilin Huang, Matthew R. Scott
AAAI 2020
Channel Interaction Networks for Fine-Grained Image Categorization
Yu Gao, Xintong Han, Weilin Huang, Matthew R. Scott
AAAI 2020
Generate, Segment and Refine: Towards Generic Manipulation Segmentation
Peng Zhou, Bor-Chun Chen, Xintong Han, Mahyar Najibi, Abhinav Shrivastava, Ser-Nam Lim, Larry S. Davis
AAAI 2020
Compatible and Diverse Fashion Image Inpainting
Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott, Larry S. Davis
ICCV 2019 Oral
ClothFlow: A Flow-Based Model for Clothed Person Generation
Xintong Han, Xiaojun Hu, Weilin Huang, Matthew R. Scott
ICCV 2019
Multi-Similarity Loss with General Pair Weighting for Deep Metric Learning
Xun Wang, Xintong Han, Weilin Huang, Dengke Dong, Matthew R. Scott
CVPR 2019
DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation
Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gökhan Uzunbas, Tom Goldstein, Ser Nam Lim, Larry Davis
ECCV 2018
VITON: An Image-based Virtual Try-on Network
Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, Larry Davis
CVPR 2018 Spotlight
Learning Rich Features for Image Manipulation Detection
Peng Zhou, Xintong Han, Vlad Morariu, Larry Davis
CVPR 2018
NISP: Pruning Networks Using Neuron Importance Score Propagation
Ruichi Yu, Ang Li, Chun-Fu Chen, Jui-Hsin Lai, Vlad Morariu, Xintong Han, Mingfei Gao, Ching-Yung Lin, Larry Davis
CVPR 2018 Spotlight
Automatic Spatially-aware Fashion Concept Discovery
Xintong Han, Zuxuan Wu, Phoenix Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry Davis
ICCV 2017
Learning Fashion Compatibility with Bidirectional LSTMs
Xintong Han, Zuxuan Wu, Yu-Gang Jiang, Larry Davis
ACM Multimedia 2017 Oral
Two-Stream Neural Networks for Tampered Face Detection
Peng Zhou*, Xintong Han*, Vlad Morariu, Larry Davis
CVPRW 2017
Son of Zorn's Lemma: Targeted Style Transfer Using Instance-aware Semantic Segmentation
Carlos Castillo, Soham De, Xintong Han, Bharat Singh, Abhay Kumar Yadav, Tom Goldstein
ICASSP 2017 Oral
VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products
Xintong Han*, Bharat Singh*, Vlad Morariu, Larry Davis
IEEE TMM 2017
Machine Learning-based Early Termination in Prediction Block Decomposition for VP9
Xintong Han, Yunqing Wang, Yaowu Xu, Jim Bankoski
SPIE 2016