I have been a senior tech lead manager at Huya Inc. since August 2019. Before that, I spent one wonderful year at Malong Technologies as a research scientist.
I obtained my Ph.D. degree from the Department of Electrical and Computer Engineering at the University of Maryland, College Park, under the supervision of Prof. Larry S. Davis. Before that, I received my B.S. degree from Shanghai Jiao Tong University in China, advised by Prof. Weiyao Lin.
I am looking for highly motivated researchers, engineers, and interns to work on exciting computer vision and graphics projects in Shenzhen/Guangzhou. If you are interested, please send me an email.
Email: xintong@umd.edu; hanxintong@huya.com
News
[Mar. 2023] Cloth4D accepted by CVPR 2023.
[Mar. 2023] XFormer accepted by IJCAI 2023.
[Jan. 2023] MotionFormer accepted by ICLR 2023.
[Oct. 2022] FFCLIP accepted as a spotlight paper by NeurIPS 2022.
[Mar. 2022] One paper accepted by CVPR 2022.
[Oct. 2021] One paper accepted by NeurIPS 2021.
Projects
HDR and Voice Separation of LOL S12 [media]
Virtual Streamer Animation [media]
Anime Talking Heads [media]
Several AI effects in the LicoLico app
Publications
CLOTH4D: A Dataset for Clothed Human Reconstruction.
Xingxing Zou, Xintong Han, Waikeung Wong
Conference on Computer Vision and Pattern Recognition (CVPR), 2023. [pdf] [dataset]
XFormer: Fast and Accurate Monocular 3D Body Capture.
Lihui Qian, Xintong Han, Faqiang Wang, Hongyu Liu, Haoye Dong, Zhiwen Li, Huawei Wei, Zhe Lin, Cheng-Bin Jin
International Joint Conference on Artificial Intelligence (IJCAI), 2023. [pdf]
CoverHunter: Cover Song Identification with Refined Attention and Alignments.
Feng Liu, Deyi Tuo, Yinan Xu, Xintong Han
International Conference on Multimedia and Expo (ICME), 2023. [pdf] [code]
Human MotionFormer: Transferring Human Motions with Vision Transformers.
Hongyu Liu, Xintong Han, Chenbin Jin, Lihui Qian, Huawei Wei, Zhe Lin, Faqiang Wang, Haoye Dong, Yibing Song, Jia Xu, Qifeng Chen
International Conference on Learning Representations (ICLR), 2023. [pdf]
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations.
Yiming Zhu, Hongyu Liu, Yibing Song, Ziyang Yuan, Xintong Han, Chun Yuan, Qifeng Chen, Jue Wang
Conference on Neural Information Processing Systems (NeurIPS), 2022. Spotlight. [pdf] [code]
ObjectFormer for Image Manipulation Detection and Localization.
Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang
Conference on Computer Vision and Pattern Recognition (CVPR), 2022. [pdf]
Action-guided 3D Human Motion Prediction.
Jiangxin Sun, Zihang Lin, Xintong Han, Jian-Fang Hu, Jia Xu, Wei-Shi Zheng
Conference on Neural Information Processing Systems (NeurIPS), 2021. [pdf]
Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling.
Zhichao Huang, Xintong Han, Jia Xu, Tong Zhang
Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf] [code]
PD-GAN: Probabilistic Diverse GAN for Image Inpainting.
Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao
Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf] [code]
DeFLOCNet: Deep Image Editing via Flexible Low-level Controls.
Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bing Jiang, Wei Liu
Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf] [code]
Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification.
Peixian Hong*, Tao Wu*, Ancong Wu, Xintong Han, Wei-Shi Zheng (* equal contribution)
Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf]
Learning 3D Face Reconstruction with a Pose Guidance Network.
Pengpeng Liu, Xintong Han, Michael Lyu, Irwin King, Jia Xu
Asian Conference on Computer Vision (ACCV), 2020. Oral. [pdf]
MakeItTalk: Speaker-Aware Talking Head Animation.
Yang Zhou, Dingzeyu Li, Xintong Han, Evangelos Kalogerakis, Eli Shechtman, and Jose Echevarria
SIGGRAPH Asia, 2020. [pdf]
iFAN: Image-Instance Full Alignment Networks for Adaptive Object Detection.
Chenfan Zhuang, Xintong Han, Weilin Huang, and Matthew R. Scott
AAAI Conference on Artificial Intelligence (AAAI), 2020. [pdf]
Channel Interaction Networks for Fine-Grained Image Categorization.
Yu Gao, Xintong Han, Weilin Huang, and Matthew R. Scott
AAAI Conference on Artificial Intelligence (AAAI), 2020. [pdf]
Generate, Segment and Refine: Towards Generic Manipulation Segmentation.
Peng Zhou, Bor-Chun Chen, Xintong Han, Mahyar Najibi, Abhinav Shrivastava, Ser-Nam Lim, and Larry S. Davis
AAAI Conference on Artificial Intelligence (AAAI), 2020. [pdf]
Compatible and Diverse Fashion Image Inpainting.
Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott and Larry S. Davis
International Conference on Computer Vision (ICCV), 2019. Oral. [pdf] [supp]
ClothFlow: A Flow-Based Model for Clothed Person Generation.
Xintong Han, Xiaojun Hu, Weilin Huang and Matthew R. Scott
International Conference on Computer Vision (ICCV), 2019. [pdf] [supp]
Multi-Similarity Loss with General Pair Weighting for Deep Metric Learning.
Xun Wang, Xintong Han, Weilin Huang, Dengke Dong, and Matthew R. Scott
Conference on Computer Vision and Pattern Recognition (CVPR), 2019. [pdf] [code]
DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation.
Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gokhan Uzunbas, Tom Goldstein, Ser-Nam Lim, and Larry Davis
European Conference on Computer Vision (ECCV), 2018. [pdf]
Learning Rich Features for Image Manipulation Detection.
Peng Zhou, Xintong Han, Vlad Morariu, and Larry Davis
Conference on Computer Vision and Pattern Recognition (CVPR), 2018. [pdf] [code]
NISP: Pruning Networks Using Neuron Importance Score Propagation.
Ruichi Yu, Ang Li, Chun-Fu Chen, Jui-Hsin Lai, Vlad Morariu, Xintong Han, Mingfei Gao, Ching-Yung Lin, and Larry Davis
Conference on Computer Vision and Pattern Recognition (CVPR), 2018. Spotlight. [pdf]
Automatic Spatially-aware Fashion Concept Discovery.
Xintong Han, Zuxuan Wu, Phoenix Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, and Larry Davis
International Conference on Computer Vision (ICCV), 2017. [pdf] [dataset]
Learning Fashion Compatibility with Bidirectional LSTMs.
Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry Davis
ACM Multimedia, 2017. Oral. [pdf] [dataset] [code]
Two-Stream Neural Networks for Tampered Face Detection.
Peng Zhou*, Xintong Han*, Vlad Morariu, and Larry Davis (* equal contribution)
Conference on Computer Vision and Pattern Recognition, Workshop on Media Forensics (CVPRW), 2017. [pdf]
Son of Zorn's Lemma: Targeted Style Transfer Using Instance-aware Semantic Segmentation.
Carlos Castillo, Soham De, Xintong Han, Bharat Singh, Abhay Kumar Yadav, and Tom Goldstein
International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. Oral. [pdf]
VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products.
Xintong Han*, Bharat Singh*, Vlad Morariu, and Larry Davis (* equal contribution)
IEEE Transactions on Multimedia (TMM), 2017. [pdf]
Presented at the WebVision workshop at CVPR 2017.
Machine Learning-based Early Termination in Prediction Block Decomposition for VP9.
Xintong Han, Yunqing Wang, Yaowu Xu, and Jim Bankoski
IS&T/SPIE Electronic Imaging, 2016. [pdf]