I have been a senior tech lead manager at Huya Inc. since August 2019. Before that, I spent one wonderful year at Malong Technologies as a research scientist.

I obtained my Ph.D. degree from the Department of Electrical and Computer Engineering of the University of Maryland, College Park, under the supervision of Prof. Larry S. Davis. Prior to that, I received my B.S. degree from Shanghai Jiao Tong University in China, advised by Prof. Weiyao Lin.

I am looking for highly motivated researchers, engineers, and interns to work on exciting computer vision and graphics projects in Shenzhen/Guangzhou. If you are interested, please send me an email.

  • Email: xintong@umd.edu; hanxintong@huya.com
  • News

  • [Mar. 2023] CLOTH4D accepted by CVPR 2023.
  • [Mar. 2023] XFormer accepted by IJCAI 2023.
  • [Jan. 2023] MotionFormer accepted by ICLR 2023.
  • [Oct. 2022] FFCLIP accepted as a spotlight paper by NeurIPS 2022.
  • [Mar. 2022] One paper accepted by CVPR 2022.
  • [Oct. 2021] One paper accepted by NeurIPS 2021.
  • Projects

  • HDR and Voice Separation of LOL S12 [media]
  • Virtual Streamer Animation [media]
  • Anime Talking Heads [media]
  • Several AI effects on LicoLico APP
  • Publications

    CLOTH4D: A Dataset for Clothed Human Reconstruction.
    Xingxing Zou, Xintong Han, Waikeung Wong
    Conference on Computer Vision and Pattern Recognition (CVPR), 2023. [pdf] [dataset]
    XFormer: Fast and Accurate Monocular 3D Body Capture.
    Lihui Qian, Xintong Han, Faqiang Wang, Hongyu Liu, Haoye Dong, Zhiwen Li, Huawei Wei, Zhe Lin, Cheng-Bin Jin
    International Joint Conference on Artificial Intelligence (IJCAI), 2023. [pdf]
    CoverHunter: Cover Song Identification with Refined Attention and Alignments.
    Feng Liu, Deyi Tuo, Yinan Xu, Xintong Han
    International Conference on Multimedia and Expo (ICME), 2023. [pdf][code]
    Human MotionFormer: Transferring Human Motions with Vision Transformers.
    Hongyu Liu, Xintong Han, Chenbin Jin, Lihui Qian, Huawei Wei, Zhe Lin, Faqiang Wang, Haoye Dong, Yibing Song, Jia Xu, Qifeng Chen
    International Conference on Learning Representations (ICLR), 2023. [pdf]
    One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations.
    Yiming Zhu, Hongyu Liu, Yibing Song, Ziyang Yuan, Xintong Han, Chun Yuan, Qifeng Chen, Jue Wang
    Conference on Neural Information Processing Systems (NeurIPS), 2022. Spotlight. [pdf][code]
    ObjectFormer for Image Manipulation Detection and Localization.
    Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang
    Conference on Computer Vision and Pattern Recognition (CVPR), 2022. [pdf]
    Action-guided 3D Human Motion Prediction.
    Jiangxin Sun, Zihang Lin, Xintong Han, Jian-Fang Hu, Jia Xu, Wei-Shi Zheng
    Conference on Neural Information Processing Systems (NeurIPS), 2021. [pdf]
    Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling.
    Zhichao Huang, Xintong Han, Jia Xu, Tong Zhang
    Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf][code]
    PD-GAN: Probabilistic Diverse GAN for Image Inpainting.
    Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao
    Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf][code]
    DeFLOCNet: Deep Image Editing via Flexible Low-level Controls.
    Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bing Jiang, Wei Liu
    Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf][code]
    Fine-Grained Shape-Appearance Mutual Learning for Cloth-Changing Person Re-Identification.
    Peixian Hong*, Tao Wu*, Ancong Wu, Xintong Han, Wei-Shi Zheng
    Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf]
    Learning 3D Face Reconstruction with a Pose Guidance Network.
    Pengpeng Liu, Xintong Han, Michael Lyu, Irwin King, Jia Xu
    Asian Conference on Computer Vision (ACCV), 2020. Oral. [pdf]
    MakeItTalk: Speaker-Aware Talking Head Animation.
    Yang Zhou, Dingzeyu Li, Xintong Han, Evangelos Kalogerakis, Eli Shechtman and Jose Echevarria
    SIGGRAPH Asia, 2020. [pdf]
    iFAN: Image-Instance Full Alignment Networks for Adaptive Object Detection.
    Chenfan Zhuang, Xintong Han, Weilin Huang and Matthew R. Scott
    AAAI Conference on Artificial Intelligence (AAAI), 2020. [pdf]
    Channel Interaction Networks for Fine-Grained Image Categorization.
    Yu Gao, Xintong Han, Weilin Huang and Matthew R. Scott
    AAAI Conference on Artificial Intelligence (AAAI), 2020. [pdf]
    Generate, Segment and Refine: Towards Generic Manipulation Segmentation.
    Peng Zhou, Bor-Chun Chen, Xintong Han, Mahyar Najibi, Abhinav Shrivastava, Ser-Nam Lim and Larry S. Davis
    AAAI Conference on Artificial Intelligence (AAAI), 2020. [pdf]
    Compatible and Diverse Fashion Image Inpainting.
    Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott and Larry S. Davis
    International Conference on Computer Vision (ICCV), 2019. Oral. [pdf][supp]
    ClothFlow: A Flow-Based Model for Clothed Person Generation.
    Xintong Han, Xiaojun Hu, Weilin Huang and Matthew R. Scott
    International Conference on Computer Vision (ICCV), 2019. [pdf][supp]
    Multi-Similarity Loss with General Pair Weighting for Deep Metric Learning.
    Xun Wang, Xintong Han, Weilin Huang, Dengke Dong and Matthew R. Scott
    Conference on Computer Vision and Pattern Recognition (CVPR), 2019. [pdf][code]
    DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation.
    Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gökhan Uzunbas, Tom Goldstein, Ser-Nam Lim, and Larry Davis
    European Conference on Computer Vision (ECCV), 2018. [pdf]
    VITON: An Image-based Virtual Try-on Network.
    Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, and Larry Davis
    Conference on Computer Vision and Pattern Recognition (CVPR), 2018. Spotlight. [pdf] [code]

    [Note: the dataset used in this paper is no longer available due to copyright infringement. If you have already downloaded the data, please do not use or distribute it.]

    Learning Rich Features for Image Manipulation Detection.
    Peng Zhou, Xintong Han, Vlad Morariu, and Larry Davis
    Conference on Computer Vision and Pattern Recognition (CVPR), 2018. [pdf][code]
    NISP: Pruning Networks Using Neuron Importance Score Propagation.
    Ruichi Yu, Ang Li, Chun-Fu Chen, Jui-Hsin Lai, Vlad Morariu, Xintong Han, Mingfei Gao, Ching-Yung Lin, and Larry Davis
    Conference on Computer Vision and Pattern Recognition (CVPR), 2018. Spotlight. [pdf]
    Automatic Spatially-aware Fashion Concept Discovery.
    Xintong Han, Zuxuan Wu, Phoenix Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, and Larry Davis
    International Conference on Computer Vision (ICCV), 2017. [pdf] [dataset]
    Learning Fashion Compatibility with Bidirectional LSTMs.
    Xintong Han, Zuxuan Wu, Yu-Gang Jiang, and Larry Davis
    ACM Multimedia, 2017. Oral. [pdf] [dataset] [code]
    Two-Stream Neural Networks for Tampered Face Detection.
    Peng Zhou*, Xintong Han*, Vlad Morariu, and Larry Davis (* equal contribution)
    Conference on Computer Vision and Pattern Recognition, Workshop on Media Forensics (CVPRW), 2017. [pdf]
    Son of Zorn's Lemma: Targeted Style Transfer Using Instance-aware Semantic Segmentation.
    Carlos Castillo, Soham De, Xintong Han, Bharat Singh, Abhay Kumar Yadav, and Tom Goldstein
    International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. Oral. [pdf]
    VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products.
    Xintong Han*, Bharat Singh*, Vlad Morariu, and Larry Davis (* equal contribution)
    IEEE Transactions on Multimedia (TMM), 2017. [pdf]
    Presented at the WebVision workshop at CVPR 2017.
    Machine Learning-based Early Termination in Prediction Block Decomposition for VP9.
    Xintong Han, Yunqing Wang, Yaowu Xu, and Jim Bankoski
    IS&T/SPIE Electronic Imaging, 2016. [pdf]