Lei Zhang

Chief Scientist
Computer Vision & Robotics
International Digital Economy Academy (IDEA)
Guangdong-HongKong-Macau Greater Bay Area
Shenzhen, China

Email: leizhang AT idea dot edu dot cn

Google Scholar | LinkedIn

I am a Chief Scientist of Computer Vision and Robotics at the International Digital Economy Academy (IDEA) and an Adjunct Professor at the Hong Kong University of Science and Technology. Prior to this, I was a Principal Researcher and Research Manager at Microsoft, where I worked from 2001 to 2021: first at Microsoft Research Asia (MSRA) for 12 years, and then, from 2013 to 2021, at Bing Multimedia, Microsoft Research (MSR, Redmond), and Microsoft Cloud & AI. My research interests are in computer vision and machine learning. I am particularly interested in generic visual recognition at large scale and was named an IEEE Fellow for my contributions in this area.

I have served as an editorial board member for IEEE T-MM, T-CSVT, and Multimedia System Journal, and as a program co-chair, area chair, or committee member for many top conferences. I have published 150+ papers and hold 60+ US patents.

I received all my degrees (B.E., M.E., and Ph.D.) in Computer Science from Tsinghua University.


Recent Publications

  1. Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
    Feng Li, Hao Zhang, Huaizhe Xu, Shilong Liu, Lei Zhang, Lionel M. Ni and Heung-Yeung Shum
    Computer Vision and Pattern Recognition (CVPR), 2023
    @inproceedings{li2022mask,
      title = {Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation},
      author = {Li, Feng and Zhang, Hao and Xu, Huaizhe and Liu, Shilong and Zhang, Lei and Ni, Lionel M. and Shum, Heung-Yeung},
      booktitle = {Computer Vision and Pattern Recognition (CVPR)},
      year = {2023}
    }
    
  2. MP-Former: Mask-Piloted Transformer for Image Segmentation
    Hao Zhang, Feng Li, Huaizhe Xu, Shijia Huang, Shilong Liu, Lionel M. Ni and Lei Zhang
    Computer Vision and Pattern Recognition (CVPR), 2023
    @inproceedings{zhang2023mp,
      title = {MP-Former: Mask-Piloted Transformer for Image Segmentation},
      author = {Zhang, Hao and Li, Feng and Xu, Huaizhe and Huang, Shijia and Liu, Shilong and Ni, Lionel M. and Zhang, Lei},
      booktitle = {Computer Vision and Pattern Recognition (CVPR)},
      year = {2023}
    }
    
  3. One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer
    Jing Lin, Ailing Zeng, Haoqian Wang, Lei Zhang and Yu Li
    Computer Vision and Pattern Recognition (CVPR), 2023
    @inproceedings{lin2023one,
      title = {One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer},
      author = {Lin, Jing and Zeng, Ailing and Wang, Haoqian and Zhang, Lei and Li, Yu},
      booktitle = {Computer Vision and Pattern Recognition (CVPR)},
      year = {2023}
    }
    
  4. Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR
    Feng Li, Ailing Zeng, Shilong Liu, Hao Zhang, Hongyang Li, Lei Zhang and Lionel M. Ni
    Computer Vision and Pattern Recognition (CVPR), 2023
    @inproceedings{li2023lite,
      title = {Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR},
      author = {Li, Feng and Zeng, Ailing and Liu, Shilong and Zhang, Hao and Li, Hongyang and Zhang, Lei and Ni, Lionel M.},
      booktitle = {Computer Vision and Pattern Recognition (CVPR)},
      year = {2023}
    }
    
  5. DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
    Yihao Chen, Xianbiao Qi, Jianan Wang and Lei Zhang
    Computer Vision and Pattern Recognition (CVPR), 2023
    @inproceedings{chen2023disco,
      title = {DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training},
      author = {Chen, Yihao and Qi, Xianbiao and Wang, Jianan and Zhang, Lei},
      booktitle = {Computer Vision and Pattern Recognition (CVPR)},
      year = {2023}
    }
    
  6. Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
    Xuan Ju, Ailing Zeng, Jianan Wang, Qiang Xu and Lei Zhang
    Computer Vision and Pattern Recognition (CVPR), 2023
    @inproceedings{ju2023human,
      title = {Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes},
      author = {Ju, Xuan and Zeng, Ailing and Wang, Jianan and Xu, Qiang and Zhang, Lei},
      booktitle = {Computer Vision and Pattern Recognition (CVPR)},
      year = {2023}
    }
    
  7. DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
    Hao Zhang, Feng Li, Shilong Liu, Lei Zhang, Hang Su, Jun Zhu, Lionel M. Ni and Heung-Yeung Shum
    ICLR, 2023
    @inproceedings{zhang2023dino,
      title = {DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection},
      author = {Zhang, Hao and Li, Feng and Liu, Shilong and Zhang, Lei and Su, Hang and Zhu, Jun and Ni, Lionel M. and Shum, Heung-Yeung},
      booktitle = {ICLR},
      year = {2023}
    }
    
  8. Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation
    Jie Yang, Ailing Zeng, Shilong Liu, Feng Li, Ruimao Zhang and Lei Zhang
    ICLR, 2023
    @inproceedings{yang2023edpose,
      title = {Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation},
      author = {Yang, Jie and Zeng, Ailing and Liu, Shilong and Li, Feng and Zhang, Ruimao and Zhang, Lei},
      booktitle = {ICLR},
      year = {2023}
    }
    
  9. LipsFormer: Introducing Lipschitz Continuity to Vision Transformers
    Xianbiao Qi, Jianan Wang, Yihao Chen, Yukai Shi and Lei Zhang
    ICLR, 2023
    @inproceedings{qi2023lipsformer,
      title = {LipsFormer: Introducing Lipschitz Continuity to Vision Transformers},
      author = {Qi, Xianbiao and Wang, Jianan and Chen, Yihao and Shi, Yukai and Zhang, Lei},
      booktitle = {ICLR},
      year = {2023}
    }
    
  10. DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
    Shilong Liu, Yaoyuan Liang, Feng Li, Shijia Huang, Hao Zhang, Hang Su, Jun Zhu and Lei Zhang
    AAAI, 2023
    @inproceedings{liu2023dq,
      title = {DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding},
      author = {Liu, Shilong and Liang, Yaoyuan and Li, Feng and Huang, Shijia and Zhang, Hao and Su, Hang and Zhu, Jun and Zhang, Lei},
      booktitle = {AAAI},
      year = {2023}
    }