About Me
I am a researcher at Beijing Academy of Artificial Intelligence (BAAI). I received my PhD degree from The University of Adelaide, supervised by Prof. Chunhua Shen. Before that I obtained my Bachelor degree from Tongji University. I am a recipient of the Google PhD Fellowship in 2021.
My research interests lie in the area of computer vision and foundation models. I worked on visual perception (SOLO, SOLOv2, VisTR, BoxInst), visual representation (DenseCL, EVA), visual generalist (Painter, SegGPT), multimodal representation (EVA-CLIP) and multimodal generalist (Emu).
Contact
We are always looking for full-time researchers, engineers and interns at BAAI, feel free to shoot an email if interested!
我们有少量与北大/自动化所的联培博士生名额,欢迎联系!
Email: wangxinlong@baai.ac.cn
News
[Jul.2023] SegGPT is accepted by ICCV 2023.
[Jul.2023] We have released Emu, a multimodal generalist that can seamlessly generate images and texts in multimodal context.
[Feb.2023] Painter and EVA are accepted by CVPR 2023.
[Feb.2023] I am awarded a University Doctoral Research Medal (top 3% PhD graduates).
[Dec.2022] We have released Painter, a generalist model using "image" as the general-purpose interface.
[Nov.2022] We have released EVA, the best 1B Vision Foundation Model to date. All the code and models are available.
[Sept.2022] My PhD thesis is awarded the Dean’s Commendation for Doctoral Thesis Excellence.
[Mar.2022] FreeSOLO is accepted by CVPR 2022.
[Sept.2021] I am awarded Google PhD Fellowship 2021.
[Aug.2021] Extension of SOLO series is accepted by TPAMI, with improved methods and more applications.
Recent Publications
- Generative Pretraining in Multimodality
Quan Sun*, Qiying Yu*, Yufeng Cui*, Fan Zhang*, Xiaosong Zhang*, Yueze Wang, Hongcheng Gao, Jingjing Liu, Tiejun Huang, Xinlong Wang#
arXiv, 2023
[arXiv] [code] [demo]
- SegGPT: Segmenting Everything In Context
Xinlong Wang*, Xiaosong Zhang*, Yue Cao*, Wen, Wang, Chunhua Shen, Tiejun Huang
IEEE International Conference on Computer Vision (ICCV), 2023
[arXiv] [code] [demo]
- Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang*, Wen Wang*, Yue Cao*, Chunhua Shen, Tiejun Huang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[arXiv] [code]
- EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang, Wen Wang, Binhui Xie, Quan Sun, Ledell Wu, Xinggang Wang, Tiejun Huang, Xinlong Wang, Yue Cao
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[arXiv] [code]
First-author Publications
- FreeSOLO: Learning to Segment Objects without Annotations
Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[arXiv] [bibtex] [code]
- SOLO: A Simple Framework for Instance Segmentation
Xinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei Li
IEEE T. Pattern Analysis and Machine Intelligence (TPAMI), 2021
[arXiv] [bibtex] [demo] [code] [code@adet]
- Dense Contrastive Learning for Self-Supervised Visual Pre-Training
Xinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Oral (4.3% acceptance rate)
[arXiv] [bibtex] [code][usage@adet]
- SOLOv2: Dynamic and Fast Instance Segmentation
Xinlong Wang, Rufeng Zhang, Tao Kong, Lei Li, Chunhua Shen
Advances in Neural Information Processing Systems (NeurIPS), 2020
[arXiv] [bibtex] [demo] [code] [code@adet]
- SOLO: Segmenting Objects by Locations
Xinlong Wang, Tao Kong, Chunhua Shen, Yuning Jiang, Lei Li
European Conference on Computer Vision (ECCV), 2020
[arXiv] [bibtex] [code]
- Task-Aware Monocular Depth Estimation for 3D Object Detection
Xinlong Wang, Wei Yin, Tao Kong, Yuning Jiang, Lei Li and Chunhua Shen
AAAI Conference on Artificial Intelligence (AAAI), 2020
Oral (4.5% acceptance rate)
[arXiv] [bibtex] [code]
- Associatively Segmenting Instances and Semantics in Point Clouds
Xinlong Wang, Shu Liu, Xiaoyong Shen, Chunhua Shen and Jiaya Jia
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
[arXiv] [bibtex] [code]
- Repulsion Loss: Detecting Pedestrians in a Crowd
Xinlong Wang, Tete Xiao, Yuning Jiang, Shuai Shao, Jian Sun and Chunhua Shen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
[arXiv] [bibtex]
Co-author Publications
- Conditional Positional Encodings for Vision Transformers
Xiangxiang Chu, Zhi Tian, Bo Zhang, Xinlong Wang, Chunhua Shen
International Conference on Learning Representations (ICLR), 2023
[arXiv] [code]
- Poseur: Direct human pose regression with transformers
Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den Hengel
European Conference on Computer Vision (ECCV), 2022
[arXiv]
- BoxInst: High-Performance Instance Segmentation with Box Annotations
Zhi Tian, Chunhua Shen, Xinlong Wang, Hao Chen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[arXiv] [demo] [code]
- End-to-End Video Instance Segmentation with Transformers
Yuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, Huaxia Xia
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Oral (4.3% acceptance rate)
[arXiv] [code]
- FCPose: Fully Convolutional Multi-Person Pose Estimation with Dynamic Instance-Aware Convolutions
Weian Mao, Zhi Tian, Xinlong Wang, Chunhua Shen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[arXiv]
- Diverse Knowledge Distillation for End-to-End Person Search
Xinyu Zhang, Xinlong Wang, Jia-Wang Bian, Chunhua Shen, Mingyu You
AAAI Conference on Artificial Intelligence (AAAI), 2021
[arXiv]
- Instance-Aware Embedding for Point Cloud Instance Segmentation
Tong He, Yifan liu, Chunhua Shen, Xinlong Wang, Changming Sun
European Conference on Computer Vision (ECCV), 2020
[Paper]
Professional Activities
Journal Reviewer
IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Transactions on Image Processing, IEEE Transactions on Multimedia, IEEE Transactions on Robotics, Neurocomputing, Pattern Recognition, Transactions on Machine Learning ResearchConference Reviewer
ICLR 2023, NeurIPS 2022, ICML 2022, ECCV 2022, CVPR 2022, AAAI 2022, ICLR 2021, NeurIPS 2021, ICCV 2021, ICML 2021, CVPR 2021, AAAI 2021, NeurIPS 2020, AAAI 2020