Jiangmiao Pang

Hi! I am a Research Scientist and Team Lead at Shanghai AI Laboratory, where I lead the OpenRobotLab. We are dedicated to building Embodied AGI systems and empowering academia and industry through open-source initiatives. Before that, I was with MMLab@CUHK and worked with Prof. Dahua Lin.

Currently, we are focusing on multi-modal perception, interaction, and their conjunction with robot learning. We are building various real embodied systems so as expected to create positive impacts on humans. NOTE: We are looking for self-motivated researchers, engineers, interns, and joint-training PhD students in Multimodal Learning, Robot Learning, and Embodied AI. If you are interested or experienced in related fields, please feel free to directly contact me!

I was also part-time affiliated with OpenMMLab as a core contributor, where I led/co-authored several visual perception codebases, mainly including MMDetection GitHub Stars, MMTracking GitHub Stars, and MMDetection3D GitHub Stars. My works have won 1st prizes at COCO Object Detection Challenge 2018 and 2019 (without external data), and 1st runner-up at Waymo 2D Object Tracking Challenge 2020.

Email  /  GitHub  /  Google Scholar  /  Twitter  /  Zhihu

profile photo

News

  • [2024/01] The website is updated occasionally. For the most recent updates, please refer to my Google Scholar. : )
  • [2024/01] We present PointLLM & EmbodiedScan for Perception, UniHSI for Interaction, and HIMLoco for Control.
  • [2023/06] OC-SORT is accepted to CVPR 2023. It is recognized as one of the Most Influential CVPR Papers.
  • [2022/10] DfM and DenseSiam are accepted to ECCV 2022. DfM is selected as Oral Presentation.
  • [2022/06] Video K-Net is accepted to CVPR 2022 as Oral Presentation.

Selected Publications

--> For the full publication list, please refer to my Google Scholar.

† denotes corresponding author. * denotes equal contribution. My teammates/students/mentees are underline marked.

project image

GRUtopia: Dream General Robots in a City at Scale


Hanqing Wang*, Jiahe Chen*, Wensi Huang*, Qingwei Ben*, Tai Wang*, Boyu Mi*, Tao Huang, Siheng Zhao, Yilun Chen, Sizhe Yang, Peizhou Cao, Wenye Yu, Zichao Ye, Jialun Li, Junfeng Long, Zirui Wang, Huiling Wang, Ying Zhao, Zhongying Tu, Yu Qiao, Dahua Lin, Jiangmiao Pang
In Submission, 2024
project page / arXiv / code / bibtex

project image

Learning H-Infinity Locomotion Control


Junfeng Long*, Wenye Yu*, Quanyi Li, Zirui Wang, Dahua Lin, Jiangmiao Pang
In Submission, 2024
project page / arXiv / code / bibtex

project image

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics


Jiawei Gao*, Ziqin Wang*, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang
In Submission, 2024
arXiv / code / bibtex

project image

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations


Ruiyuan Lyu*, Tai Wang*, Jingli Lin*, Shuai Yang*, Xiaohan Mao, Yilun Chen, Runsen Xu, Haifeng Huang, Chenming Zhu, Dahua Lin, Jiangmiao Pang
In Submission, 2024
project page / arXiv / code / bibtex

project image

Grounded 3D-LLM with Referent Tokens


Yilun Chen*, Shuai Yang*, Haifeng Huang*, Tai Wang, Ruiyuan Lyu, Runsen Xu, Dahua Lin, Jiangmiao Pang
In Submission, 2024
project page / arXiv / code / bibtex

project image

PointLLM: Empowering Large Language Models to Understand Point Clouds


Runsen Xu, Xiaolong Wang, Tai Wang, Yilun Chen, Jiangmiao Pang, Dahua Lin
European Conference on Computer Vision (ECCV), 2024, 2024
Oral Presentation with all Strong Accept recommendations
project page / arXiv / code / bibtex

project image

GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction


Xiao Chen, Quanyi Li, Tai Wang, Tianfan Xue, Jiangmiao Pang
Computer Vision and Pattern Recognition (CVPR), 2024
project page / arXiv / bibtex

project image

EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI


Tai Wang*, Xiaohan Mao*, Chenming Zhu*, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang
Computer Vision and Pattern Recognition (CVPR), 2024
project page / arXiv / code / bibtex

project image

Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response


Junfeng Long*, Zirui Wang*, Quanyi Li, Jiawei Gao, Liu Cao, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2024
project page / arXiv / code / bibtex

project image

Unified Human-Scene Interaction via Prompted Chain-of-Contacts


Zeqi Xiao, Tai Wang, Jingbo Wang, Jinkun Cao, Wenwei Zhang, Bo Dai, Dahua Lin, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2024
Spotlight Presentation
project page / arXiv / code / bibtex

project image

DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking


Qing Lian, Tai Wang, Dahua Lin, Jiangmiao Pang
Conference on Robot Learning (CoRL), 2023
arXiv / paper / code / bibtex

project image

OV-PARTS: Towards Open-Vocabulary Part Segmentation


Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang
Neural Information Processing Systems Datasets and Benchmarks Track, 2023
arXiv / code / bibtex

project image

Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking


Jinkun Cao, Jiangmiao Pang, Xinshuo Weng, Rawal Khirodkar, Kris Kitani
Computer Vision and Pattern Recognition (CVPR), 2023
Most Influential CVPR Papers
arXiv / paper / code / bibtex

project image

Monocular 3D Object Detection with Depth from Motion


Tai Wang, Jiangmiao Pang, Dahua Lin
European Conference on Computer Vision (ECCV), 2022
Oral Presentation
arXiv / code / Zhihu / bibtex

project image

K-Net: Towards Unified Image Segmentation


Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy
Neural Information Processing Systems (NeurIPS), 2021
project page / arXiv / code / video / Zhihu / bibtex

project image

FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection


Tai Wang, Xinge Zhu, Jiangmiao Pang, Dahua Lin
International Conference on Computer Vision Workshops (ICCVW), 2021
Best Paper Award at ICCV 2021 workshop on 3DODI
arXiv / code / slide / Zhihu / bibtex

project image

Quasi-Dense Similarity Learning for Multiple Object Tracking


Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu
Computer Vision and Pattern Recognition (CVPR), 2021
Oral Presentation
project page / arXiv / code / video / bibtex

project image

Libra R-CNN: Towards Balanced Learning for Object Detection


Jiangmiao Pang, Kai Chen, Qi Li, Zhihai Xu, Huajun Feng, Jianping Shi, Wanli Ouyang, Dahua Lin
Computer Vision and Pattern Recognition (CVPR), 2019
arXiv / code / Zhihu / bibtex

project image

Hybrid Task Cascade for Instance Segmentation


Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin
Computer Vision and Pattern Recognition (CVPR), 2019
Winning entry at COCO Object Detection Challenge, ECCV 2018
arXiv / code / Zhihu / bibtex

project image

R2-CNN: Fast Tiny Object Detection in Large-scale Remote Sensing Images


Jiangmiao Pang, Cong Li, Jianping Shi, Zhihai Xu, Huajun Feng
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2019
ESI Highly Cited Paper
arXiv / paper / bibtex

Selected Awards

  • World's Top 2% Scientist, Stanford University
  • Most Influential CVPR Papers, OC-SORT in CVPR 2023
  • Best Paper Award of Workshop on 3D Object Detection from Images, ICCV 2021
  • Doctoral Consortium Award, CVPR 2021
  • 1st runner up at Waymo 2D Object Tracking Challenge, CVPR 2020
  • Outstanding Reviewer, ICCV 2019
  • 1st prize at COCO Object Detection Challenge (without external data), ICCV 2019
  • 1st prize at COCO Object Detection Challenge, ECCV 2018

Services

  • Conference Reviewer: CVPR, ICCV, ECCV, RSS, CoRL, NeurIPS, ICLR, ICML, AAAI, IROS, ACCV
  • Journal Reviewer: TPAMI, IJCV, TRO, TIP, RA-L, TGRS, Neurocomputing, Pattern Recognition Letters, Signal Processing Letters

HTML Counter
Design and source code from Jon Barron's website