Jiangmiao Pang

Hi! I am a Research Scientist at Shanghai AI Laboratory and head the Embodied AI Center.

Our mission is to develop Embodied AGI systems and drive innovation in academia and industry by fostering open-source collaboration. We hope to solve real-world challenges, transforming cutting-edge research into practical, scalable solutions that create meaningful impact.

If you share our vision, please do not hesitate to contact me! We have positions for researchers, engineers, interns, and joint-training PhD students, with directions spanning humanoids, perception, interaction, manipulation, navigation, physical simulation, and 4D AIGC.

We open source through OpenRobotLab. I was also part-time affiliated with OpenMMLab as a core contributor, where I led/co-authored several visual perception codebases such as MMDetection, MMTracking, and MMDetection3D.

Email  /  GitHub  /  Google Scholar  /  Twitter  /  Zhihu

News

  • The website is updated occasionally. For the most recent updates, please refer to my Google Scholar. : )
  • [2025/02] We release our recent progress on humanoids, including HugWBC, HoST, BeamDojo, and Homie.
  • [2025/02] We report some interesting observations on Sim2Real in manipulation. Check out Re3Sim and stay tuned.
  • [2025/02] Seer, a scalable learner for robotic manipulation, is accepted at ICLR 2025 as an Oral Presentation.
  • [2024/11] We present Perceptive Internal Model (PIM) for Humanoid Locomotion.
  • [2024/09] PointLLM receives Best Paper Candidate Award at ECCV 2024!
  • [2024/07] We present GRUtopia, a simulation platform with versatile scenes for Embodied AI.
  • [2024/01] We present PointLLM & EmbodiedScan for Perception, UniHSI for Interaction, and HIMLoco for Control.
  • [2023/06] OC-SORT is accepted to CVPR 2023. It is recognized as one of the Most Influential CVPR Papers.

Selected Publications

For the full publication list, please refer to my Google Scholar.

HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit


Qingwei Ben*, Feiyu Jia*, Jia Zeng, Junting Dong, Dahua Lin, Jiangmiao Pang
In Submission, 2025
project page / arXiv / bibtex

BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds


Huayi Wang, Zirui Wang, Junli Ren, Qingwei Ben, Tao Huang, Weinan Zhang, Jiangmiao Pang
In Submission, 2025
project page / arXiv / bibtex

Learning Humanoid Standing-up Control across Diverse Postures


Tao Huang, Junli Ren, Huayi Wang, Zirui Wang, Qingwei Ben, Muning Wen, Xiao Chen, Jianan Li, Jiangmiao Pang
In Submission, 2025
project page / arXiv / bibtex

HugWBC: A Unified and General Humanoid Whole-Body Controller for Fine-Grained Locomotion


Yufei Xue*, Wentao Dong*, Minghuan Liu, Weinan Zhang, Jiangmiao Pang
In Submission, 2025
project page / arXiv / bibtex

Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation


Xiaoshen Han, Minghuan Liu, Yilun Chen, Junqiu Yu, Xiaoyang Lyu, Yang Tian, Bolun Wang, Weinan Zhang, Jiangmiao Pang
In Submission, 2025
project page / arXiv / bibtex

Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation


Yang Tian*, Sizhe Yang*, Jia Zeng, Ping Wang, Dahua Lin, Hao Dong, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2025
Oral Presentation
project page / arXiv / bibtex

Learning Humanoid Locomotion with Perceptive Internal Model


Junfeng Long*, Junli Ren*, Moji Shi*, Zirui Wang, Tao Huang, Ping Luo, Jiangmiao Pang
International Conference on Robotics and Automation (ICRA), 2025
project page / arXiv / bibtex

GRUtopia: Dream General Robots in a City at Scale


Hanqing Wang*, Jiahe Chen*, Wensi Huang*, Qingwei Ben*, Tai Wang*, Boyu Mi*, Tao Huang, Siheng Zhao, Yilun Chen, Sizhe Yang, Peizhou Cao, Wenye Yu, Zichao Ye, Jialun Li, Junfeng Long, Zirui Wang, Huiling Wang, Ying Zhao, Zhongying Tu, Yu Qiao, Dahua Lin, Jiangmiao Pang
In Submission, 2024
project page / arXiv / code / bibtex

Learning H-Infinity Locomotion Control


Junfeng Long*, Wenye Yu*, Quanyi Li, Zirui Wang, Dahua Lin, Jiangmiao Pang
Conference on Robot Learning (CoRL), 2024
project page / arXiv / code / bibtex

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics


Jiawei Gao*, Ziqin Wang*, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang
Neural Information Processing Systems (NeurIPS), 2024
arXiv / code / bibtex

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations


Ruiyuan Lyu*, Tai Wang*, Jingli Lin*, Shuai Yang*, Xiaohan Mao, Yilun Chen, Runsen Xu, Haifeng Huang, Chenming Zhu, Dahua Lin, Jiangmiao Pang
Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024
project page / arXiv / code / bibtex

PointLLM: Empowering Large Language Models to Understand Point Clouds


Runsen Xu, Xiaolong Wang, Tai Wang, Yilun Chen, Jiangmiao Pang, Dahua Lin
European Conference on Computer Vision (ECCV), 2024
Best Paper Candidate
project page / arXiv / code / bibtex

GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction


Xiao Chen, Quanyi Li, Tai Wang, Tianfan Xue, Jiangmiao Pang
Computer Vision and Pattern Recognition (CVPR), 2024
project page / arXiv / bibtex

EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI


Tai Wang*, Xiaohan Mao*, Chenming Zhu*, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang
Computer Vision and Pattern Recognition (CVPR), 2024
project page / arXiv / code / bibtex

Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response


Junfeng Long*, Zirui Wang*, Quanyi Li, Jiawei Gao, Liu Cao, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2024
project page / arXiv / code / bibtex

Unified Human-Scene Interaction via Prompted Chain-of-Contacts


Zeqi Xiao, Tai Wang, Jingbo Wang, Jinkun Cao, Wenwei Zhang, Bo Dai, Dahua Lin, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2024
Spotlight Presentation
project page / arXiv / code / bibtex

DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking


Qing Lian, Tai Wang, Dahua Lin, Jiangmiao Pang
Conference on Robot Learning (CoRL), 2023
arXiv / paper / code / bibtex

OV-PARTS: Towards Open-Vocabulary Part Segmentation


Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang
Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2023
arXiv / code / bibtex

Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking


Jinkun Cao, Jiangmiao Pang, Xinshuo Weng, Rawal Khirodkar, Kris Kitani
Computer Vision and Pattern Recognition (CVPR), 2023
Most Influential CVPR Papers
arXiv / paper / code / bibtex

Monocular 3D Object Detection with Depth from Motion


Tai Wang, Jiangmiao Pang, Dahua Lin
European Conference on Computer Vision (ECCV), 2022
Oral Presentation
arXiv / code / Zhihu / bibtex

K-Net: Towards Unified Image Segmentation


Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy
Neural Information Processing Systems (NeurIPS), 2021
project page / arXiv / code / video / Zhihu / bibtex

FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection


Tai Wang, Xinge Zhu, Jiangmiao Pang, Dahua Lin
International Conference on Computer Vision Workshops (ICCVW), 2021
Best Paper Award at ICCV 2021 workshop on 3DODI
arXiv / code / slide / Zhihu / bibtex

Quasi-Dense Similarity Learning for Multiple Object Tracking


Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu
Computer Vision and Pattern Recognition (CVPR), 2021
Oral Presentation
project page / arXiv / code / video / bibtex

Libra R-CNN: Towards Balanced Learning for Object Detection


Jiangmiao Pang, Kai Chen, Qi Li, Zhihai Xu, Huajun Feng, Jianping Shi, Wanli Ouyang, Dahua Lin
Computer Vision and Pattern Recognition (CVPR), 2019
arXiv / code / Zhihu / bibtex

Hybrid Task Cascade for Instance Segmentation


Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin
Computer Vision and Pattern Recognition (CVPR), 2019
Winning entry at COCO Object Detection Challenge, ECCV 2018
arXiv / code / Zhihu / bibtex

R2-CNN: Fast Tiny Object Detection in Large-scale Remote Sensing Images


Jiangmiao Pang, Cong Li, Jianping Shi, Zhihai Xu, Huajun Feng
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2019
ESI Highly Cited Paper
arXiv / paper / bibtex

Selected Awards

  • Best Paper Candidate Award, ECCV 2024
  • World's Top 2% Scientist, Stanford University
  • Most Influential CVPR Papers, OC-SORT in CVPR 2023
  • Best Paper Award of Workshop on 3D Object Detection from Images, ICCV 2021
  • 1st runner-up at Waymo 2D Object Tracking Challenge, CVPR 2020
  • Outstanding Reviewer, ICCV 2019
  • 1st prize at COCO Object Detection Challenge (without external data), ICCV 2019
  • 1st prize at COCO Object Detection Challenge, ECCV 2018

Design and source code from Jon Barron's website