Jiangmiao Pang
Hi! I am a Research Scientist at
Shanghai AI Laboratory
and head the Embodied AI Center.
Our mission is to develop Embodied AGI systems and drive innovation in academia and industry by
fostering open-source collaboration.
We hope to solve real-world challenges, transforming cutting-edge research into practical,
scalable solutions that create meaningful impact.
If you share our vision, please do not hesitate to
contact me!
We have positions for researchers, engineers, interns, and
joint-training PhD students, with directions spanning humanoids,
perception, interaction, manipulation, navigation, physical
simulation, and 4D AIGC.
We open-source our work through
OpenRobotLab. I
was also part-time affiliated with
OpenMMLab as a core
contributor, where I led/co-authored several visual perception
codebases such as
MMDetection
,
MMTracking
, and
MMDetection3D
.
Email
/
GitHub
/
Google Scholar
/
Twitter
/
Zhihu
|
|
News
-
The website is updated occasionally. For the most recent
updates, please refer to my
Google
Scholar. : )
-
[2025/02] We release our recent progress on humanoids,
including HugWBC,
HoST,
BeamDojo, and Homie.
-
[2025/02] We report some interesting observations on Sim2Real
in manipulation; check out
Re3Sim
and stay tuned.
-
[2025/02]
Seer, a
scalable learner for robotic manipulation, is accepted at
ICLR 2025 as an Oral Presentation.
-
[2024/11] We present
Perceptive Internal Model (PIM)
for Humanoid Locomotion.
-
[2024/09] PointLLM receives the
Best Paper Candidate Award at ECCV
2024!
-
[2024/07] We present GRUtopia, a simulation platform with
versatile scenes for Embodied AI.
-
[2024/01] We present PointLLM & EmbodiedScan for Perception,
UniHSI for Interaction, and HIMLoco for Control.
-
[2023/06] OC-SORT is accepted to CVPR 2023. It is recognized
as one of the Most Influential CVPR Papers.
|
Selected Publications
For the full publication list, please refer to my
Google
Scholar.
|
|
HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit
Qingwei Ben*, Feiyu Jia*, Jia Zeng, Junting Dong, Dahua Lin, Jiangmiao Pang
In Submission, 2025
project page / arXiv /
bibtex
|
|
BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds
Huayi Wang, Zirui Wang, Junli Ren, Qingwei Ben, Tao Huang, Weinan Zhang, Jiangmiao Pang
In Submission, 2025
project page / arXiv /
bibtex
|
|
Learning Humanoid Standing-up Control across Diverse Postures
Tao Huang, Junli Ren, Huayi Wang, Zirui Wang, Qingwei Ben, Muning Wen, Xiao Chen, Jianan Li, Jiangmiao Pang
In Submission, 2025
project page / arXiv /
bibtex
|
|
HugWBC: A Unified and General Humanoid Whole-Body Controller for Fine-Grained Locomotion
Yufei Xue*, Wentao Dong*, Minghuan Liu, Weinan Zhang, Jiangmiao Pang
In Submission, 2025
project page / arXiv /
bibtex
|
|
Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation
Xiaoshen Han, Minghuan Liu, Yilun Chen, Junqiu Yu, Xiaoyang Lyu, Yang Tian, Bolun Wang, Weinan Zhang, Jiangmiao Pang
In Submission, 2025
project page / arXiv /
bibtex
|
|
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
Yang Tian*, Sizhe Yang*, Jia Zeng, Ping Wang, Dahua Lin, Hao Dong, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2025
Oral Presentation
project page / arXiv /
bibtex
|
|
Learning Humanoid Locomotion with Perceptive Internal Model
Junfeng Long*, Junli Ren*, Moji Shi*, Zirui Wang, Tao Huang, Ping Luo, Jiangmiao Pang
International Conference on Robotics and Automation (ICRA), 2025
project page / arXiv /
bibtex
|
|
GRUtopia: Dream General Robots in a City at Scale
Hanqing Wang*, Jiahe Chen*, Wensi Huang*, Qingwei Ben*, Tai Wang*, Boyu Mi*, Tao Huang, Siheng Zhao, Yilun Chen, Sizhe Yang, Peizhou Cao, Wenye Yu, Zichao Ye, Jialun Li, Junfeng Long, Zirui Wang, Huiling Wang, Ying Zhao, Zhongying Tu, Yu Qiao, Dahua Lin, Jiangmiao Pang
In Submission, 2024
project page / arXiv /
code /
bibtex
|
|
Learning H-Infinity Locomotion Control
Junfeng Long*, Wenye Yu*, Quanyi Li, Zirui Wang, Dahua Lin, Jiangmiao Pang
Conference on Robot Learning (CoRL), 2024
project page / arXiv /
code /
bibtex
|
|
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
Jiawei Gao*, Ziqin Wang*, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang
Neural Information Processing Systems (NeurIPS), 2024
arXiv /
code /
bibtex
|
|
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
Ruiyuan Lyu*, Tai Wang*, Jingli Lin*, Shuai Yang*, Xiaohan Mao, Yilun Chen, Runsen Xu, Haifeng Huang, Chenming Zhu, Dahua Lin, Jiangmiao Pang
Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2024
project page / arXiv /
code /
bibtex
|
|
PointLLM: Empowering Large Language Models to Understand Point Clouds
Runsen Xu, Xiaolong Wang, Tai Wang, Yilun Chen, Jiangmiao Pang†, Dahua Lin
European Conference on Computer Vision (ECCV), 2024
Best Paper Candidate
project page / arXiv /
code /
bibtex
|
|
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
Xiao Chen, Quanyi Li, Tai Wang, Tianfan Xue, Jiangmiao Pang
Computer Vision and Pattern Recognition (CVPR), 2024
project page / arXiv /
bibtex
|
|
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
Tai Wang*, Xiaohan Mao*, Chenming Zhu*, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang
Computer Vision and Pattern Recognition (CVPR), 2024
project page / arXiv /
code /
bibtex
|
|
Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response
Junfeng Long*, Zirui Wang*, Quanyi Li, Jiawei Gao, Liu Cao, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2024
project page / arXiv /
code /
bibtex
|
|
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
Zeqi Xiao, Tai Wang, Jingbo Wang, Jinkun Cao, Wenwei Zhang, Bo Dai, Dahua Lin, Jiangmiao Pang
International Conference on Learning Representations (ICLR), 2024
Spotlight Presentation
project page / arXiv /
code /
bibtex
|
|
DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking
Qing Lian, Tai Wang, Dahua Lin, Jiangmiao Pang
Conference on Robot Learning (CoRL), 2023
arXiv /
paper / code /
bibtex
|
|
OV-PARTS: Towards Open-Vocabulary Part Segmentation
Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang
Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track, 2023
arXiv /
code /
bibtex
|
|
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
Jinkun Cao, Jiangmiao Pang, Xinshuo Weng, Rawal Khirodkar, Kris Kitani
Computer Vision and Pattern Recognition (CVPR), 2023
Most Influential CVPR Papers
arXiv /
paper / code /
bibtex
|
|
Monocular 3D Object Detection with Depth from Motion
Tai Wang, Jiangmiao Pang†, Dahua Lin
European Conference on Computer Vision (ECCV), 2022
Oral Presentation
arXiv /
code /
Zhihu /
bibtex
|
|
K-Net: Towards Unified Image Segmentation
Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy
Neural Information Processing Systems (NeurIPS), 2021
project page / arXiv /
code /
video /
Zhihu /
bibtex
|
|
FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection
Tai Wang, Xinge Zhu, Jiangmiao Pang, Dahua Lin
International Conference on Computer Vision Workshops (ICCVW), 2021
Best Paper Award at ICCV 2021 workshop on 3DODI
arXiv /
code /
slide /
Zhihu /
bibtex
|
|
Quasi-Dense Similarity Learning for Multiple Object Tracking
Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu
Computer Vision and Pattern Recognition (CVPR), 2021
Oral Presentation
project page / arXiv /
code /
video /
bibtex
|
|
Libra R-CNN: Towards Balanced Learning for Object Detection
Jiangmiao Pang, Kai Chen, Qi Li, Zhihai Xu, Huajun Feng, Jianping Shi, Wanli Ouyang, Dahua Lin
Computer Vision and Pattern Recognition (CVPR), 2019
arXiv /
code /
Zhihu /
bibtex
|
|
Hybrid Task Cascade for Instance Segmentation
Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin
Computer Vision and Pattern Recognition (CVPR), 2019
Winning entry at COCO Object Detection Challenge, ECCV 2018
arXiv /
code /
Zhihu /
bibtex
|
|
R2-CNN: Fast Tiny Object Detection in Large-scale Remote Sensing Images
Jiangmiao Pang, Cong Li, Jianping Shi, Zhihai Xu, Huajun Feng
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2019
ESI Highly Cited Paper
arXiv /
paper /
bibtex
|
|
Selected Awards
- Best Paper Candidate Award, ECCV 2024
- World's Top 2% Scientist, Stanford University
- Most Influential CVPR Papers, OC-SORT in CVPR 2023
-
Best Paper Award of Workshop on 3D Object Detection from
Images, ICCV 2021
-
1st runner-up at Waymo 2D Object Tracking Challenge, CVPR
2020
- Outstanding Reviewer, ICCV 2019
-
1st prize at COCO Object Detection Challenge (without
external data), ICCV 2019
-
1st prize at COCO Object Detection Challenge, ECCV 2018
|
|