Wei Mao

Wei Mao 毛伟

Wei is an AI Research Scientist at Reality Labs of Meta, working on 3D AI agents. Prior to that, he was a senior research scientist at XR Vision Labs of Tencent working on 3D AIGC for games. From 2018 to 2024, he was a PhD student and Postdoc researcher at Australian National University (ANU) advised by Prof. Richard Hartley (ANU), Dr. Miaomiao Liu (ANU), and Dr. Mathieu Salzmann (EPFL) where his research interests mainly lie in two folds: human motion understanding including human motion prediction/generation, human-scene/hand-object interaction; 3D vision including 3D reconstuction of static/dynamic scenes, neural rendering, and depth estimation. He is also fortunate to have collaborated with Prof. Hongdong Li (ANU) and Dr Jose M. Alvarez (Nvidia).

Email / CV (中文简历) / Google Scholar / Github / Linkedin

News

09/2025: One paper has been accepted to NeurIPS2025 Congrates to Hoang.
07/2024: Two papers have been accepted to ECCV2024. Congrates to Rong and Chaoyue.
01/2024: Our paper "Neural SDF Flow for 3D Reconstruction of Dynamic Scenes" has been accepted to ICLR2024.
09/2023: One paper has been accepted to NeurIPS2023, Congrates to Rong.
03/2023: One paper has been accepted to CVPR2023, Congrates to Huiyu.
12/2022: My PhD thesis titled "Human Motion Prediction: From Deterministic to Stochastic" has been published (here).
10/2022: Our paper "Contact-aware Human Motion Forecasting" has been accepted to NeurIPS2022
03/2022: One paper accepted to CVPR2022 (oral)
01/2022: I will work as a Postdoc with Prof. Richard Hartley and Dr. Miaomiao Liu after finishing my PhD !!!
07/2021: One paper has been accepted to ICCV2021 (oral)
03/2020: One paper has been accepted to CVPR2020 (oral). Congrates to Jiayu.

Research

I have broad interests in human related research and 3D vision.

	Neural SDF Flow for 3D Reconstruction of Dynamic Scenes Wei Mao, Miaomiao Liu, Richard Hartley, Mathieu Salzmann ICLR, 2024 openreview / code We propose SDF flow, a new implicit representation for dynamic scenes. We derived a linear relationship between scene flow and SDF flow bridging the scene geometry with its motion.
	DeepSimHO: Stable Pose Estimation for Hand-Object Interaction via Physics Simulation Rong Wang, Wei Mao, Hongdong Li NeurIPS, 2023 openreview / video / code We propose to learn the stability score from physical simulators for predicting hand poses that can firmly grasp the objects.
	VisFusion: Visibility-aware Online 3D Scene Reconstruction from Videos Huiyu Gao, Wei Mao, Miaomiao Liu CVPR, 2023 arXiv / project page / code Estimating visibility aware feature volume with ray-based sparsification for volumetric-based 3D reconstruction.
	Interacting Hand-Object Pose Estimation via Dense Mutual Attention Rong Wang, Wei Mao, Hongdong Li WACV, 2023 arXiv / video / code Mutual attention between hand and object for hand object pose estimation.
	Contact-aware Human Motion Forecasting Wei Mao, Miaomiao Liu, Richard Hartley, Mathieu Salzmann NeurIPS, 2022 (Spotlight) arXiv / video / code Predicting per-joint contact maps for human motion forecasting.
	Weakly-supervised Action Transition Learning for Stochastic Human Motion Prediction Wei Mao, Miaomiao Liu, Mathieu Salzmann CVPR, 2022 (Oral Presentation) arXiv / video / code Generating diverse human motion with a sequence of action labels.
	Generating Smooth Pose Sequences for Diverse Human Motion Prediction Wei Mao, Miaomiao Liu, Mathieu Salzmann ICCV, 2021 (Oral Presentation) arXiv / video / code A model to predict diverse human motions without diverse training samples.
	History Repeats Itself: Human Motion Prediction via Motion Attention Wei Mao, Miaomiao Liu, Mathieu Salzmann ECCV, 2020 arXiv / video / code Capture similar motion pattern with motion attention.
	Cost Volume Pyramid Based Depth Inference for Multi-View Stereo Jiayu Yang, Wei Mao, Jose M. Alvarez, Miaomiao Liu CVPR, 2020 (Oral Presentation) arXiv / video / code A coarse-to-fine approach for multi-view stereo.
	Learning Trajectory Dependencies for Human Motion Prediction Wei Mao, Miaomiao Liu, Mathieu Salzmann Hongdong Li ICCV, 2019 (Oral Presentation) arXiv / video / code Human motion prediction using DCT for temporal encoding and GCN for spatial encoding.

Website template is from Jon Barron