Minh Phuoc Vo

Head of Engineering
SpreeAI

Email: [middle][first][last] at [gmail dot com
Google Scholar

I am the Head of Engineering at SpreeAI, a high-tech virtual try-on startup. Besides overseeing all products R&D, I also lead and grow a world-class team of passionate Machine Learning researchers and engineers to develop and productionize our photorealistic avatar technology. Previously, I was a Research Scientist at Meta Reality Labs Research, where I tech-led a group of researchers to develop 3D perception and human sensing algorithms for Meta Aria glasses. Before that, I was a Ph.D. student at The Robotics Institute, Carnegie Mellon University where I worked with Prof. Srinivasa Narasimhan and Prof. Yaser Sheikh on novel methods to capture dense and accurate 3D shape of human bodies. I also worked with Prof. Zhaoyang Wang at the Catholic University of America, where I got B.E. degree in Electrical Engineering, on camera calibration, structured light system, and tracking algorithms.
 

Research

I am very interested in various aspects of 3D vision, physics-based vision, and generative models for photorealistic digitial avatar creation and human scene understanding. The goal is to develop holistic and end-to-end machine learning systems that understand and recreate virtual environments that are perceptually indistinguishable from reality.

Jobs opportuninty: I am hiring full time CV&ML&Graphics researchers. I strike to balance between core and applied research with patents, papers, and product as outputs. Send me an email if you are interested in working with me.

Award

Patent

Publication

Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen Mac, Minh Do, Minh Vo
IEEE Trans. Image Process. 2023  
PDF

EgoHumans: An Egocentric 3D Multi-Human Benchmark
Rawal Khirodkar, Aayush Bansal, Lingni Ma, Richard Newcombe, Minh Vo, Kris Kitani
ICCV 2023 (Oral and distingished egocentric papers)  Acceptance ratio: 152/8260 = 1.8%
PDF Project Page

Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet
Shihao Zou, Yuanlu Xu, Chao Li, Lingni Ma, Li Chen, Minh Vo
IEEE Trans. on Circuits and Systems for Video Technology, 2023  
PDF Project Page

IDEO: Large Scale Egocentric 3D Object Dataset and Benchmark Challenges
Tien Do, Lance Lemke, Jingfan Guo, Khiem Vuong, Minh Vo, Hyun Soo Park
arxiv 2022  
PDF Project Page

TAVA: Template-free Animatable Volumetric Actors
Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhoefer, Jurgen Gall, Angjoo Kanazawa, Christoph Lassner
ECCV 2022  
PDF Project Page

LISA: Learning Implicit Shape and Appearance of Hands
Enric Corona, Tomas Hodan, Minh Vo, Francesc Moreno-Noguer, Chris Sweeney, Richard Newcombe, and Lingni Ma
CVPR 2022  
PDF Project Page

BANMo: Building Animatable 3D Neural Models from Many Casual Videos
Gengshan Yang, Minh Vo Natalia Neverova, Deva Ramanan, Andrea Vedaldi, Hanbyul Joo
CVPR 2022 (Oral)   Acceptance ratio: 344/8161 = 4.2%
PDF Project Page

Ego4D: Around the World in 3,000 Hours of Egocentric Video
K. Grauman et al.
CVPR 2022 (Oral - Best paper finalist and distingished egocentric papers)   Acceptance ratio: 344/8161 = 4.2%
PDF Project Page

ODAM: Object Detection, Association, and Mapping using Posed RGB Video
Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard Newcombe
ICCV 2021 (Oral)   Acceptance ratio: 210/6152 = 3.3%
PDF Project Page

ContactOpt: Optimizing Contact to Improve Grasps
Patrick Grady, Chengcheng Tang, Christopher D. Twigg, Minh Vo, Samarth Brahmbhatt, Charles C. Kemp
CVPR 2021 (Oral)   Acceptance ratio: 210/6152 = 3.3%
PDF Project Page

ANR: Articulated Neural Rendering for Virtual Avatars
Amit Raj, Julian Tanke, James Hays, Minh Vo, Carsten Stoll, and Christoph Lassner
CVPR 2021  
PDF Project Page

TexMesh: Reconstructing Detailed Human Texture and Geometry from Monocular Video
Tiancheng Zhi, Christoph Lassner, Tony Tung, Carsten Stoll, Srinivasa Narasimhan, and Minh Vo
ECCV 2020  
PDF Project Page

Long-term Human Motion Prediction with Scene Context
Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, and Jitendra Malik
ECCV 2020 (Oral)   Acceptance ratio: 104/5025 = 2.0%
PDF Project Page

4D Visualization of Dynamic Events from Unconstrained Multi-View Videos
Aayush Bansal, Minh Vo, Yaser Sheikh, Deva Ramanan, and Srinivasa Narasimhan
CVPR 2020  
PDF Project Page
Press Coverage: CMU, ACM, TechXplore, ScienceMag, and many others.

Spatiotemporal Bundle Adjustment for Dynamic 3D Human Reconstruction in the Wild
Minh Vo, Srinivasa Narasimhan, and Yaser Sheikh
TPAMI 2020 and CVPR 2016  
PDF Project Page

Self-supervised Multi-view Person Association and Its Applications
Minh Vo, Ersin Yumer, Kalyan Sunkavalli, Sunil Hadap, Yaser Sheikh, and Srinivasa Narasimhan
TPAMI 2020  
PDF Project Page

Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks
Dinesh Reddy, Minh Vo, and Srinivasa Narasimhan,
CVPR 2019  
PDF Project Page

CarFusion: Combining Part Detection and Point Tracking for Dynamic 3D Reconstruction of Vehicles
Dinesh Reddy, Minh Vo, and Srinivasa Narasimhan,
CVPR 2018  
PDF Project Page

Texture Illumination Separation for Single-shot Structured Light Reconstruction
Minh Vo, Srinivasa Narasimhan, and Yaser Sheikh
CCD 2014 and TPAMI 2015  
PDF Project Page

Passive Tomography of Turbulance Strength
Marina Alterman, Yoav Schechner, Minh Vo, and Srinivasa Narasimhan
ECCV 2014  
PDF Project Page

Automated fast initial guess in digital image correlation
Zhaoyang Wang, Minh Vo, Hien Kieu, Tongyan Pan
Strain 2014  
PDF

Hyper-accurate flexible calibration technique for fringe-projection-based three-dimensional imaging
Minh Vo, Zhaoyang Wang, Bing Pan, and Tongyan Pan
Optics Express 2012  
PDF Supplementary videos

Three-dimensional phantoms for curvature correction in spatial frequency domain imaging
Thu Nguyen, Hanh Le, Minh Vo, Zhaoyang Wang, Long Luu, and Jessica Ramella-Roman
Biomedical Optics Express 2012  
PDF

Advanced geometric camera calibration for machine vision
Minh Vo, Zhaoyang Wang, Long Luu, and Jun Ma
Optical Engineering 2011  
PDF Software

Accuracy enhancement of digital image correlation with B-spline interpolation
Long Luu, Zhaoyang Wang, Minh Vo, Thang Hoang, and Jun Ma
Optics Letters 2011  
PDF

Phase extraction from optical interferograms in presence of intensity nonlinearity and arbitrary phase shifts
Thang Hoang, Zhaoyang Wang, Minh Vo, Jun Ma, Long Luu, and Bing Pan
Applied Physics Letters 2011  
PDF

Flexible calibration technique for fringe-projection-based three-dimensional imaging
Minh Vo, Zhaoyang Wang, Thang Hoang, and Dung Nguyen
Optics Letters 2010  
PDF

Others

Exploiting Point Motion, Shape Deformation, and Semantic Priors for Dynamic 3D Reconstruction in the Wild
Minh Vo
Ph.D. Thesis  
PDF

External Coverage