Email: [middle][first][last] at [gmail dot com
Google Scholar
I am the Head of Engineering at SpreeAI, a high-tech virtual try-on
startup. Besides overseeing all products R&D, I also lead and grow a world-class team of passionate Machine Learning researchers and engineers to develop and productionize
our photorealistic avatar technology.
Previously, I was a Research Scientist at Meta Reality Labs
Research, where I tech-led a group of researchers to develop 3D perception and human sensing algorithms
for Meta Aria glasses.
Before that, I was a Ph.D. student at The Robotics Institute, Carnegie Mellon University where I worked with Prof. Srinivasa Narasimhan and Prof. Yaser Sheikh on novel methods to capture dense and accurate 3D
shape of human bodies.
I also worked with Prof. Zhaoyang Wang at the Catholic University of
America, where I got B.E. degree in Electrical Engineering, on camera calibration, structured light system, and
tracking algorithms.
Research
I am very interested in various aspects of 3D vision, physics-based vision, and generative models for
photorealistic digitial avatar creation and human scene understanding. The goal is to develop holistic and
end-to-end machine learning systems that understand and recreate virtual environments that are perceptually
indistinguishable from reality.
Jobs opportuninty: I am hiring full time CV&ML&Graphics researchers. I strike to balance between core
and applied research with patents, papers, and product as outputs. Send me an email if you are interested in
working with me.
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling Khoi-Nguyen Mac, Minh Do, Minh Vo IEEE Trans. Image Process. 2023 PDF
EgoHumans: An Egocentric 3D Multi-Human Benchmark Rawal Khirodkar, Aayush Bansal, Lingni Ma, Richard Newcombe, Minh Vo, Kris Kitani ICCV 2023 (Oral and distingished egocentric papers)
Acceptance ratio: 152/8260 = 1.8%
PDFProject Page
Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and
Forecasting on a Video Snippet Shihao Zou, Yuanlu Xu, Chao Li, Lingni Ma, Li Chen, Minh Vo IEEE Trans. on Circuits and Systems for Video Technology, 2023 PDFProject
Page
IDEO: Large Scale Egocentric 3D Object Dataset and Benchmark Challenges Tien Do, Lance Lemke, Jingfan Guo, Khiem Vuong, Minh Vo, Hyun Soo Park arxiv 2022 PDFProject Page
LISA: Learning Implicit Shape and Appearance of Hands Enric Corona, Tomas Hodan, Minh Vo, Francesc Moreno-Noguer, Chris Sweeney, Richard Newcombe,
and Lingni Ma CVPR 2022 PDFProject Page
BANMo: Building Animatable 3D Neural Models from Many Casual Videos Gengshan Yang, Minh Vo Natalia Neverova, Deva Ramanan, Andrea Vedaldi, Hanbyul Joo CVPR 2022 (Oral) Acceptance ratio: 344/8161 = 4.2%
PDFProject
Page
Ego4D: Around the World in 3,000 Hours of Egocentric Video K. Grauman et al. CVPR 2022 (Oral - Best paper finalist and distingished egocentric papers) Acceptance ratio: 344/8161 = 4.2%
PDFProject Page
ODAM: Object Detection, Association, and Mapping using Posed RGB Video
Kejie Li, Daniel DeTone, Steven Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian
Straub, Richard Newcombe
ICCV 2021
(Oral)
Acceptance ratio: 210/6152 = 3.3%
PDFProject Page
ContactOpt: Optimizing Contact to Improve Grasps
Patrick Grady, Chengcheng Tang, Christopher D. Twigg, Minh Vo, Samarth Brahmbhatt, Charles C.
Kemp
CVPR 2021
(Oral)
Acceptance ratio: 210/6152 = 3.3%
PDFProject Page
ANR: Articulated Neural Rendering for Virtual Avatars
Amit Raj, Julian Tanke, James Hays, Minh Vo, Carsten Stoll, and Christoph Lassner
CVPR 2021 PDFProject Page
TexMesh: Reconstructing Detailed Human Texture and Geometry from Monocular Video
Tiancheng Zhi, Christoph Lassner, Tony Tung, Carsten Stoll, Srinivasa Narasimhan, and Minh Vo ECCV 2020 PDFProject
Page
Long-term Human Motion Prediction with Scene Context
Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, and Jitendra Malik
ECCV 2020
(Oral)
Acceptance ratio: 104/5025 = 2.0%
PDFProject Page
4D Visualization of Dynamic Events from Unconstrained Multi-View Videos
Aayush Bansal, Minh Vo, Yaser Sheikh, Deva Ramanan, and Srinivasa Narasimhan
CVPR 2020 PDFProject Page Press Coverage:
CMU,
ACM,
TechXplore,
ScienceMag,
and many others.
Spatiotemporal Bundle Adjustment for Dynamic 3D Human Reconstruction in the Wild Minh Vo, Srinivasa Narasimhan, and Yaser Sheikh
TPAMI 2020 and CVPR 2016 PDFProject Page
Self-supervised Multi-view Person Association and Its Applications Minh Vo, Ersin Yumer, Kalyan Sunkavalli, Sunil Hadap, Yaser Sheikh, and Srinivasa
Narasimhan
TPAMI 2020 PDFProject Page
Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks
Dinesh Reddy, Minh Vo, and Srinivasa Narasimhan,
CVPR 2019 PDFProject
Page
CarFusion: Combining Part Detection and Point Tracking for Dynamic 3D Reconstruction of
Vehicles
Dinesh Reddy, Minh Vo, and Srinivasa Narasimhan,
CVPR 2018 PDFProject
Page
Texture Illumination Separation for Single-shot Structured Light Reconstruction Minh Vo, Srinivasa Narasimhan, and Yaser Sheikh
CCD 2014 and TPAMI 2015 PDFProject Page
Passive Tomography of Turbulance Strength Marina Alterman, Yoav Schechner, Minh Vo, and Srinivasa Narasimhan ECCV 2014 PDF Project Page
Automated fast initial guess in digital image correlation Zhaoyang Wang, Minh Vo, Hien Kieu, Tongyan Pan Strain 2014 PDF
Hyper-accurate flexible calibration technique for fringe-projection-based
three-dimensional imaging Minh Vo, Zhaoyang Wang, Bing Pan, and Tongyan Pan Optics Express 2012 PDF
Supplementary videos
Three-dimensional phantoms for curvature correction in spatial frequency domain
imaging Thu Nguyen, Hanh Le, Minh Vo, Zhaoyang Wang, Long Luu, and Jessica
Ramella-Roman Biomedical Optics Express 2012 PDF
Advanced geometric camera calibration for machine vision Minh Vo, Zhaoyang Wang, Long Luu, and Jun Ma Optical Engineering 2011 PDF
Software
Accuracy enhancement of digital image correlation with B-spline interpolation Long Luu, Zhaoyang Wang, Minh Vo, Thang Hoang, and Jun Ma Optics Letters 2011 PDF
Phase extraction from optical interferograms in presence of intensity nonlinearity and
arbitrary phase shifts Thang Hoang, Zhaoyang Wang, Minh Vo, Jun Ma, Long Luu, and Bing Pan Applied Physics Letters 2011 PDF
Flexible calibration technique for fringe-projection-based three-dimensional
imaging Minh Vo, Zhaoyang Wang, Thang Hoang, and Dung Nguyen Optics Letters 2010 PDF
Others
Exploiting Point Motion, Shape Deformation, and Semantic Priors for Dynamic 3D
Reconstruction in the Wild Minh Vo Ph.D. Thesis PDF