Computer Vision and Deep Learning for 4D Augmented Reality
By: Karthik Shivashankar
Potential Business Impact:
Makes 3D videos work better in virtual reality.
The prospect of 4D video in Extended Reality (XR) platform is huge and exciting, it opens a whole new way of human computer interaction and the way we perceive the reality and consume multimedia. In this thesis, we have shown that feasibility of rendering 4D video in Microsoft mixed reality platform. This enables us to port any 3D performance capture from CVSSP into XR product like the HoloLens device with relative ease. However, if the 3D model is too complex and is made up of millions of vertices, the data bandwidth required to port the model is a severe limitation with the current hardware and communication system. Therefore, in this project we have also developed a compact representation of both shape and appearance of the 4d video sequence using deep learning models to effectively learn the compact representation of 4D video sequence and reconstruct it without affecting the shape and appearance of the video sequence.
Similar Papers
EX-4D: EXtreme Viewpoint 4D Video Synthesis via Depth Watertight Mesh
CV and Pattern Recognition
Makes videos look real from any angle.
Geometry-aware 4D Video Generation for Robot Manipulation
CV and Pattern Recognition
Robots predict future movements from new angles.
Transcending Dimensions using Generative AI: Real-Time 3D Model Generation in Augmented Reality
Graphics
Makes 3D models from pictures in AR.