Shubham Tulsiani
Shubham Tulsiani

@shubhtuls

4 Tweets 10 reads Aug 24, 2022
[1/4] Camera poses are essential for (neural) 3D reconstruction. But what about sparse-view settings where obtaining these via COLMAP isnโ€™t feasible? Our ECCV paper tackles this using an energy-based formulation for predicting relative rotation (jasonyzhang.com)
[2/N] We use the pairwise predictions to find a maximally consistent set of rotations across multiple images. Starting with a greedy initialization, we iteratively update rotations to maximize the joint likelihood.
[3/N] This approach is more robust in sparse-view settings compared to SfM-based methods (although you should still use COLMAP given many views!), and also generalizes to unseen object categories.
[4/N] Assuming centre-facing cameras, the recovered rotations can initialize 6D poses and help bootstrap sparse-view neural 3D reconstruction methods (e.g. @jasonyzhang2's prior work on NeRS).

Loading suggestions...