We create a synthetic dataset, SynFoot, of 50K images of feet, along with surface normals, keypoints and masks.
These were created using our custom library BlenderSynth, and are available for download.
We train a network to predict both normals and corresponding uncertainties.
Even though our synthetic dataset only has 8 foot scans, we find that, with aggressive data augmentation, our normal predictor achieves high quality surface normal predictions on in-the-wild images.
We obtain ground truth normals from our reconstruction dataset, and show that our method significantly outperforms COLMAP, and SOTA normal predictors.
We fit the parameters of the FIND model to the surface normals, using the uncertainty to weight the loss function.
We do this in a multiview setting, and produce better reconstructions than COLMAP, evaluated on our new benchmark foot reconstruction dataset, available for download.
We can do this accurately with as few as 3 views, whereas COLMAP needs 15+.
We acknowledge the collaboration and financial support of Trya Srl.
@inproceedings{boyne2024found, title={FOUND: {F}oot {O}ptimisation with {U}ncertain {N}ormals for Surface {D}eformation using Synthetic Data}, author={Boyne, Oliver and Bae, Gwangbin and Charles, James and Cipolla, Roberto}, booktitle={Winter Conference on Applications of Computer Vision (WACV)}, year={2024} }