
LegoFormer

Source code for the paper LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction.

[Figure: LegoFormer architecture]

Setup

Dependency installation

Run ./install_dependencies.sh to install the dependencies. You may want to create a virtual environment before you do so.
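
For example, a typical setup might look like this (the virtual environment name venv is an arbitrary choice; only install_dependencies.sh comes from the repo):

    # Optional: create and activate a virtual environment first.
    python3 -m venv venv
    source venv/bin/activate

    # Install the project dependencies.
    ./install_dependencies.sh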

Dataset

The ShapeNet dataset is used in the experiments. It consists of two parts: rendered images and ground-truth voxel grids.

Please download both parts and save them in any location you prefer. Don't forget to adjust the dataset path in the config file, for example as sketched below.
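
A minimal sketch of what that adjustment might look like, assuming the config stores the dataset locations under keys similar to these (the key names and paths are hypothetical; check legoformer/config/legoformer_m.yaml for the actual schema):

    dataset:
      # Hypothetical keys and paths; adjust to match the real config layout.
      rendering_path: /data/ShapeNet/ShapeNetRendering
      voxel_path: /data/ShapeNet/ShapeNetVox32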

Pre-trained models

Pre-trained multi-view and single-view models are available for download. Please download and place them under the checkpoints directory.
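
For example, assuming the checkpoint files were saved to ~/Downloads (the download location is an assumption; the filenames match the evaluation commands below):

    # Create the checkpoints directory and move the downloaded weights into it.
    mkdir -p checkpoints
    mv ~/Downloads/legoformer_m.ckpt ~/Downloads/legoformer_s.ckpt checkpoints/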

Usage

The eval.py script can be used to evaluate the models on the ShapeNet dataset. It takes three inputs: the path to the config file, the path to the checkpoint, and the number of views.

The following command evaluates the multi-view model on 4 input views from ShapeNet:

python eval.py legoformer/config/legoformer_m.yaml checkpoints/legoformer_m.ckpt 4

The corresponding command for the single-view model is:

python eval.py legoformer/config/legoformer_s.yaml checkpoints/legoformer_s.ckpt 1

Cite this work

@misc{yagubbayli2021legoformer,
      title={LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction}, 
      author={Farid Yagubbayli and Alessio Tonioni and Federico Tombari},
      year={2021},
      eprint={2106.12102},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
