
DPHM

Paper | Supplemental | arXiv | Video | Project Page

This is the repository that contains source code for the paper:

DPHMs: Diffusion Parametric Head Models for Depth-based Tracking

1. Installation

Note that some of the steps below can take a while.

conda env create -f environment.yml   
conda activate DPHM

pip install -e .

# Install pytorch with CUDA support
conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.7 -c pytorch -c nvidia

# Install PytorchGeometry and helper packages with CUDA support
conda install pyg -c pyg
pip install pyg_lib torch_scatter torch_sparse torch_cluster torch_spline_conv -f https://data.pyg.org/whl/torch-2.0.0+cu117.html

# Install Pytorch3D with CUDA support
conda install -c fvcore -c iopath -c conda-forge fvcore iopath
conda install pytorch3d=0.7.4 -c pytorch3d

Next, create your local copy of the path configuration with cp src/dphm_tum/env_paths.template src/dphm_tum/env_paths.py, then fill in the paths in ./src/dphm_tum/env_paths.py according to your system. The comments in the template explain each entry.
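
For illustration only, a hypothetical env_paths.py could look like the following; the real variable names and entries are defined by the template and may differ:

# Hypothetical illustration only -- the actual variable names come from
# src/dphm_tum/env_paths.template and may differ on your system.
EXPERIMENT_DIR = "/path/to/experiments"     # where checkpoints and logs are stored
NPHM_DATA_DIR = "/path/to/nphm_dataset"     # preprocessed NPHM training data
DPHM_KINECT_DIR = "/path/to/dphm_kinect"    # DPHM-Kinect sequences for tracking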

Also, install ChamferDistancePytorch for nearest neighbor search:

cd ChamferDistancePytorch/chamfer3D
python setup.py install
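
For reference, here is a minimal sketch (not taken from this repo) of how the chamfer3D extension is typically used to find nearest-neighbor correspondences between two point clouds; the exact import path may differ depending on your installation:

# Minimal usage sketch of the chamfer3D CUDA extension (assumed import path).
import torch
from chamfer3D.dist_chamfer_3D import chamfer_3DDist

chamfer = chamfer_3DDist()
scan = torch.rand(1, 20000, 3, device="cuda")   # partial scan points (B, N, 3)
pred = torch.rand(1, 20000, 3, device="cuda")   # predicted surface samples (B, M, 3)

# dist1[b, n] is the squared distance from scan point n to its nearest predicted
# point, and idx1[b, n] is the index of that point (dist2/idx2 go the other way).
dist1, dist2, idx1, idx2 = chamfer(scan, pred)
loss = dist1.mean() + dist2.mean()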

Finally, fix some versioning:

pip install numpy==1.23.1
pip install pyopengl==3.1.5

2. Data and Model Checkpoints

2.1 DPHM-Kinect Data

To evaluate depth-based tracking, we collected a set of RGB-D sequences containing complicated facial expressions and fast transitions. To download the DPHM-Kinect dataset, you will need to fill out the terms of use. After we receive your application, we will share data access via the email address you provide.

2.2 Pretrained Model Checkpoints

We provide the pre-trained model checkpoints of NPHM with backward deformations, identity diffusion, and expression diffusion.

2.3 Training Data

For training, we use the NPHM dataset.

3. Train DPHMs

Before we train Diffusion Parametric Head Models, we need to train Neural Parametric Head Models (NPHMs) with backward deformations.

3.1 Pre-train NPHMs

To train NPHMs, please follow the NPHM instructions to preprocess the datasets for SDF field learning. You can then train your NPHMs with:

python scripts/training/launch.py --cfg-file configs/n3dmm_anc39_iden1344_expr200.yaml --exp-name nphm_backward

Alternatively, we suggest using the pretrained NPHM checkpoint with backward deformations, as training takes several days. You can test the pre-trained NPHMs by decoding and visualizing the fitted parametric latents on the training set. For example, to generate the fitted 3D head of the 100th identity with the 0th expression:

python scripts/diffusion/nphm_vis.py -iden 100 -expr 0

3.2 Train Identity / Expression Latent Diffusion Models

Based on the latents fitted on the training set, we use latent diffusion models to explicitly learn the distributions of identity and expression parametric latents. The latent diffusion model is a 1D UNet with attention layers, following DDPM.

DISPLAY=:0 xvfb-run -a  python scripts/diffusion/train_diff_1d_backward.py -cfg_file scripts/diffusion/configs/diff_iden_dim64_mults4.yaml -exp_name nphm_diff_iden_dim64_mults4
DISPLAY=:0 xvfb-run -a  python scripts/diffusion/train_diff_1d_backward.py -cfg_file scripts/diffusion/configs/diff_expre_dim64_mults4.yaml -exp_name nphm_diff_expre_dim64_mults4

We prepend DISPLAY=:0 xvfb-run -a because the randomly generated meshes are rendered to images during training for debugging.
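
As a rough illustration of this setup, here is a minimal sketch of training a 1D UNet latent diffusion model, assuming the denoising-diffusion-pytorch package; the repo's own configs and training code in scripts/diffusion/ are the reference, and the latent length and hyperparameters below are placeholders:

# Minimal sketch, not the repo's code: 1D UNet + attention latent diffusion
# in the DDPM style, using the denoising-diffusion-pytorch package.
import torch
from denoising_diffusion_pytorch import Unet1D, GaussianDiffusion1D

model = Unet1D(dim=64, dim_mults=(1, 2, 4, 8), channels=1)
diffusion = GaussianDiffusion1D(
    model,
    seq_length=128,        # placeholder; identity and expression latents differ in size
    timesteps=1000,
    objective="pred_noise",
)

# Identity (or expression) latents fitted during NPHM training,
# arranged as (batch, channels, latent_length).
latents = torch.randn(16, 1, 128)

optimizer = torch.optim.Adam(diffusion.parameters(), lr=1e-4)
loss = diffusion(latents)  # standard DDPM denoising loss on noised latents
loss.backward()
optimizer.step()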

3.3 Unconditionally generate 3D head avatars

After training the parametric latent diffusion models, we can sample random noise and apply the diffusion models to transform it into meaningful identity or expression latents.

DISPLAY=:0 xvfb-run -a  python scripts/diffusion/sample_diff_1d_backward.py  -cfg_file scripts/diffusion/configs/diff_iden_dim64_mults4.yaml -exp_name dphm_iden_dim64_mults4 
DISPLAY=:0 xvfb-run -a  python scripts/diffusion/sample_diff_1d_backward.py  -cfg_file scripts/diffusion/configs/diff_expre_dim64_mults4.yaml -exp_name dphm_expre_dim64_mults4 
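
Continuing the sketch above (again an assumption, not the repo's code), unconditional sampling simply reverses the diffusion process starting from Gaussian noise:

# Draw Gaussian noise and denoise it into identity/expression latents.
sampled_latents = diffusion.sample(batch_size=8)   # shape (8, 1, 128)
# Each sampled latent can then be decoded by the pretrained NPHM into a 3D head.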

4. Apply DPHMs for head tracking

The following gives instructions for running depth-based head reconstruction and tracking from commodity sensors using diffusion priors.

4.1 Installing the Preprocessing Pipeline

Our preprocessing pipeline relies on the FLAME model, so you will need an account on the FLAME website. Let us know if you have any trouble with that.

During FLAME tracking, we need facial landmarks. Please follow the instructions in scripts/preprocess/instructions.md. For ease of use, we also provide the detected landmarks in our DPHM-Kinect data.

4.2 Running the Preprocessing

First, we need to preprocess the data, which includes:

  • unprojecting depth and normal maps to partial scans (a sketch of this step follows the commands below)
  • detecting landmarks and obtaining 3D landmarks
  • fitting FLAME to get a rough initialization of the camera pose
python scripts/preprocess/00_create_kinect_scan_with_normals.py --datadir $DPHM_Kinect_dir
python scripts/preprocess/01_fit_flame_to_kinect_seq.py
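
For intuition, here is a minimal sketch of the depth unprojection step under a pinhole camera model; the actual script 00_create_kinect_scan_with_normals.py handles the Kinect calibration, masking, and normal estimation:

# Minimal sketch: back-project a depth map to a partial point cloud
# using pinhole intrinsics (fx, fy, cx, cy). Not the repo's implementation.
import numpy as np

def unproject_depth(depth, fx, fy, cx, cy):
    """depth: (H, W) array in meters; returns (N, 3) points in camera space."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    points = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    return points[points[:, 2] > 0]   # keep only pixels with valid depth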

4.3 Running the Tracking

Once the preprocessing is done, you can start the DPHM depth tracking using:

cfg_file=scripts/tracker/dphm_ncos.05_a.1_exprsmo1_rigsmo1_sds.5.yaml
exp_tag="ncos.05_a.1_exprsmo1_rigsmo1_sds.5"
DISPLAY=:0 xvfb-run -a  python scripts/tracker/tracker.py -cfg_file $cfg_file -exp_tag $exp_tag  -subj aria_talebizadeh_mouthmove  -init
DISPLAY=:0 xvfb-run -a  python scripts/tracker/tracker.py -cfg_file $cfg_file -exp_tag $exp_tag  -subj aria_talebizadeh_mouthmove  -incre
DISPLAY=:0 xvfb-run -a  python scripts/tracker/tracker.py -cfg_file $cfg_file -exp_tag $exp_tag  -subj aria_talebizadeh_mouthmove  -joint

Acknowledgement

Special thanks to the authors of NPHM for providing their neural 3DMM and to the authors of NeRSemble for preprocessing their dataset.

Citation

If you find our dataset or paper useful for your research, please include the following citation:

@inproceedings{tang2024dphms,
      title={DPHMs: Diffusion Parametric Head Models for Depth-based Tracking},
      author={Tang, Jiapeng and Dai, Angela and Nie, Yinyu and Markhasin, Lev and Thies, Justus and Niessner, Matthias},
      booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
      year={2024}
}

Contact

If you have questions regarding the dataset or code, please email Jiapeng Tang. We will get back to you as soon as possible.
