Skip to content

Latest commit

 

History

History
80 lines (51 loc) · 3.97 KB

README.md

File metadata and controls

80 lines (51 loc) · 3.97 KB

Black-box Dataset Ownership Verification via Backdoor Watermarking

This is the official implementation of our paper Black-box Dataset Ownership Verification via Backdoor Watermarking, accepted by IEEE Transactions on Information Forensics and Security (2023). This research project is developed based on Python 3 and Pytorch, created by Yiming Li and Mingyan Zhu.

Citation

If you think this work or our codes are useful for your research, please cite our paper via:

@article{li2023DVBW,
  title={Black-box Dataset Ownership Verification via Backdoor Watermarking},
  author={Li, Yiming and Zhu, Mingyan and Yang, Xue and Jiang, Yong and Wei, Tao and Xia, Shu-Tao},
  journal={IEEE Transactions on Information Forensics and Security},
  volume = {18},
  pages = {2318--2332},
  year={2023},
  publisher={IEEE}
}

The Main Pipeline of Our Methods

pipeline

Requirements

To install requirements:

pip install -r requirements.txt

Running Examples

For CV tasks

Please follow the instructions in main.sh. Before you run our codes, please make sure you have downloaded the CIFAR-10 or the sub-imagenet-200 dataset and put it in the ./CIFAR/data or ./Sub-ImageNet/data directory. Besides, for convenience, we also provide the checkpoints so that you can run the hypothesis-test directly.

For example, to train a watermarking ResNet-18 model on CIFAR-10 and test it, you can run the following command:

# make sure you are in the CIFAR directory `./CIFAR`
# ResNet Standard Training
python3 train_standard.py --gpu-id 0 --checkpoint './checkpoint/benign/resnet'

# ResNet Backdoor watermarked training
python3 train_watermarked.py --gpu-id 0 --poison-rate 0.1 \
   --checkpoint 'checkpoint/infected/resnet_badnets_cross_0_010' \
   --trigger './triggers/Trigger_cross.png' --alpha './triggers/Alpha_cross.png' --y-target 0

# ResNet Backdoor watermarked testing
python3 Ttest.py --gpu-id 0 --model 'resnet' --trigger './triggers/Trigger_cross.png' --alpha './triggers/Alpha_cross.png' \
   --model-path './checkpoint/infected/resnet_badnets_cross_0_010/checkpoint.pth.tar' --target-label 0 --num-img 100
python3 Wtest.py --gpu-id 1 --model 'resnet' --trigger './triggers/Trigger_cross.png' --alpha './triggers/Alpha_cross.png' \
    --model-path './checkpoint/infected/resnet_badnets_cross_0_010/checkpoint.pth.tar' --target-label 0 --num-img 100

For NLP tasks

We don't provide the codes for NLP tasks due to some copyright issues. In general, we built our codes based on "A backdoor attack against lstm-based text classification systems".

If you are interested in these tasks, please contact the authors of the above paper via email.

For GNN tasks

We don't provide the codes for GNN tasks also due to some copyright issues. In general, we built our codes based on "Backdoor Attacks to Graph Neural Networks".

If you are interested in these tasks, please follow their github repository.

License

This project is licensed under the terms of the Apache License 2.0. See the LICENSE file for the full text.

Acknowledgments

We built our code based on BackdoorBox, Open-sourced Dataset Protection, and "Backdoor Attacks to Graph Neural Networks". We also thank the authors of "A backdoor attack against lstm-based text classification systems" for providig their codes.

Thanks for their excellent works!