This is the official repository for the EMNLP 2024 paper "Adaptive Immune-based Sound-Shape Code Substitutions for Adversarial Chinese Text Attacks" by Ao Wang, Xinghao Yang, Chen Li, Baodi Liu, and Weifeng Liu.
You can install the required packages as follows:

```bash
pip install -r requirement.txt
```
Note that we have made some changes to the original textattack package, so you have to replace it with ours (see the sketch after the list below). The changes are:
- Segment the Chinese text with jieba only once, instead of re-segmenting it multiple times in the attack flow.
- Add metric outputs for Chinese text, e.g., multilingual USE, and support BERTScore.
- Support Chinese WordNet candidates.
- Save the checkpoint to the `checkpoints` folder when the attack is done.
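A minimal sketch of the replacement, assuming the modified package ships in a `textattack/` folder at the root of this repo (adjust the path if it lives elsewhere):

```bash
# Sketch: swap the stock textattack for the modified copy in this repo.
# The ./textattack path is an assumption about the repo layout.
pip uninstall -y textattack
pip install -e ./textattack
```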
We fine-tune victim models on several datasets, starting from pre-trained models. You can download the models and datasets from https://huggingface.co/WangA and put them into the `models` and `attack_datasets` folders, respectively.
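If you prefer the command line, one way to fetch them is with `huggingface-cli` (a sketch; the exact repo IDs, e.g., `WangA/bert-chinanews`, and target folders are assumptions based on the links above):

```bash
# Sketch: download a victim model and the attack datasets from the Hub.
# The repo IDs and local folders below are assumptions.
pip install -U "huggingface_hub[cli]"
huggingface-cli download WangA/bert-chinanews --local-dir ./models/bert-chinanews
huggingface-cli download WangA/attack_datasets --local-dir ./attack_datasets
```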
You can conduct untargeted attack experiments as follows:

```bash
python untargeted_attack.py -t mix-ssc -s ia -m bert-chinanews -n 500
```
The log file will be stored in the `checkpoints` folder. You can get a summary of the attack by running:

```bash
python untargeted_attack.py -ckp YOUR_CHECKPOINT
```

You can find more details by checking `untargeted_attack.py`.
You can conduct targeted attack experiments on Chinanews by specifying a target label, e.g., 0, as follows:

```bash
python targeted_attack.py -t mix-ssc -s ia -m bert-chinanews -n 500 -tgt 0
```
The log file will be stored in the `checkpoints` folder. You can find more details by checking `targeted_attack.py`.
You can conduct adversarial training experiments as follows:
- Download the datasets from https://huggingface.co/WangA/attack_datasets and put them in `./attack_datasets`.
- Run the demo:

  ```bash
  python adv_train.py -a issc -t jd -model bert -nadv 500
  ```
We use a default training setting for the adversarial training experiments; you can check `adv_train.py` for more details and customize it easily.
You can conduct transfer attacks against PLMs via a checkpoint as follows:

```bash
python transfer_attack.py -model TARGET_MODEL_FOLDER -source SOURCE_MODEL -ckp YOUR_CHECKPOINT -d DATASET
```
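For example (hypothetical names; assumes a `roberta-chinanews` victim folder and a checkpoint produced by the untargeted attack above):

```bash
# Hypothetical example: transfer adversarial examples crafted against
# bert-chinanews to a roberta-chinanews victim on the Chinanews dataset.
python transfer_attack.py -model roberta-chinanews -source bert-chinanews -ckp YOUR_CHECKPOINT -d chinanews
```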