Saving and loading HF transformer model fine-tuned with PL? #8893
-
I am fine-tuning Hugging Face transformer models, essentially exactly as shown in the example in the PyTorch Lightning docs, where we instantiate the LightningModule with something like this:

```python
from pytorch_lightning import LightningModule
from transformers import AutoConfig, AutoModelForSequenceClassification

class GLUETransformer(LightningModule):
    def __init__(self, model_name_or_path, num_labels, ...):
        super().__init__()
        self.config = AutoConfig.from_pretrained(model_name_or_path, num_labels=num_labels)
        self.model = AutoModelForSequenceClassification.from_pretrained(
            model_name_or_path, config=self.config
        )
```

But I have been confused about how I should be saving and loading checkpoints. When saving checkpoints, should I be saving in the Hugging Face format with `save_pretrained`, or using Lightning's own checkpointing?
Replies: 2 comments 1 reply
-
Dear @brijow,

You should be using the second approach. An even better one would be to rely on `ModelCheckpoint` to save the checkpoints and provide `Trainer(resume_from_checkpoint=...)` for reloading all the states.

Best,
T.C
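In code, that suggestion could look roughly like the sketch below. `GLUETransformer` is the module from the question; the datamodule `dm`, the `"val_loss"` metric name, and the hyperparameter values are illustrative assumptions. (In recent Lightning releases, `resume_from_checkpoint` has been superseded by `trainer.fit(..., ckpt_path=...)`.)

```python
from pytorch_lightning import Trainer
from pytorch_lightning.callbacks import ModelCheckpoint

# Keep the best checkpoint according to a metric the module logs
# (assumed here to be "val_loss").
checkpoint_callback = ModelCheckpoint(
    dirpath="checkpoints/",
    monitor="val_loss",
    save_top_k=1,
)

# `GLUETransformer` and `dm` (a LightningDataModule) come from the
# question's setup and are assumed to be defined elsewhere.
model = GLUETransformer(model_name_or_path="bert-base-cased", num_labels=2)
trainer = Trainer(max_epochs=3, callbacks=[checkpoint_callback])
trainer.fit(model, datamodule=dm)

# Later: restore *all* states (weights, optimizer, LR schedulers, epoch/step
# counters) and continue training where the run left off.
trainer = Trainer(resume_from_checkpoint=checkpoint_callback.best_model_path)
trainer.fit(model, datamodule=dm)
```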
-
@brijow Is there a way to unpack HF PL checkpoints into their constituents (e.g. `pytorch_model.bin`, `config.json`, `tokenizer.json`, etc.), as usually found in models hosted on the HF Hub? Most importantly, how would I extract just the `pytorch_model.bin` from the PL checkpoint?
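One possible approach, sketched below under the assumption that the checkpoint was written by the `GLUETransformer` above (so the HF model's weights sit under a `model.` prefix in the Lightning `state_dict`); the path `example.ckpt` is hypothetical:

```python
import torch

ckpt = torch.load("example.ckpt", map_location="cpu")

# Keep only the HF submodule's weights and strip the "model." prefix, which
# yields a state dict in the layout of a standalone pytorch_model.bin.
hf_state_dict = {
    key[len("model."):]: value
    for key, value in ckpt["state_dict"].items()
    if key.startswith("model.")
}
torch.save(hf_state_dict, "pytorch_model.bin")

# Alternatively, if the module called self.save_hyperparameters() in __init__,
# reload it and let Hugging Face write pytorch_model.bin + config.json itself:
model = GLUETransformer.load_from_checkpoint("example.ckpt")
model.model.save_pretrained("exported_model/")
```

Note that tokenizer files (`tokenizer.json`, vocab files, etc.) are not stored in the Lightning checkpoint at all; they would have to be saved separately from the tokenizer object via `tokenizer.save_pretrained(...)`.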