
Commit

Merge pull request #166 from rasbt/update-lora-init
Update lora init
rasbt committed May 20, 2024
2 parents a8a2801 + f3a2e93 commit 451a629
Showing 3 changed files with 186 additions and 103 deletions.

3 comments on commit 451a629

@d-kleine (Contributor) commented on 451a629 May 21, 2024


@rasbt Maybe you could also add a check for rank=0 like here:
https://github.com/microsoft/LoRA/blob/4c0333854cb905966f8cc4e9a74068c1e507c7b7/loralib/layers.py#L46C1-L53C32

I think it could also be useful to add a check for alpha=0, because - regardless of the rank - this would nullify the update, resulting in no effective adaptation of the model.

This might also be relevant for Appendix E.
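
A minimal sketch of what such a guard could look like in a LoRA layer's constructor, assuming an update of the form `alpha * (x @ A @ B)`; the class and argument names (`LoRALayer`, `in_dim`, `out_dim`, `rank`, `alpha`) are illustrative, not taken from the repository or from the linked loralib code:

```python
import math
import torch

class LoRALayer(torch.nn.Module):
    def __init__(self, in_dim, out_dim, rank, alpha):
        super().__init__()
        # Hypothetical validation as suggested above: reject configurations
        # that would make the LoRA update a no-op.
        if rank <= 0:
            raise ValueError(f"rank must be a positive integer, got {rank}")
        if alpha == 0:
            raise ValueError("alpha=0 nullifies the LoRA update (W + alpha*A@B == W)")

        # A gets a fan-based random init; B starts at zero so the
        # initial low-rank update is zero.
        self.A = torch.nn.Parameter(torch.empty(in_dim, rank))
        torch.nn.init.kaiming_uniform_(self.A, a=math.sqrt(5))
        self.B = torch.nn.Parameter(torch.zeros(rank, out_dim))
        self.alpha = alpha

    def forward(self, x):
        # Low-rank update scaled by alpha
        return self.alpha * (x @ self.A @ self.B)
```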

@rasbt (Owner, Author) commented on 451a629 May 22, 2024


@d-kleine Thanks for the suggestion, but I'd say it doesn't really need additional code to check for that.
If you set the rank to 0, there will already be a PyTorch warning: UserWarning: Initializing zero-element tensors is a no-op. And if you set alpha to 0, that's kind of similar to setting the learning rate to 0, which PyTorch also allows you to do, even though the model isn't actually being trained then. But thanks for suggesting!
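
A small sketch illustrating both points, again assuming an update of the form `alpha * (x @ A @ B)`; the variable names are made up for the example:

```python
import torch

# rank = 0: applying a fan-based init to a zero-element tensor emits
# "UserWarning: Initializing zero-element tensors is a no-op"
# (in recent PyTorch versions).
A = torch.nn.Parameter(torch.empty(16, 0))
torch.nn.init.kaiming_uniform_(A, a=5 ** 0.5)

# alpha = 0: the scaled low-rank update is exactly zero, so the adapted
# layer behaves like the frozen base layer (comparable to training with
# a learning rate of 0).
x = torch.randn(2, 16)
A, B = torch.randn(16, 4), torch.randn(4, 8)
alpha = 0
update = alpha * (x @ A @ B)
print(torch.all(update == 0))  # tensor(True)
```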

@d-kleine (Contributor)


Alright, makes sense - thank you! 👍🏻
