
QuIP results on Llama-2 models not reproducible. #15

Open
seannz opened this issue Oct 8, 2024 · 0 comments
seannz commented Oct 8, 2024

Hello Jerry,

Thank you for sharing your QuIP code with us! I'm running into some trouble getting QuIP to run correctly on the Llama-2 (7b/13b/70b) models:

  1. The latest llama.py seems slightly broken; would you be able to verify whether it works as intended for you?
  2. I put together my own llama.py using GPTQ's version of the code plus your opt.py, and got it to run without errors for Llama-2-7b-hf. However, the evaluation PPLs seem rather high (wikitext2: 15.611, ptb-new: 353.37), which suggests I must have done something wrong...
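For what it's worth, I'm reducing the per-sample losses to a perplexity the same way the GPTQ-style eval loops do: exp of the total negative log-likelihood divided by the total token count. A minimal sketch of just that reduction (toy numbers, no model involved; `perplexity` is an illustrative helper, not code from either repo):

```python
import math

def perplexity(nlls, seq_len):
    """Perplexity = exp(total NLL / total tokens).

    `nlls` holds one summed negative log-likelihood per evaluation
    chunk (as a GPTQ-style eval loop accumulates them), and `seq_len`
    is the number of tokens per chunk.
    """
    return math.exp(sum(nlls) / (len(nlls) * seq_len))

# Toy example: two chunks of 4 tokens, average NLL of ln(2) per token,
# so the perplexity should come out to exactly 2.0.
nlls = [4 * math.log(2), 4 * math.log(2)]
print(perplexity(nlls, 4))  # 2.0
```

So unless I've mangled this reduction somewhere, the high numbers should be coming from the quantization step itself rather than the eval arithmetic.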

Any help/suggestions would be appreciated!

Sean

@seannz changed the title from "Some trouble with correctly running QuIP on Llama-2 models" to "QuIP results on Llama-2 models not reproducible." on Oct 14, 2024