-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Different with gemma2 / gemma #236
Comments
Hi. Apologies for the confusion. In our docs here I notice that we give the correct link to the Kaggle PyTorch Gemma 2 2B (instruction-tuned) checkpoint, which is this link. However, the documentation mistakenly identifies this as just "Gemma" (which one would assume refers to just Gemma version 1). We will update the docs for additional clarity. From your error, I suspect you are using a Hugging Face checkpoint for Gemma 2 2B. It is a high priority to support HF checkpoints for Gemma 2, but it is not available yet. Can you try with the Kaggle PyTorch checkpoint for Gemma 2 2B and see if that resolves the issue? |
hi @talumbau , it can download two files. e.g., model.ckpt, tokenizer.model . so i just change model.ckpt path in convert_gemma2_to_tflite.py ? thanks you again!! |
Yes, please download the
|
For fixing error due to using Hugging Face checkpoint you can apply below diff:
but conversion still runs out of memory on a 80GB system memory colab |
Hi @a8nova thanks very much for the diff here. Can you prepare it as a PR? My understanding is that the quantizer code is landing a fix to reduce memory usage very soon. I think once that fix is in it will be a much better experience on colab |
Hi @talumbau! Sure, I can prepare a PR. How do you suggest we handle the attributes differences between HF and Kaggle checkpoints? Or do you just want me to apply above diff? I am worried that will break the conversion for people using kaggle checkpoints. |
Marking this issue as stale since it has been open for 7 days with no activity. This issue will be closed if no further activity occurs. |
This issue was closed because it has been inactive for 14 days. Please post a new issue if you need further assistance. Thanks! |
Description of the bug:
hi @pkgoogle ,
i try to use convert_gemm2_to_tflite.py to treansfer model. i can see 2 error when i transfer.
Actual vs expected behavior:
No response
Any other information you'd like to share?
No response
The text was updated successfully, but these errors were encountered: