Running demo code on V100 #8

Open
mikeleatila opened this issue Jan 16, 2025 · 2 comments
Comments

mikeleatila commented Jan 16, 2025

Hi,

Congrats on the great work!

I only have V100 GPUs available to me.

Is there a way to run your inference/demo code on them (e.g. without flash attention)? If so, how?

Many thanks in advance!

lxtGH (Collaborator) commented Jan 17, 2025

@zhang-tao-whu Check the issues. It seems like you need to modify the code to remove the flash attention.

mikeleatila (Author) commented Jan 17, 2025

@lxtGH @zhang-tao-whu Thanks a lot. Do you have any hints on how to remove flash attention? Can it be done by passing a parameter or something similar? I have been looking around and trying things out, but it still does not work.
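
For reference, here is a minimal sketch of what the "pass a parameter" approach could look like. It assumes the model is loaded through Hugging Face transformers, whose `from_pretrained` accepts an `attn_implementation` argument. Whether this project's model code honours that argument is not confirmed in this thread; custom model code may hard-code flash attention and need a source edit instead. The helper names `pick_attn_implementation` and `detect_capability` are hypothetical. The capability check relies on the V100 being compute capability 7.0, while FlashAttention-2 targets Ampere (8.0) and newer.

```python
from typing import Optional, Tuple


def pick_attn_implementation(capability: Optional[Tuple[int, int]]) -> str:
    """Choose an attention backend name from a CUDA compute capability.

    Hypothetical helper, not part of any library.
    """
    if capability is None:
        # No CUDA device detected: use the plain PyTorch attention path.
        return "eager"
    major, _minor = capability
    # FlashAttention-2 targets Ampere (8.x) or newer. A V100 reports (7, 0),
    # so it falls back to PyTorch's scaled_dot_product_attention ("sdpa").
    return "flash_attention_2" if major >= 8 else "sdpa"


def detect_capability() -> Optional[Tuple[int, int]]:
    """Return the compute capability of GPU 0, or None if unavailable."""
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_capability(0)


if __name__ == "__main__":
    attn = pick_attn_implementation(detect_capability())
    print(attn)
    # Assumed usage (model_path is a placeholder; the model code may ignore
    # attn_implementation if it selects flash attention internally):
    #   from transformers import AutoModel
    #   model = AutoModel.from_pretrained(
    #       model_path, attn_implementation=attn, trust_remote_code=True
    #   )
```

If the loader ignores `attn_implementation`, the remaining option is the one suggested above: edit the model code so that it no longer imports or calls flash attention.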
