Fix LLaMA model loading · mdegans/weave@77b9eda

Commit

Fix LLaMA model loading

Fix hot model reload with `drama_llama` backend not working #7

* Loading is now entirely in the worker thread.
* During load, the settings are locked but the brief blocking before is gone.
* Several new `Request` and `Response` messages were added.

The issue with the panic for unsupported models (on Metal at least) is still there. That requires changes in `llama.cpp` itself or duplication of code.

Loading branch information

mdegans committed Jun 14, 2024

1 parent d75254a commit 77b9eda

0 comments on commit `77b9eda`

Please sign in to comment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit

There are no files selected for viewing

0 comments on commit `77b9eda`

Commit

There are no files selected for viewing

0 comments on commit 77b9eda

0 comments on commit `77b9eda`