Skip to content

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Update ReadMe and test for cpu_offloading
#1013 opened Dec 23, 2024 by dsikka Loading…
Dataset Processing Args
#1006 opened Dec 20, 2024 by kylesayrs Draft
[E2E Testing] KV-Cache
#1004 opened Dec 20, 2024 by horheynm Loading…
Remove Neural Magic copyright from files
#992 opened Dec 18, 2024 by kylesayrs Loading…
Add example for fp8 kv cache of phi3.5 and gemma2
#991 opened Dec 18, 2024 by mgoin Loading…
[Test Fix] Sparse model reload
#974 opened Dec 11, 2024 by horheynm Draft
Bitmask test
#956 opened Dec 5, 2024 by rahul-tuli Draft
Dataset split fallbacks
#953 opened Dec 4, 2024 by kylesayrs Loading…
Add int8 discussion section in readme
#944 opened Nov 29, 2024 by kylesayrs Loading…
Remove uses of get_observer
#939 opened Nov 27, 2024 by kylesayrs Loading…
[E2E Testing] Add recipe check vllm e2e
#929 opened Nov 21, 2024 by horheynm Loading…
Allow Shortcutting Min-max Observer
#887 opened Nov 1, 2024 by kylesayrs Loading…
FSDP utils cleanup
#854 opened Oct 19, 2024 by kylesayrs Loading…
Awq re implementation
#824 opened Oct 7, 2024 by rahul-tuli Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.