-
Notifications
You must be signed in to change notification settings - Fork 211
Conversation
Signed-off-by: n1ck-guo <[email protected]>
⚡ Required checks status: All passing 🟢Groups summary🟢 Format Scan Tests workflow
These checks are required after the changes to 🟢 Optimize Unit Test workflow
These checks are required after the changes to 🟢 Engine Unit Test workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
for more information, see https://pre-commit.ci
...ion_for_transformers/transformers/modeling/kv_cahe_compression/h2o_real_drop/modify_llama.py
Outdated
Show resolved
Hide resolved
...ion_for_transformers/transformers/modeling/kv_cahe_compression/h2o_real_drop/modify_llama.py
Outdated
Show resolved
Hide resolved
...on_for_transformers/transformers/modeling/kv_cahe_compression/h2o_sim_drop/modify_gptneox.py
Outdated
Show resolved
Hide resolved
Signed-off-by: biao.fang <[email protected]>
Signed-off-by: biao.fang <[email protected]>
for more information, see https://pre-commit.ci
intel_extension_for_transformers/transformers/modeling/kv_cache_compression/h2o.py
Outdated
Show resolved
Hide resolved
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
for more information, see https://pre-commit.ci
...nsion_for_transformers/transformers/modeling/kv_cache_compression/models/modeling_mistral.py
Outdated
Show resolved
Hide resolved
examples/huggingface/pytorch/text-generation/h2o/run_lm_eval_harness.py
Outdated
Show resolved
Hide resolved
Signed-off-by: n1ck-guo <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: biao.fang <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
Could we add a document introducing what h2o is? |
format scan improved by #1647. merged. |
Signed-off-by: n1ck-guo <[email protected]>
for more information, see https://pre-commit.ci
add in the example/readme |
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
for more information, see https://pre-commit.ci
Type of Change
feature
Description
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
paper
NTD
Expected Behavior & Potential Risk
None
How has this PR been tested?
how to reproduce the test (including hardware information)
Dependency Change?
any library dependency introduced or removed