-
Notifications
You must be signed in to change notification settings - Fork 138
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Recompute KV cache for Phi3 when switching from short to long factor
#1161
opened Dec 20, 2024 by
ajindal1
Loading…
Address a DML regression caused by the continuous decoding changes
#1159
opened Dec 19, 2024 by
baijumeswani
Loading…
Avoid potential desynchronization of cpu and device memory
#1132
opened Dec 9, 2024 by
aciddelgado
Loading…
Add Quantized_model + float LoRA model scenario to model builder
#1043
opened Nov 7, 2024 by
apsonawane
Loading…
Add an IChatClient implementation to OnnxRuntimeGenAI
#987
opened Oct 16, 2024 by
stephentoub
Loading…
Make Microsoft.ML.OnnxRuntimeGenAI.Tokenizer a Microsoft.ML.Tokenizers.Tokenizer
#970
opened Oct 11, 2024 by
stephentoub
Loading…
ProTip!
Mix and match filters to narrow down what you’re looking for.