Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature]- Support for the microsoft/Phi-3-vision-128k-instruct Vision Model
#1637
opened May 22, 2024 by
sabarish244
lmdeploy搭建的服务,是否支持通过传输stop_words的方式来控制模型输出
awaiting response
#1631
opened May 21, 2024 by
qiuxuezhe123
2 tasks
使用KV cache(int8或int4)量化internvl-v1.5后,显存反而增加了
#1626
opened May 21, 2024 by
qingchunlizhi
1 of 2 tasks
[Feature] Layer Wise Calibration and Quantization of Models (To quantize model on Low VRAM GPU)
#1625
opened May 21, 2024 by
Tushar-ml
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.