Conversation content exceeds the limit: Query or prefix is too long #1705

Closed
sjn920336697 opened this issue Dec 6, 2023 · 1 comment
Labels
🐞 bug Something isn't working

Comments

@sjn920336697

Dify version

0.3.32

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

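Dify reports: Query or prefix prompt is too long, you can reduce the prefix prompt, or shrink the max token, or switch to a llm with a larger token limit size.

The backend model is ChatGLM2, deployed locally with Xinference, with max_tokens set to 8192 (8K). Entering a Chinese text of more than 2,000 characters directly in Xinference produces a normal answer.

In a chat application built on Dify, with max_tokens set to 450, entering the same text triggers the error above.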
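
For readers trying to reason about the numbers above, here is a minimal sketch of the kind of budget check that could produce this message. It is an assumption, not Dify's actual code: the function names, the `context_size` parameter, the 3000-token prompt estimate, and the 2048 figure are all hypothetical and chosen only for illustration.

```python
# Hypothetical sketch of a prompt budget check; not taken from Dify's source.
# Assumption: the prompt must fit in whatever is left of the context window
# after reserving max_tokens for the completion.

def remaining_prompt_budget(context_size: int, max_tokens: int) -> int:
    """Tokens left for the prompt once the completion budget is reserved."""
    return context_size - max_tokens


def prompt_fits(prompt_tokens: int, context_size: int, max_tokens: int) -> bool:
    """True if the prompt fits in the remaining budget."""
    return prompt_tokens <= remaining_prompt_budget(context_size, max_tokens)


# Illustrative numbers only: 3000 is a made-up token count for the input text.
print(prompt_fits(3000, context_size=8192, max_tokens=450))  # fits in an 8K window
print(prompt_fits(3000, context_size=2048, max_tokens=450))  # would not fit in a smaller window
```

Under this reading, the same input and the same max_tokens can pass or fail depending on the context size the application assumes for the model, which may differ from the 8192 configured in Xinference.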

✔️ Expected Behavior

A normal reply.

❌ Actual Behavior

Query or prefix prompt is too long, you can reduce the prefix prompt, or shrink the max token, or switch to a llm with a larger token limit size.

@sjn920336697 sjn920336697 added the 🐞 bug Something isn't working label Dec 6, 2023
@crazywoola
Member

Close due to #1542
