The o1 model sometimes fails to respond properly #4192
-
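📦 Deployment environment: Official Preview
📌 Software version: v1.20.2
💻 System environment: Windows
🌐 Browser: Chrome
🐛 Problem description: Used directly at https://chat-preview.lobehub.com
📷 Steps to reproduce: With the token limit set to its maximum, use the o1-mini model
🚦 Expected result: A normal response is returned
📝 Supplementary information: No response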
Replies: 7 comments
-
👀 @huoxianribao Thank you for raising an issue. We will look into the matter and get back to you as soon as possible.
-
📦 Deployment environment: Official Preview
📌 Software version: v1.20.2
💻 System environment: Windows
🌐 Browser: Chrome
🐛 Problem description: Used directly at https://chat-preview.lobehub.com
📷 Steps to reproduce: With the token limit set to its maximum, use the o1-mini model
🚦 Expected result: A normal response is returned
📝 Supplementary information: No response
-
@huoxianribao Hello! I'm Dosu, a bot that can help you solve bugs, answer questions, and become a contributor. I'll help you while you wait for a human maintainer.
I found a similar discussion that might be relevant to your issue. The discussion titled 连接性检查,无相应 mentions that updating to the latest version (124.2) resolved the issue [1].
Regarding the 504 Gateway Timeout error, common causes include server overload, network issues, upstream server issues, and configuration errors. Here are some troubleshooting steps you can follow:
Additionally, a high token limit such as 32000 can lead to performance issues, longer processing time, and higher costs. Setting a lower token limit is recommended to avoid these issues [3].
For the 'o1' model, the framework includes specific error handling for invalid API keys, business errors, and unsupported locations, so that failures produce meaningful feedback [4][5][6].
If you have already tried these steps and the issue persists, consider updating to the latest version of the framework, as suggested in the similar discussion.
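The advice about lowering the token limit could be applied with a small guard like the one below. This is only a sketch: `clamp_max_tokens` and the default ceiling of 4096 are hypothetical and do not come from the LobeChat codebase or from any provider's documented limits.

```python
def clamp_max_tokens(requested: int, ceiling: int = 4096) -> int:
    """Cap a requested completion-token limit at a ceiling.

    The ceiling of 4096 is an illustrative assumption, not a documented limit.
    """
    if requested < 1:
        raise ValueError("requested must be a positive integer")
    # Keep small requests unchanged; reduce oversized ones to the ceiling.
    return min(requested, ceiling)


if __name__ == "__main__":
    print(clamp_max_tokens(32000))  # the value mentioned in this thread
    print(clamp_max_tokens(1024))
```

A guard like this only limits how long a single completion can run. Whether the resulting response time is acceptable still depends on the model and the deployment.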
-
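The 504 is caused by the server response timing out. Vercel Edge Runtime requires the first byte of the response to arrive within 25 s, and this cannot be solved at the application layer. A temporary workaround is to switch to client request mode, or to deploy it yourself with Docker.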
-
✅ @huoxianribao This issue is closed. If you have any questions, you can comment and reply.
-
The 504 is caused by a server response timeout. Vercel Edge Runtime requires the first byte of the response within 25 seconds, and the application layer cannot work around that. A temporary solution is to switch to client request mode, or to deploy it yourself using Docker.
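To illustrate why a slow model call ends in a 504, here is a minimal simulation of a gateway that enforces a first-byte deadline. It is a sketch under stated assumptions, not Vercel's implementation: `proxy_with_first_byte_deadline` and `slow_model` are hypothetical names, the 25-second figure is the one quoted in this thread, and the model call is assumed to return nothing until it has finished.

```python
import asyncio

# Deadline quoted in this thread for Vercel Edge Runtime (first byte).
FIRST_BYTE_DEADLINE_S = 25.0


async def slow_model(delay_s: float, text: str = "done") -> str:
    # Hypothetical stand-in for a model call that replies only once it is finished.
    await asyncio.sleep(delay_s)
    return text


async def proxy_with_first_byte_deadline(upstream, deadline_s: float = FIRST_BYTE_DEADLINE_S):
    """Return (status, body); 504 if the upstream produces nothing before the deadline."""
    try:
        body = await asyncio.wait_for(upstream(), timeout=deadline_s)
    except asyncio.TimeoutError:
        return 504, "Gateway Timeout"
    return 200, body


if __name__ == "__main__":
    # Short delays and deadlines so the demo finishes quickly.
    print(asyncio.run(proxy_with_first_byte_deadline(lambda: slow_model(0.2), deadline_s=0.05)))
    print(asyncio.run(proxy_with_first_byte_deadline(lambda: slow_model(0.01, "hi"), deadline_s=1.0)))
```

In this model the deadline is enforced outside the upstream call, which is consistent with the point above that the application itself cannot extend it. Sending the request from the client, or self-hosting, takes that gateway out of the path.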