xinference启动一段时间后，embedding API访问异常,错误详情：[地址=0.0.0.0:37167,进程ID=193948] [Errno 5] 输入/输出错误,

wwwo4jvm 于 10个月前发布在其他

关注(0)|答案(2)|浏览(85)

xinference, version 0.12.2.post1
CUDA Version: 12.4

0.12.2.post1

source venv_xinference/bin/activate
XINFERENCE_MODEL_SRC=modelscope xinference-local --host 0.0.0.0 --port 9997

启动inference
launch embedding model bge-large-en-v1.5 and llm model mistral-instruct-v0.2
多个人本地启动自己的chatchat, 使用chatchat的过程中会反复调用embedding API, 一直是可以正常工作的
过一段时间后，基本过一天再访问embdding模型就会报错如下，UI上看embbding模型还是正常的
curl http://0.0.0.0:9997/v1/embeddings -H "Content-Type: application/json" -d '{
"input": "测试ebmeddings",
"model": "bge-large-en-v1.5"
}'
{"detail":"[address=0.0.0.0:37167, pid=193948] [Errno 5] Input/output error"}(venv_xinference)
重启xinference, 重新launch embedding model bge-large-en-v1.5 and llm model mistral-instruct-v0.2，又可以正常访问embedding API了

要怎么解决这个一段时间后embedding API访问异常的问题？

2条答案

请查看以下链接是否有所帮助：https://stackoverflow.com/questions/52376942/python-ioerror-errno-5-input-output-error
感谢，通过使用命令nohup xinference-local --host 0.0.0.0 --port 9997 > nohup.out 2>&1 &而不是原始的xinference-local --host 0.0.0.0 --port 9997 &解决了问题。