ragflow [Bug]:在进行自我RAG时,知识库未包含,

3mpgtkmj  于 4个月前  发布在  其他
关注(0)|答案(2)|浏览(162)

是否存在相同问题的现有问题?

  • 我已检查了现有的问题。

分支名称

main

提交ID

5ec19b5

其他环境信息

additional context from self rag:
100.82.53.123 - - [19/Jun/2024:17:20:18 -0400] "POST /api/chat HTTP/1.1" 200 935 "-" "ollama-python/0.1.8 (x86_64 linux) Python/3.10.12" "{\x22model\x22: \x22llama3:70b-instruct-fp16\x22, \x22messages\x22: [{\x22role\x22: \x22system\x22, \x22content\x22: \x22\x5Cn        You are an expert at query expansion to generate a paraphrasing of a question.\x5Cn        I can't retrieval relevant information from the knowledge base by using user's question directly.     \x5Cn        You need to expand or paraphrase user's question by multiple ways such as using synonyms words/phrase, \x5Cn        writing the abbreviation in its entirety, adding some extra descriptions or explanations, \x5Cn        changing the way of expression, translating the original question into another language (English/Chinese), etc. \x5Cn        And return 5 versions of question and one is from translation.\x5Cn        Just list the question. No other words are needed.\x5Cn    \x22}, {\x22role\x22: \x22user\x22, \x22content\x22: \x22list cyber security problems that are evidence of a latent security breach\x22}], \x22stream\x22: false, \x22format\x22: \x22\x22, \x22options\x22: {\x22temperature\x22: 0.8}, \x22keep_alive\x22: -1}"
100.82.53.123 - - [19/Jun/2024:17:20:20 -0400] "POST /api/embeddings HTTP/1.1" 200 20625 "-" "ollama-python/0.1.8 (x86_64 linux) Python/3.10.12" "{\x22model\x22: \x22mxbai-embed-large\x22, \x22prompt\x22: \x22Here are five paraphrased questions:\x5Cn\x5Cn1. What are the signs or indicators of an underlying cybersecurity threat?\x5Cn2. List potential warning signals of a previously undetected security compromise\x5Cn3. Identify common manifestations of hidden network vulnerabilities\x5Cn4. What are the typical red flags of an unreported cyber attack?\x5Cn5. \x5Cu00bfCu\x5Cu00e1les son los problemas de seguridad inform\x5Cu00e1tica que sugieren una violaci\x5Cu00f3n de seguridad latente?\x22, \x22options\x22: {}, \x22keep_alive\x22: null}"
100.82.53.123 - - [19/Jun/2024:17:23:17 -0400] "POST /api/chat HTTP/1.1" 200 61892 "-" "ollama-python/0

实际行为

我在ollama和ragflow之间代理流量以检测此问题。在进行自我rag时,会生成并提交问题,但知识库为空(我加粗了以便更容易看到):

100.82.53.123 - - [19/Jun/2024:17:27:39 -0400] "POST /api/chat HTTP/1.1" 200 10517 "-" "ollama-python/0.1.8 (x86_64 linux) Python/3.10.12" "{x22model\x22: \x22llama3:70b-instruct-fp16\x22, \x22messages\x22: [{\x22role\x22: \x22system\x22, x22content\x22: \x22You are an intelligent assistant. Please summarize the content of the knowledge base to answer the question. Please list the data in the knowledge base and answer in detail. When all knowledge base content is irrelevant to the question, your answer must include the sentence \x5C\x22The answer you are looking for is not found in the knowledge base!\x5C\x22 Answers need to consider chat history.\x5Cn **Here is the knowledge base:x5Cn \x5Cn The above is the knowledge base.\x**22}, {\x22role\x22: \x22user\x22, \x22contentx22: \x22list cyber security problems that are evidence of a latent security breach\x22}], \x22streamx22: true, \x22format\x22: \x22\x22, \x22options\x22: {\x22temperature\x22: 0.1, \x22num_predict\x22: 933, \x22top_k\x22: 0.3, \x22presence_penaltyx22: 0.4, \x22frequency_penalty\x22: 0.7}, \x22keep_alive\x22: -1}"

预期行为

我希望有知识库内容。

重现步骤

use llama3
configure with self-rag
ask a question of knowledge base where you know you have relevant content
km0tfn4u

km0tfn4u1#

Retievaled nothing even using self-rag.
Could you list the user question and the relevant content in knowledge base in order to analyze the reason of nothing retrievaled?

xbp102n0

xbp102n02#

这是一些额外的详细条目,当尝试使用RAG对数据进行索引时(似乎没有任何内容被传递回来)。

100.80.54.128 - - [20/Jun/2024:10:59:27 -0400] "POST /api/embeddings HTTP/1.1" 200 20585 "-" "ollama-python/0.1.8 (x86_64 linux) Python/3.10.12" "{\x22model\x22: \x22mxbai-embed-large\x22, \x22prompt\x22: \x22There are no paragraphs provided for me to summarize. Please provide the paragraphs you would like me to summarize, and I'll be happy to assist you!\x22, \x22options\x22: {}, \x22keep_alive\x22: null}"

100.80.54.128 - - [20/Jun/2024:10:59:27 -0400] "POST /api/embeddings HTTP/1.1" 200 20585 "-" "ollama-python/0.1.8 (x86_64 linux) Python/3.10.12" "{\x22model\x22: \x22mxbai-embed-large\x22, \x22prompt\x22: \x22There are no paragraphs provided for me to summarize. Please provide the paragraphs you would like me to summarize, and I'll be happy to assist you!\x22, \x22options\x22: {}, \x22keep_alive\x22: null}"

100.80.54.128 - - [20/Jun/2024:10:59:27 -0400] "POST /api/embeddings HTTP/1.1" 200 20684 "-" "ollama-python/0.1.8 (x86_64 linux) Python/3.10.12" "{\x22model\x22: \x22mxbai-embed-large\x22, \x22prompt\x22: \x22There are no paragraphs provided for me to summarize. Please provide the actual text, and I'll be happy to help!\x22, \x22options\x22: {}, \x22keep_alive\x22: null}"

100.80.54.128 - -

相关问题