llama_index [问题]:聊天引擎返回的过多内容，如何让它不匹配那么多？

我猜我假设你使用了向量索引，但是你创建了哪种索引？
DocumentSummaryIndex
这是我如何使用它的：
storage_context = StorageContext.from_defaults(persist_dir=persist_dir)
index = load_index_from_storage(storage_context)

赞(0）回复(0）举报 10个月前

disbfnqx5#

我猜我假设你使用了向量索引，但是你创建了哪种索引？
DocumentSummaryIndex
这是我如何使用它的：storage_context = StorageContext.from_defaults(persist_dir=persist_dir) index = load_index_from_storage(storage_context)
这是我如何创建索引的：
splitter = SentenceSplitter(chunk_size=1024)
response_synthesizer = get_response_synthesizer(
response_mode="tree_summarize", use_async=True
)
doc_summary_index = DocumentSummaryIndex.from_documents(
docs,
llm=OurLLM(), #自定义本地模型
transformations=[splitter],
response_synthesizer=response_synthesizer,
show_progress=True,
)
doc_summary_index.storage_context.persist(persist_dir)

展开查看全部

赞(0）回复(0）举报 10个月前

gzszwxb46#

@yuyu990116 文档摘要索引通过生成文档摘要来工作，并使用这些摘要来决定将哪些文档发送给LLM。
这将把与所选文档相关的所有节点发送到LLM。
实际上没有方法限制使用此索引发送的节点数量，因为它会从所选文档中提取所有节点。

赞(0）回复(0）举报 10个月前

我来回答

llama_index [问题]:聊天引擎返回的过多内容，如何让它不匹配那么多？

问题验证

问题

6条答案

相关问题

热门标签

最新问答