Overview
Workflow to index PDFs in a folder into a Chroma collection
Tags
rag, start
Workflow Diagram
graph TD
listfiles_8eb41c["ListFiles"]
papers2_49cc8e["papers2"]
loaddocumentfile_4e1c87["LoadDocumentFile"]
extracttext_2124c9["ExtractText"]
indextextchunks_38b8d1["IndexTextChunks"]
sentencesplitter_be7780["SentenceSplitter"]
pathtostring_7fdb62["PathToString"]
extracttext_2124c9 --> sentencesplitter_be7780
loaddocumentfile_4e1c87 --> extracttext_2124c9
sentencesplitter_be7780 --> indextextchunks_38b8d1
papers2_49cc8e --> indextextchunks_38b8d1
listfiles_8eb41c --> loaddocumentfile_4e1c87
listfiles_8eb41c --> pathtostring_7fdb62
pathtostring_7fdb62 --> sentencesplitter_be7780
How to Use
- Open NodeTool and create a new workflow
- Import this workflow from the examples gallery or build it manually following the diagram above
- Configure the input nodes with your data
- Run the workflow to see results
Related Workflows
Browse other workflow examples to discover more capabilities.