Automatic breaking large sections into smaller ones

Steve_Farthing · June 30, 2023, 9:05pm

If I index a large “section” when indexing a document will it break it up into multiple subsections? If it doesn’t is there a good way to take a large markdown blob an break it down into subsections?

shane · July 3, 2023, 4:40pm

If you use the file upload API, Vectara will automatically attempt to handle basically everything for you. It will attempt to extract the text and metadata and to directly answer your question, yes, it will also attempt to break documents into appropriate subsections. Here’s an example document I uploaded to the console, where you can see Vectara has sectioned it automatically and the first result here is from section 2:

Just for completeness sake, the “standard indexing API” on the other hand, assumes that you will section documents as needed before sending them to Vectara. If you’re uploading markdown documents directly, you’ll be using the file upload API instead of the standard indexing API, so feel free to ignore this paragraph until you do some application development on non-files.

Topic		Replies	Views
What is an appropriate amount of "text" in a document part, and what is the recommended way to split something? Vectara Platform Q&A	9	1090	January 29, 2023
How to define a specific Chunking method? What Vectara Platform Q&A indexing	1	31	January 22, 2025
Uploading files & Indexing Vectara Platform Q&A indexing	3	863	July 21, 2023
Unable to Retrieve Document Section Information via Search Vectara Platform Q&A indexing	1	751	October 17, 2023
Is it possible to index metadata? Vectara Platform Q&A	7	1093	January 20, 2023

Automatic breaking large sections into smaller ones

Related topics