See document text in the UI for testing purposes

Sab_S · April 19, 2024, 12:21pm

Right now all I see is this:

I want to check if the content is what I wanted it to be

ofermend · April 19, 2024, 1:05pm

From what I understand you ingested a document (e.g. a PDF) into your Vectara corpus. This shows the document ID and metadata. Do you want to see the text of the original document?
This is available via the API when you ingest the file (but not in the console); see File Upload API Definition | Vectara Docs. If you set d=True in the request, the API call returns the extracted document that was indexed.
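As a rough illustration of the d=True suggestion, here is a minimal Python sketch. Only the `d` parameter comes from this thread. The endpoint URL, the `c` (customer ID) and `o` (corpus ID) query parameters, the `x-api-key` header, and both function names are assumptions for illustration, so please check them against the File Upload API docs before relying on them.

```python
import os
from urllib.parse import urlencode

# Assumption: upload endpoint; confirm against the File Upload API docs.
UPLOAD_ENDPOINT = "https://api.vectara.io/v1/upload"


def build_upload_url(customer_id, corpus_id, return_extracted=True):
    """Build the upload URL; d=true asks for the extracted document back."""
    # Assumption: "c" and "o" carry the customer ID and corpus ID.
    params = {"c": customer_id, "o": corpus_id}
    if return_extracted:
        params["d"] = "true"
    return f"{UPLOAD_ENDPOINT}?{urlencode(params)}"


def upload_and_get_extracted(path, customer_id, corpus_id, api_key):
    """Upload a file and return the parsed JSON response from the API."""
    # Third-party dependency, imported here so the URL helper stays stdlib-only.
    import requests

    url = build_upload_url(customer_id, corpus_id)
    with open(path, "rb") as fh:
        resp = requests.post(
            url,
            headers={"x-api-key": api_key},  # assumption: API-key auth header
            files={"file": (os.path.basename(path), fh)},
            timeout=60,
        )
    resp.raise_for_status()
    return resp.json()
```

Calling `upload_and_get_extracted("notes.txt", customer_id, corpus_id, api_key)` with your own IDs and key should give back a JSON response that, with d=true, includes the extracted document. The exact response layout is not shown here, so inspect the returned dictionary to find the text.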

Sab_S · April 19, 2024, 1:26pm

I ingested a .txt file

Yes, I want to see the text of the original document in the console.

ofermend · April 19, 2024, 1:39pm

Sorry, that is not possible in the console currently.

David_Cawley · June 25, 2024, 7:00pm

I would like to expand on this feature request:

It would be neat to see the scraped text contents in the console during testing.
I would like to call an API on your side to ‘only’ do the text extraction part of your logic - again for testing, and maybe some other uses.
Since we are planning to trust Vectara to do an excellent text extraction without changing the meaning of the contents of a PDF or Word document, being able to see that scraped-out text is very valuable.
In my application, my users may want to upload files into a different section of my app to store full copies of the text content - primarily smaller files that they will want to use in their Gen AI prompts without RAG, just including the whole thing. Maybe a 1-page mission statement, or our principles, our brand, our corporate history. If I could pass those Word/PDF documents up to Vectara and get a ‘Vectara scraped text’ back, then I’d be using the same logic for all my ‘text scraping’ needs.

Also a concern: many of these threads get answered with ‘sorry, that is not possible in the API or console currently’ - are these being recorded as potential future enhancements?
