How to return metadata with query results?

zigguratt · July 5, 2023, 12:38pm

I’ve been able to add metadata to the PDFs I upload to a Vectara corpus. The attached image shows one of the pieces of metadata. When I make a query to the corpus from my application I only get one piece of metadata returned with the chunks:

{
    "section": "296"
}

How can I return the metadata that I’ve added to each document?

shane · July 6, 2023, 3:29pm

What you’re probably getting tripped up on is that there are 2 types of metadata: document-level metadata and section-level metadata. Seeing just { "section": "296" } is showing you that there’s no section-level metadata other than the section number. But each section returned should also have a documentIndex JSON attribute which will point you to the index of the document array which may contain document-level metadata. So if you want to get the document-level metadata for a specific section, you should use some logic like

documentIndex = responseSet[0]['response'][i]['documentIndex']
documentMetadata = responseSet[0]['document'][documentIndex]['metadata']

There’s an example in the docs at Reading Metadata | Vectara Docs.

This format is because multiple sections that answer the user’s question could come back from the same document, so it’s possible to get >1 hit (in the form of > 1 sections) for the same document.

zigguratt · July 7, 2023, 11:05am

Thank you, @shane! That’s the information I was looking for.

Topic		Replies	Views
Unable to Retrieve Document Section Information via Search Vectara Platform Q&A indexing	1	751	October 17, 2023
Is it possible to index metadata? Vectara Platform Q&A	7	1093	January 20, 2023
Return link to a document's file source Vectara Platform Q&A indexing	3	955	September 20, 2023
Querying custom metadata in the Vectara platform Vectara Platform Q&A query , admin-functions	4	874	July 21, 2023
Search accuracy for product data Vectara Platform Q&A	2	902	April 24, 2023

How to return metadata with query results?

Related topics