Read extracted data

Once you have uploaded a resource and it got processed by Nuclia, you can access the data extracted from it.

CLI:

nuclia kb resource get --rid=RESOURCE_ID --show=extracted --json

SDK:

from nuclia import sdk
resource = sdk.NucliaResource()
resource.get(rid=RESOURCE_ID, show='extracted')

Note: If the resource is not processed yet, a warning will be shown.

The extracted data is nested in the extracted key of each resource's field.

For example, if you have upload a file with:

nuclia kb upload file --path=FILE_PATH --field=file1

the corresponding extracted data will be in data.files.file1.extracted.

It contains the extracted text, the paragraphs, the entities, the relations between entities and all the file metadata.

Get embeddings

You can get the embeddings of the indexed text by using the extracted=vectors option:

CLI:

nuclia kb resource get --rid=RESOURCE_ID --show=extracted --extracted=vectors --json

SDK:

from nuclia import sdk
resource = sdk.NucliaResource()
resource.get(rid=RESOURCE_ID, show='extracted', extracted='vectors')