Suggest a tag taxonomy based on file content and existing metadata.
| Field | Type | Description |
|---|---|---|
data_connector_name | str | Name of the data connector (vector database profile) to use. |
file_names | Optional[List[str]] | List of specific files to analyze for the hierarchy suggestion. |
dataslice_id | Optional[str] | ID of a dataslice to pull files from for the suggestion. |
progress_tracking_id | Optional[str] | Custom tracking ID for monitoring the suggestion progress. |
schema_name | Optional[str] | Name for the suggested schema/taxonomy. |
current_tree | Optional[Dict] | Existing hierarchy tree to build upon. |
condition | Optional[Condition] | Filtering condition to select specific files for analysis. |
node | Optional[GraphNode] | Node location in the existing hierarchy tree to build upon. Default: {} |
user_context | Optional[str] | User-provided context to guide the suggestion process. |
context_level | Optional[str] | Level at which to analyze content: file or chunk. Default: "file" |
max_height | Optional[int] | Maximum depth of the generated hierarchy tree. Default: 2 |
use_existing_tags | Optional[bool] | Whether to incorporate existing tags in suggestions. Default: false |
use_extracted_tags | Optional[bool] | Whether to use previously extracted tags. Default: false |
use_mix_llm_and_source | Optional[bool] | Whether to mix LLM-generated and source-based tags. Default: false |
{ "status_code": int, "message": str, "suggestion": Dict, "suggested_tags": Optional[Dict], "node": Optional[GraphNode], "tag_not_found_rates": Optional[Dict] }{
"data_connector_name": "my-connector",
"file_names": ["document1.pdf", "document2.pdf"],
"context_level": "file",
"max_height": 3,
"use_existing_tags": true,
"user_context": "Suggest categories for financial documents"
}
open_ended, binary, mixed, defined_values, hierarchy