Supported Data Connectors
Amazon S3
Connect to AWS S3 buckets for scalable cloud storage.
SharePoint
Integrate with Microsoft 365 document libraries.
PostgreSQL
Connect to PostgreSQL databases with pgvector support.
Qdrant
Vector database integration for semantic search.
Configuration Details
| Source Type | Description | Key Configuration | Ideal Use Case |
|---|---|---|---|
| Amazon S3 | AWS cloud object storage | Bucket name, Access Key, Secret Key | Large-scale document archives, cloud-native workflows |
| SharePoint | Microsoft 365 document management | Client ID, Client Secret, Tenant ID, Site Name | Enterprise document libraries, Office 365 environments |
| PostgreSQL | Relational database with pgvector extension | Host URL, Database name, User credentials, Port | Structured + unstructured hybrid data, existing database workflows |
| Qdrant | Purpose-built vector database for AI | API Key, Collection name, URL | Semantic search applications, RAG pipelines |
Key Features
Multiple Profiles
Create and manage multiple Data Connector connections
Connection Testing
Validate credentials before saving
Active Profile Selection
Switch between Data Connectors with one click
Schema Configuration
Customize field mappings (filename key, text key, tags key)
How Data Connectors Work
1
Create a Connector
Select your storage type and provide the required credentials.
2
Test the Connection
Validate that the platform can access your documents before saving.
3
Configure Schema Mapping
Map your data fields (filename, text content, tags) to the platform’s expected format.
4
Start Processing
Your documents are now available for metadata extraction.
Python SDK
- Create Connector
- List Connectors
- Ingest Data
- Delete Connector

