Airbyte Integration
Overview
Airbyte connectors enable data ingestion through 300+ source and destination connectors.
Architecture
- Managers: `AirbyteConnectorManager` handles connector lifecycle
- Specs: Dynamic generation of configuration forms based on Airbyte specifications
- Secrets: Integrated with Supabase for secure credential storage
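As a rough illustration of the "Specs" item above, the sketch below derives flat form-field descriptors from a connector spec. It assumes the spec exposes a JSON Schema under `connectionSpecification` and marks sensitive fields with `airbyte_secret`, as Airbyte connector specs do. `EXAMPLE_SPEC` and `form_fields` are hypothetical names for this example, not part of this project's API, and nested or `oneOf` properties are not handled.

```python
from typing import Any

# Hypothetical, trimmed-down connector spec; real specs carry many more fields.
EXAMPLE_SPEC: dict[str, Any] = {
    "connectionSpecification": {
        "type": "object",
        "required": ["host", "password"],
        "properties": {
            "host": {"type": "string", "title": "Host"},
            "port": {"type": "integer", "title": "Port", "default": 5432},
            "password": {
                "type": "string",
                "title": "Password",
                "airbyte_secret": True,
            },
        },
    }
}


def form_fields(spec: dict[str, Any]) -> list[dict[str, Any]]:
    """Turn the top-level properties of a spec into form-field descriptors."""
    schema = spec["connectionSpecification"]
    required = set(schema.get("required", []))
    fields = []
    for name, prop in schema.get("properties", {}).items():
        fields.append(
            {
                "name": name,
                "label": prop.get("title", name),
                "type": prop.get("type", "string"),
                "required": name in required,
                "default": prop.get("default"),
                # Secret fields should be masked in the form and sent to
                # the secret store rather than saved with the plain config.
                "secret": bool(prop.get("airbyte_secret", False)),
            }
        )
    return fields
```

A form renderer could iterate over the returned list and pick an input widget per `type`, masking any field where `secret` is true.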
Configuration
Add Connection
- Go to Settings > Airbyte Connections
- Click "Add Source" or "Add Destination"
- Select connector type
- Enter configuration
- Test connection
- Save
Using in Pipelines
{
  "id": "airbyte-source",
  "type": "airbyte",
  "data": {
    "connection_id": "uuid-of-airbyte-connection",
    "stream": "users",
    "sync_mode": "full_refresh",
    "destination": "internal_staging"
  }
}
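Before submitting a pipeline, it can help to sanity-check a node definition like the one above. The following is a minimal sketch, not the pipeline engine's own validation: `validate_airbyte_node` is a hypothetical helper, and treating all four `data` keys as required is an assumption based only on the example node.

```python
# Sync modes taken from the "Sync Modes" table in this document.
SYNC_MODES = {"full_refresh", "incremental", "cdc"}

# Assumption: every key shown in the example node is mandatory.
REQUIRED_DATA_KEYS = {"connection_id", "stream", "sync_mode", "destination"}


def validate_airbyte_node(node: dict) -> list[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    errors = []
    if node.get("type") != "airbyte":
        errors.append("type must be 'airbyte'")

    data = node.get("data")
    if not isinstance(data, dict):
        errors.append("data must be an object")
        return errors

    missing = sorted(REQUIRED_DATA_KEYS - data.keys())
    if missing:
        errors.append("missing data keys: " + ", ".join(missing))

    mode = data.get("sync_mode")
    if mode is not None and mode not in SYNC_MODES:
        errors.append(f"unknown sync_mode: {mode!r}")
    return errors


node = {
    "id": "airbyte-source",
    "type": "airbyte",
    "data": {
        "connection_id": "uuid-of-airbyte-connection",
        "stream": "users",
        "sync_mode": "full_refresh",
        "destination": "internal_staging",
    },
}
problems = validate_airbyte_node(node)
```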
Supported Connectors
Sources
- Database (PostgreSQL, MySQL, Oracle)
- SaaS (Salesforce, HubSpot, Stripe)
- File (SFTP, Google Sheets)
- CDC (Debezium-based)
Destinations
- Database (PostgreSQL, Snowflake, BigQuery)
- Data Warehouse (Redshift, Databricks)
- File (S3, GCS)
Sync Modes
| Mode | Description |
|---|---|
| full_refresh | Read entire dataset |
| incremental | Only new/updated records |
| cdc | Change data capture |
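To make the difference between `full_refresh` and `incremental` concrete, here is a small conceptual sketch. It is not how Airbyte or this integration implements syncs. It assumes incremental reads are driven by a cursor field (the hypothetical `updated_at`) and a saved cursor value from the previous run. `cdc` is left out because it reads changes from the database log rather than filtering rows.

```python
from typing import Any, Optional

Record = dict[str, Any]


def select_records(
    records: list[Record],
    sync_mode: str,
    cursor_field: str = "updated_at",
    last_cursor: Optional[str] = None,
) -> list[Record]:
    """Pick the records a sync run would read under the given mode."""
    if sync_mode == "full_refresh":
        # Every run re-reads the whole dataset.
        return list(records)
    if sync_mode == "incremental":
        if last_cursor is None:
            # First run: nothing synced yet, so read everything.
            return list(records)
        # Later runs: only rows changed since the saved cursor value.
        return [r for r in records if r[cursor_field] > last_cursor]
    raise ValueError(f"sync mode not modeled in this sketch: {sync_mode}")


users = [
    {"id": 1, "updated_at": "2024-01-01"},
    {"id": 2, "updated_at": "2024-02-15"},
    {"id": 3, "updated_at": "2024-03-10"},
]
changed = select_records(users, "incremental", last_cursor="2024-02-01")
```

The example compares ISO 8601 date strings, which sort correctly as plain strings when they share one format. After a run, the new cursor would be the largest `updated_at` value that was read.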
Troubleshooting
Connection fails
- Check that the credentials are correct
- Verify network access
- Check Airbyte logs
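For the "Verify network access" step, a quick TCP reachability probe can rule out firewall or DNS problems before you dig into connector logs. This is a generic sketch using Python's standard `socket` module; `can_reach` is a hypothetical helper, and the host and port in the usage comment are placeholders for your own source or destination.

```python
import socket


def can_reach(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False


# Example (placeholder values):
# can_reach("db.example.internal", 5432)
```

A successful probe only shows that the port is open from the machine where the script runs. Run it from the host or container that executes the connector, since that is the network path that matters.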
Sync stalls
- Check connector state
- Verify the destination has free storage space
- Review rate limits