Integrations Overview
Vectorize integrates with external services through connectors to create powerful data pipelines for AI agents. These connectors work together to ingest diverse content sources, generate embeddings, and transform unstructured data into structured, metadata-rich context that agents can reason over.
How Connectors Enable Agentic AI
Vectorize pipelines support retrieval and transformation workflows that enable agents to operate over structured context. Unlike traditional retrieval systems, agents built on Vectorize can:
- Reason across multiple data sources to synthesize insights and make decisions
- Use metadata for intelligent filtering - like prioritizing recent documents, specific authors, or document types
- Perform multi-step analysis - such as comparing versions, detecting trends, or preparing comprehensive reports
- Take action based on retrieved context - from generating summaries to triggering workflows
Example: Compliance Agent
Instead of just retrieving policy documents, a compliance agent might:
- Analyze emails and documents for regulatory keywords
- Use metadata (sender, date, document type) to prioritize high-risk communications
- Cross-reference findings with policy documents
- Generate a compliance report with specific recommendations
Types of Connectors
Source Connectors
Source connectors allow you to ingest diverse content sources into structured pipelines, enabling agents to reason across emails, documents, databases, and more. They transform unstructured data from various sources into rich, queryable context.
Key Features:
- Automatic metadata extraction (file type, creation date, author)
- Support for various file formats with intelligent parsing
- Configurable scheduling to keep agent knowledge current
- OAuth integration for secure access to third-party services
Important: Once a source connector is associated with a pipeline, it cannot be removed or deleted.
AI Platform Connectors
AI Platform connectors connect to AI services that generate embeddings and extract structured information from your content. They don't just create vector representations - they help transform raw text into structured data that agents can reason over.
Agent-Enabling Features:
- Generate high-quality embeddings for semantic understanding
- Extract structured metadata using schema-driven extraction
- Support varies by model - some embedding providers support multiple languages or domains
- Consistent model selection across ingestion and query time
Destination Connectors
Destination connectors integrate with storage systems optimized for agent workloads. They don't just store vectors - they maintain structured, queryable indexes that agents can efficiently reason over.
Agent-Optimized Features:
- Semantic search with metadata filtering for intelligent context selection
- Namespace isolation for multi-tenant agent applications
- Performance optimization for complex agent queries
Configuring Connectors for Agent Workflows
You can configure connectors in two ways:
1. From the Integrations Section
Navigate to the Vectorize dashboard and access each connector type from the left sidebar under "Integrations." This centralized approach allows you to:
- Set up connectors that multiple agents can use
- Configure metadata extraction rules
- Test connector output before deploying to agents
2. While Creating a RAG Pipeline
During pipeline creation, you can configure connectors in context:
- See how source data flows through to agent-ready output
- Configure metadata extraction specific to your use case
- Preview how agents will interact with the processed data
Note: Connectors configured as part of a RAG Pipeline automatically appear in your organization's connector lists for reuse across other agent workflows.
Understanding Connector Reuse
When you create a connector in Vectorize, you're creating a reusable integration with a service - for example, a Pinecone connector with your Pinecone API key, or a Fireflies connector with your Fireflies API key.
These connectors can be used across multiple pipelines. When you add a connector to a specific pipeline, you can configure additional parameters specific to that pipeline instance - like the Pinecone index name or a Fireflies meeting title filter.
This design lets you manage credentials securely while customizing connector behavior per pipeline.
Best Practices for Agentic AI
- Design for reasoning, not just retrieval: Structure your data with rich metadata that agents can use for decision-making
- Think in workflows: Consider how agents will use multiple data sources together
- Optimize for quality: Better structured data leads to more capable agents
- Monitor and iterate: Track how agents use different data sources and refine accordingly
- Plan for scale: Choose connectors that can grow with your agent deployment
Getting Started
Ready to build your first agent-powered application? Check out our guides:
- Build Your First Pipeline - Foundation for agent development
- All-Inclusive Quickstart - Quick setup with built-in connectors
- Platform-specific quickstarts for production deployments
For detailed configuration instructions, explore the individual connector pages in each category.