AI Platforms
Last updated
Last updated
AI Platforms are essential components in RAG (Retrieval-Augmented Generation) pipelines within Vectorize. They are responsible for generating text embeddings, which are crucial for transforming unstructured text data into vector representations that can be efficiently stored and searched in vector databases.
AI Platform integrations in Vectorize allow you to connect to various AI services that provide advanced natural language processing capabilities, particularly for generating text embeddings. These embeddings capture the semantic meaning of text, enabling more accurate and context-aware information retrieval in RAG pipelines.
Vectorize currently supports the following AI Platform integrations:
OpenAI: Leverage OpenAI's powerful language models for generating high-quality text embeddings.
Voyage AI: Utilize Voyage AI's specialized embedding models for various natural language processing tasks.
You can configure AI Platform integrations in two ways:
Navigate to the Vectorize dashboard.
In the left sidebar, under "Integrations," click on "AI Platforms."
You'll see a list of currently configured AI Platform integrations in your workspace.
To add a new integration, click the "New AI Platform" button.
Choose from the list of available AI Platform options.
Follow the prompts to configure the selected AI Platform integration.
During the RAG Pipeline creation process, you'll reach a step to configure the AI Platform.
You can either select an existing AI Platform integration or create a new one.
If creating a new integration, choose from the available options and follow the configuration steps.
Select the specific embedding model you want to use for this pipeline.
Note: AI Platform integrations configured as part of a RAG Pipeline will automatically appear in the AI Platforms list for your organization and can be reused in future RAG pipelines.
When setting up an AI Platform integration, you'll typically need to provide:
API keys or access tokens for authentication
Model selection (if the platform offers multiple embedding models)
Any additional configuration options specific to the chosen AI Platform
In a RAG Pipeline, the integrated AI Platform is used to:
Generate embeddings for the documents ingested through source connectors
Create embeddings for user queries during the retrieval process
These embeddings enable semantic similarity searches in the vector database, allowing for more accurate and context-aware information retrieval.
For detailed information on configuring specific AI Platform integrations, please refer to their individual documentation pages linked above.