Creating a RAG Pipeline
Last updated
Was this helpful?
Last updated
Was this helpful?
This page relates to page-based plans.
🔍 Looking for our legacy usage-based plan documentation? Click here.
The process of creating a RAG pipeline in Vectorize is simple and divided into three main steps:
Configuring the vector database and vectorization strategy.
Configuring one or more source connectors to ingest data from.
Configuring when the pipeline should update the vector indexes.
Below are the detailed instructions on how to configure a RAG pipeline, using the provided images as a guide.
From the Vectorize dashboard, select New RAG Pipeline from the RAG Pipelines section on the left sidebar.
This will take you to the pipeline configuration screen, where you'll be asked to name your pipeline and configure its components. Here you can provide a name for your pipeline.
You may configure the pipeline in any order, but the following steps are recommended:
First, select a source connector to ingest data from. You can either select an existing source connector or create a new one. Click Select Source to add a new source connector.
When you click on the box to add a source connector, you will be presented with a list of available source connectors.
Once you select the source connector you want to use, you will be presented with options to configure the connection to the source system. You can find documentation for how to configure each individual connector in the connector's documentation.
Next you can choose the extractor and chunker for your pipeline. The extractor is responsible for extracting text from the source data, while the chunker is responsible for breaking the text into smaller chunks. There are three options for the extracting strategy:
Fast: This is our fastest and most lightweight extractor.
Vectorize Iris: This extractor is excellent for PDFs and other complex documents.
Mixed: This extractor uses both Fast and Vectorize Iris depending on the file type. You may also choose the chunk size. You may want smaller chunks for more granular search results, or larger chunks for more context. Lastly you can configure the chunk overlap. This is the number of words that will overlap between chunks. This can help with context and search results.
Next is the embedder configuration. The embedder is what will generate the vector embeddings for the text data. You can choose between the built-in embedder or a custom embedder. The built-in embedder is free to use and is a great starting point for your pipeline. If you have a custom embedder, you can select it here.
Finally, select a vector database where the vector embeddings will be stored. You can choose between the built-in vector database or a custom vector database if you wish to view the embeddings in a different system.
After deploying, your pipeline will always run immediately to backfill the vector index.
You have successfully created a RAG pipeline in Vectorize. You can now start ingesting data and generating embeddings. If you have any questions or need help, please reach out to our support team.
Selecting Bring your own database will bring up a list of available vector databases you can integrate with. You can refer to the relevant documentation on each connector for more details about how to configure the integration.
Once you configure a vector database integration for a RAG Pipeline, it will become available to reuse in future pipelines. If you choose to bring your own database you will need to configure the collection/index name. You can choose to use an already existing collection/index or we can automatically create a new one for you.
After configuring the pipeline, you can choose to save it as a draft or deploy it immediately. If you save it as a draft, you can come back and deploy it later. If you deploy it immediately, the pipeline will start ingesting data and generating embeddings.