PostgreSQL Vector Database Integration
This guide walks you through configuring a Retrieval-Augmented Generation (RAG) pipeline in Vectorize using PostgreSQL as the vector database.
Before you Begin
Ensure you have:
- A Vectorize account (Create one free here ↗ )
- A PostgreSQL database with the pgVector extension
Configure the integration
To configure a vector database integration to connect to your PostgreSQL instance:
-
Click Vector Databases from the main menu.
-
Click New Vector Database Integration from the Vector Databases page.
-
Select the PostgreSQL card.
-
Enter the parameters in the form using the PostgreSQL Parameters table below as a guide, then click Create PostgreSQL Integration.
PostgreSQL Parameters
Field | Description | Required |
---|---|---|
Name | A descriptive name to identify the integration within Vectorize. | Yes |
Host | The name of the host where your database is located. | Yes |
Port | The port you'll connect to. | Yes |
Database | Your database name. | Yes |
Username | The name of the user used to connect to your database. | Yes |
Password | The user's password. | Yes |
Configuring the PostgreSQL integration in a RAG Pipeline
You can think of the PostgreSQL integration as having two parts to it. The first is authorization with your PostgreSQL cluster. This part is re-usable across pipelines and allows you to connect to this same application in different pipelines without providing the credentials every time.
The second part is the configuration that's specific to your RAG Pipeline. This is where you specify the name of the table in your PostgreSQL database. If the table does not already exist, Vectorize will create it for you.
What's next?
-
If you haven't yet built a connector to your source database, go to configure a source connector and select your source data platform.
OR
-
If you're ready to start creating pipelines for vectorizing data, head to Pipeline Basics.