Skip to main content

Build Your First RAG Pipeline

Time to complete: 5 minutes

This quickstart will show you how to create a pipeline that:

  • Ingests your documents
  • Creates vector embeddings
  • Makes your data searchable using AI

We'll use the Web Crawler connector as well as our built in, free to use, embedding model and vector database to process documents from your device.

Before you begin

Before you begin, you'll need:

Step 1: Create Your RAG Pipeline

  1. Log in to Vectorize.

  2. Click New RAG Pipeline in the left sidebar.

    Create your Pipeline

  3. Name your pipeline (e.g., "Quickstart Pipeline").

  4. Select your data source:

    • Click Select Source.
    • Choose Web Crawler.
    • Enter a name for the Web Crawler.
    • Enter a website you wish to crawl.
    • Click Create Web Crawler Integration.
    • You may wish to add additional configuration options, such as additional URLs and forbidden paths.
  5. Configure your Extractor and Chunker:

    • This is where you may configure the extractor and chunker for your pipeline.
    • For this quickstart, we will use the default values.
  6. Choose Embedder:

    • For this quickstart, we will use Built-in Embedder.
    • This is a free to use embedding model provided by Vectorize.
  7. Choose Vector Database:

    • For this quickstart, we will use Built-in Vector Database.
    • This will be automatically selected so you do not need to do anything.

    Configure Pipeline

Step 2: Deploy your pipeline:

  1. Click Save/Deploy RAG Pipeline.

  2. If you wish to save your pipeline as a draft, click Save RAG Pipeline. Otherwise, click Deploy RAG Pipeline.

    Complete Pipeline Configuration

Your pipeline will crawl the website you selected, ingest the content, generate embeddings, and write them to the vector database.

When the embeddings have been stored in the database, your pipeline's status will change to the Listening state, where it will "listen" for more updates to ensure there is no more data left to process. Once it's done processing it will go into Idle state.

Pipeline Listening State

Step 3: Try it out!

  1. Click the RAG Sandbox tab to open the RAG Sandbox.
  2. Ask a question about your website.
  3. See your data in action!

Ask Questions in Sandbox

What's Next?

Was this page helpful?