SharePoint Source Connector

The SharePoint Source Connector allows you to integrate SharePoint as a data source for your pipelines. This guide explains the configuration options available when setting up a SharePoint connector.

Before you begin

Before starting, you'll need:

  • A Microsoft Entra ID application
  • The following information for your app:
    • Client Id
    • Tenant Id
    • Client Secret

If you don't have an application created yet, check out our guide How to Create a Microsoft Entra ID Application.
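To illustrate how the three values fit together, here is a minimal sketch of the OAuth 2.0 client-credentials token request that a Microsoft Entra ID application typically uses. This guide does not describe how the connector authenticates internally, so treat the flow, the Microsoft Graph scope, and the helper name `build_token_request` as assumptions for illustration. The sketch only builds the request and does not send it.

```python
from urllib.parse import urlencode

# Tenant-specific token endpoint of the Microsoft identity platform (v2.0).
TOKEN_URL_TEMPLATE = "https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"


def build_token_request(tenant_id: str, client_id: str, client_secret: str,
                        scope: str = "https://graph.microsoft.com/.default"):
    """Return (url, form_body) for a client-credentials token request.

    Hypothetical helper: the default scope assumes Microsoft Graph access,
    which this guide does not confirm.
    """
    url = TOKEN_URL_TEMPLATE.format(tenant_id=tenant_id)
    body = urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "scope": scope,
    })
    return url, body


if __name__ == "__main__":
    # Placeholder values; substitute the ones from your app registration.
    url, _body = build_token_request("<tenant-id>", "<client-id>", "<client-secret>")
    print(url)
```

POSTing the form body to the URL with the `application/x-www-form-urlencoded` content type is one way to check that the credentials are valid before entering them in the connector form.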

Configure the connector

To create a SharePoint connector that uses your Microsoft Entra ID application:

  1. Click Source Connectors from the main menu.

  2. Click New Source Connector from the Source Connectors page.

  3. Select the SharePoint card.

  4. Enter the Name, Client Id, Tenant Id, and Client Secret in the form, using the SharePoint Parameters table below as a guide, then click Create SharePoint Integration.

SharePoint Parameters

Field | Description | Required
Name | A descriptive name to identify the connector within Vectorize. | Yes
Client Id | The Microsoft Entra ID application's client id. | Yes
Tenant Id | The Microsoft Entra ID application's tenant id. | Yes
Client Secret | The Microsoft Entra ID application's client secret value. | Yes

Configuring the SharePoint Connector in a RAG Pipeline

The SharePoint connector has two parts. The first is authorization with your Microsoft Entra ID application. This part is reusable: once it is created, you can connect to the same application from different pipelines without providing the credentials each time.

The second part is the configuration that's specific to your RAG Pipeline, such as which files and directories should be processed.
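The two-part split can be pictured as two small data structures. This is only a conceptual sketch: the class names `SharePointCredentials` and `SharePointPipelineSource` are hypothetical and do not correspond to any Vectorize API; the field names mirror the parameter tables in this guide.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class SharePointCredentials:
    """Part one: authorization details, entered once and reused."""
    name: str
    client_id: str
    tenant_id: str
    client_secret: str


@dataclass
class SharePointPipelineSource:
    """Part two: settings that belong to a single RAG pipeline."""
    credentials: SharePointCredentials
    file_extensions: list[str] = field(default_factory=list)
    sites: list[str] = field(default_factory=list)


if __name__ == "__main__":
    # One set of credentials shared by two differently configured pipelines.
    creds = SharePointCredentials("my-sharepoint", "<client-id>", "<tenant-id>", "<client-secret>")
    marketing = SharePointPipelineSource(creds, ["pdf"], ["Marketing"])
    engineering = SharePointPipelineSource(creds, ["docx", "md"], ["Engineering"])
    print(marketing.credentials is engineering.credentials)
```

Both pipeline sources point at the same credentials object, while the file types and sites differ per pipeline.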

The following table outlines the fields available when configuring a SharePoint source for use within a Retrieval-Augmented Generation (RAG) pipeline.

Field | Description | Required
File Extensions | Specifies the types of files to be included (e.g., PDF, HTML, Markdown, Text, DOCX). | Yes
Sites | The names of the SharePoint sites whose files will be ingested. Enter one per line, and select + Add to add each additional site name. | Yes

Files will be ingested from the Documents section in each specified SharePoint site.
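As a rough picture of what a File Extensions filter does, the sketch below keeps only the paths whose extension is in an allowed set. The function names and the case-insensitive matching are assumptions for illustration; this guide does not specify how the connector compares file types.

```python
from pathlib import PurePosixPath


def normalize_extensions(extensions):
    """Lower-case each extension and strip any leading dot, e.g. '.PDF' -> 'pdf'."""
    return {ext.lower().lstrip(".") for ext in extensions}


def select_files(paths, extensions):
    """Return the paths whose file extension is in the allowed set, in input order."""
    allowed = normalize_extensions(extensions)
    return [
        p for p in paths
        if PurePosixPath(p).suffix.lower().lstrip(".") in allowed
    ]


if __name__ == "__main__":
    # Hypothetical file listing from a site's Documents section.
    listing = ["Documents/report.PDF", "Documents/logo.png", "Documents/notes.md"]
    print(select_files(listing, ["pdf", ".md"]))
```

Files without an extension, or with an extension outside the selected set, are skipped.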

What's next?

  • If you haven't yet built a connector to your vector database, go to Configuring Vector Database Connectors and select the platform you prefer to use for storing output vectors.

    OR

  • If you're ready to start producing vector embeddings from your input data, head to Pipeline Basics. Select your new connector as the data source to use it in your pipeline.