Deep Research (Beta)

📢 Note: Deep Research is currently in Beta.

The Deep Research feature allows you to generate comprehensive research reports based on your organization's data. This powerful tool combines AI with structured templates to produce in-depth analysis automatically.

Deep Research operates on the vectorized data associated with a pipeline. Deep Research will query the vector database configured on the pipeline multiple times to gather the required information to complete the report. It is also able to augment your private data with data searched from the web.

Deep Research is available in the Vectorize UI on the Pipeline Details page. It is also availabe as an endpoint on the Vectorize API. There is no additional charge for Deep Research, however it does use the retrieval endpoint when doing its research. Those retrievals will count against your plan's included retrievals and may trigger charges for additional retrievals.

Getting Started

To access the Deep Research feature, navigate to the Pipeline Details page and select the Deep Research tab. The interface is divided into three main sections:

Research Topic - Define what you want to research
Report Schema - Customize the structure of your report
Research Results - View the generated report

Step 1: Define Your Research Topic

Navigate to the Research Topic tab
Enter your research query in the text area (e.g., "Generate a research report on the financial status of the company"). For best results, be a specific as possible.
Toggle the Web Search option if you want to include internet data in your research
Click Next: Configure Schema to proceed

Research Topic Tab

Step 2: Configure Your Report Structure

The Report Structure defines the parts of your research report. This is helpful if your organization expects a certain structure in the report. Defining the structure is optional. If you do not provide it, Deep Research will automatically generate a suitable structure for your report based on the instructions you provided in the Research Topic section.

When configuring your report structure, you can:

Use a Template

Click the dropdown menu labeled "start from a template"
Select a pre-defined template that suits your needs

Create a Custom Schema

Use the Schema Builder to create your own report structure
Add sections using the Add Section button at the bottom
For each section:
- Provide a title and description
- Add subparts using the Add Subpart button
- Configure each subpart with its own title and description

Organize Your Schema

Rearrange sections and subparts: Drag and drop items to reorder them
Expand/collapse sections: Click the chevron icon next to a section title
Edit content: Click on any field to edit its content
Remove items: Click the trash icon to delete sections or subparts

Advanced Schema Options

View JSON: See the raw JSON structure of your schema
Download: Save your schema as a JSON file for future use
Import: Upload a previously saved schema
Clear All: Remove all sections and start fresh

Report Schema Builder

Step 3: Generate Your Research

Review your research topic and schema
Click the Start Research button to begin the generation process
The system will automatically switch to the Research Results tab
You'll see a progress tracker on the left showing each phase of the research process

Research Results

Viewing and Using Research Results

Once the research is complete:

The full report appears in the right panel
Review the content, which follows the structure defined in your schema (if provided)
Download the report as a PDF using the PDF button
To make changes and regenerate:
- Navigate back to the Research Topic or Report Schema tab
- Make your adjustments
- Click Run Again

Tips for Effective Research

Be specific: Clearly define what you want to research
Structure matters: Organize your schema logically with appropriate sections
Start small: Begin with a simple schema and expand as needed
Save templates: Create and save schemas for research types you perform regularly
Iterate: Use the "Run Again" button to refine results after reviewing initial output

Troubleshooting

Deep research can take several minutes to complete, so be patient. If research seems to be taking too long, check your internet connection.
If results don't match expectations, try refining your research question, adding more relevant source documents to you pipeline, or enabling web search to gather additional information from the internet
If a section is missing in the results, ensure your schema is properly structured

Using the Deep Research API

In addition to the user interface, the Deep Research feature is also available through a REST API. This enables you to integrate research capabilities into your own applications and workflows.

API Endpoints

Start Deep Research

Initiates a new research process.

URL: /v1/org/[organizationId]/pipelines/[pipelineId]/deep-research
Method: POST
Headers:
- Content-Type: application/json
- Authorization: Bearer [your-token]

Request Body:

{
  "query": "Your research question",
  "webSearch": true,
  "schema": "JSON schema for report structure",
  "n8n": {
    "account": "account-id",
    "webhookPath": "webhook-path",
    "headers": {
      "custom-header": "value"
    }
  }
}

Parameters:

query (required): The research question or topic
webSearch (optional): Set to true to include internet data in research
schema (optional): JSON string representing the report structure
n8n (optional): Configuration for n8n workflow integration

Response:

{
  "researchId": "unique-research-id"
}

Get Research Results

Retrieves the results of a research process.

URL: /v1/org/[organizationId]/pipelines/[pipelineId]/deep-research/[researchId]
Method: GET
Headers:
- Authorization: Bearer [your-token]

Response:

{
  "ready": true,
  "data": {
    "success": true,
    "events": ["Research started", "Processing data", "Generating report"],
    "markdown": "# Research Report\n\n## Introduction\n\nThis report explores..."
  }
}

If ready is false, the research is still in progress
If success is false, the error field will contain error details
events contains progress updates
markdown contains the full report in markdown format

Schema Format for API Usage

When using the API, you'll need to provide a schema in JSON format that defines the structure of your research report.

Schema Structure

{
  "report_schema": [
    {
      "type": "Section Name",
      "description": "Description of this section",
      "subparts": [
        {
          "type": "Subpart Name",
          "description": "Description of this subpart"
        }
      ]
    }
  ]
}

Key Components

report_schema: The root array containing all sections
type: The title/heading of a section or subpart
description: Detailed instructions for what should be included
subparts: An optional array of subsections within a main section

Example Schema

Here's an example schema for a financial report:

{
  "report_schema": [
    {
      "type": "Executive Summary",
      "description": "A high-level overview of the company's financial status and key findings.",
      "subparts": []
    },
    {
      "type": "Financial Highlights",
      "description": "A summary of key financial performance indicators.",
      "subparts": [
        {
          "type": "Revenue Summary",
          "description": "Total revenue generated, including a comparison with previous periods."
        },
        {
          "type": "Profitability Metrics",
          "description": "Gross profit, operating profit, and net profit margins."
        }
      ]
    }
  ]
}

Comprehensive Schema Example

Below is a more detailed schema example for a financial report:

{
  "report_schema": [
    {
      "type": "Title",
      "description": "The official title of the financial report, including the name of the company and the reporting period.",
      "subparts": []
    },
    {
      "type": "Executive Summary",
      "description": "A high-level overview of the company's financial status and key findings.",
      "subparts": []
    },
    {
      "type": "Company Overview",
      "description": "A summary of the company's business model, industry, and markets.",
      "subparts": [
        {
          "type": "Company Background",
          "description": "Brief history, mission, and vision of the company."
        },
        {
          "type": "Market Position",
          "description": "Competitive positioning and industry trends."
        }
      ]
    },
    {
      "type": "Financial Highlights",
      "description": "A summary of key financial performance indicators.",
      "subparts": [
        {
          "type": "Revenue Summary",
          "description": "Total revenue generated, including comparisons with previous periods."
        },
        {
          "type": "Profitability Metrics",
          "description": "Gross profit, operating profit, and net profit margins."
        }
      ]
    }
  ]
}

Best Practices for Schema Design

Be specific in descriptions: Provide clear guidance on what information should be included
Use logical organization: Structure the report with a natural flow from high-level to detailed information
Include context: Add details about relevant metrics, comparisons, or time periods
Balance depth and breadth: Include enough sections to be comprehensive without making the schema unwieldy
Consider your audience: Design the schema with the intended readers' needs in mind

API Usage Example

The following example demonstrates how to start a research task and retrieve the results using cURL:

# Start a new research task
curl -X POST \
  https://api.vectorize.io/v1/org/your-org-id/pipelines/your-pipeline-id/deep-research \
  -H 'Authorization: Bearer your-token' \
  -H 'Content-Type: application/json' \
  -d '{
    "query": "Analyze the financial performance of our company",
    "webSearch": false,
    "schema": "{\"report_schema\":[{\"type\":\"Executive Summary\",\"description\":\"Summary of findings\",\"subparts\":[]}]}"
  }'

# Response: {"researchId":"abc123"}

# Check research status
curl -X GET \
    https://api.vectorize.io/v1/org/your-org-id/pipelines/your-pipeline-id/deep-research/abc123 \
  -H 'Authorization: Bearer your-token'

Integration Tips

Poll the results endpoint periodically to check if research is complete
Consider implementing webhooks for asynchronous notifications
Store the research ID to access results later
Use proper error handling for cases where research fails

Getting Started​

Step 1: Define Your Research Topic​

Step 2: Configure Your Report Structure​

Use a Template​

Create a Custom Schema​

Organize Your Schema​

Advanced Schema Options​

Step 3: Generate Your Research​

Viewing and Using Research Results​

Tips for Effective Research​

Troubleshooting​

Using the Deep Research API​

API Endpoints​

Start Deep Research​

Get Research Results​

Schema Format for API Usage​

Schema Structure​

Key Components​

Example Schema​

Comprehensive Schema Example​

Best Practices for Schema Design​

API Usage Example​

Integration Tips​