Deep Research (Beta)
Last updated
Was this helpful?
Last updated
Was this helpful?
📢 Note: Deep Research is currently in Beta.
The Deep Research feature allows you to generate comprehensive research reports based on your organization's data. This powerful tool combines AI with structured templates to produce in-depth analysis automatically.
Deep Research operates on the vectorized data associated with a pipeline. Deep Research will query the vector database configured on the pipeline multiple times to gather the required information to complete the report. It is also able to augment your private data with data searched from the web.
Deep Research is available in the Vectorize UI on the Pipeline Details page. It is also availabe as an endpoint on the Vectorize API. There is no additional charge for Deep Research, however it does use the retrieval endpoint when doing its research. Those retrievals will count against your plan's included retrievals and may trigger charges for additional retrievals.
To access the Deep Research feature, navigate to the Pipeline Details page and select the Deep Research tab. The interface is divided into three main sections:
Research Topic - Define what you want to research
Report Schema - Customize the structure of your report
Research Results - View the generated report
Navigate to the Research Topic tab
Enter your research query in the text area (e.g., "Generate a research report on the financial status of the company"). For best results, be a specific as possible.
Toggle the Web Search option if you want to include internet data in your research
Click Next: Configure Schema to proceed
The Report Structure defines the parts of your research report. This is helpful if your organization expects a certain structure in the report. Defining the structure is optional. If you do not provide it, Deep Research will automatically generate a suitable structure for your report based on the instructions you provided in the Research Topic section.
When configuring your report structure, you can:
Click the dropdown menu labeled "start from a template"
Select a pre-defined template that suits your needs
Use the Schema Builder to create your own report structure
Add sections using the Add Section button at the bottom
For each section:
Provide a title and description
Add subparts using the Add Subpart button
Configure each subpart with its own title and description
Rearrange sections and subparts: Drag and drop items to reorder them
Expand/collapse sections: Click the chevron icon next to a section title
Edit content: Click on any field to edit its content
Remove items: Click the trash icon to delete sections or subparts
View JSON: See the raw JSON structure of your schema
Download: Save your schema as a JSON file for future use
Import: Upload a previously saved schema
Clear All: Remove all sections and start fresh
Review your research topic and schema
Click the Start Research button to begin the generation process
The system will automatically switch to the Research Results tab
You'll see a progress tracker on the left showing each phase of the research process
Once the research is complete:
The full report appears in the right panel
Review the content, which follows the structure defined in your schema (if provided)
Download the report as a PDF using the PDF button
To make changes and regenerate:
Navigate back to the Research Topic or Report Schema tab
Make your adjustments
Click Run Again
Be specific: Clearly define what you want to research
Structure matters: Organize your schema logically with appropriate sections
Start small: Begin with a simple schema and expand as needed
Save templates: Create and save schemas for research types you perform regularly
Iterate: Use the "Run Again" button to refine results after reviewing initial output
Deep research can take several minutes to complete, so be patient. If research seems to be taking too long, check your internet connection.
If results don't match expectations, try refining your research question, adding more relevant source documents to you pipeline, or enabling web search to gather additional information from the internet
If a section is missing in the results, ensure your schema is properly structured
In addition to the user interface, the Deep Research feature is also available through a REST API. This enables you to integrate research capabilities into your own applications and workflows.
Initiates a new research process.
URL: /v1/org/[organizationId]/pipelines/[pipelineId]/deep-research
Method: POST
Headers:
Content-Type: application/json
Authorization: Bearer [your-token]
Request Body:
Parameters:
query
(required): The research question or topic
webSearch
(optional): Set to true to include internet data in research
schema
(optional): JSON string representing the report structure
n8n
(optional): Configuration for n8n workflow integration
Response:
Retrieves the results of a research process.
URL: /v1/org/[organizationId]/pipelines/[pipelineId]/deep-research/[researchId]
Method: GET
Headers:
Authorization: Bearer [your-token]
Response:
If ready
is false, the research is still in progress
If success
is false, the error
field will contain error details
events
contains progress updates
markdown
contains the full report in markdown format
When using the API, you'll need to provide a schema in JSON format that defines the structure of your research report.
report_schema: The root array containing all sections
type: The title/heading of a section or subpart
description: Detailed instructions for what should be included
subparts: An optional array of subsections within a main section
Here's an example schema for a financial report:
Below is a more detailed schema example for a financial report:
Be specific in descriptions: Provide clear guidance on what information should be included
Use logical organization: Structure the report with a natural flow from high-level to detailed information
Include context: Add details about relevant metrics, comparisons, or time periods
Balance depth and breadth: Include enough sections to be comprehensive without making the schema unwieldy
Consider your audience: Design the schema with the intended readers' needs in mind
The following example demonstrates how to start a research task and retrieve the results using cURL:
Poll the results endpoint periodically to check if research is complete
Consider implementing webhooks for asynchronous notifications
Store the research ID to access results later
Use proper error handling for cases where research fails