Skip to content

thinkdepthai/Deep_Research

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ThinkDepth.ai Deep Research

Our ThinkDepth.ai deep research 1) addresses the issue of balancing multiple factors for long horizon and complex tasks and 2) addresses the issue of balancing between model capability and structural flexibility. It solves those issues by explicitly reasoning about the self-balancing rules that guide the interaction of different requirements at different stages. For example, those self-balancing rules allow ThinkDepth.ai deep research to explicitly guide the interaction between information gap closing and generation gap closing at different stages.

In the information collection stage, it focuses on closing the information gap by making external web search tool calls while doing a bit of generation gap closing by refining the draft report. Once the information gap is fully closed, it transitions to the final report generation stage. In the final report generation stage, it then fully optimizes for closing the generation gap. This explicit multi-stage self-balancing rules reasoning leads to the development of Self-Balancing Test-Time Diffusion Deep Research algorithm and more effective context engineering. We call this paradigm Self-Balancing Agentic AI.

Check out our blog post for more technical details.

Primary Contact: Paichun Lin's LinkedIn | paichul@cs.stanford.edu

Setup

Please follow the instructions to run the demo:

  1. Install uv
pip install uv
  1. in ~/.zshrc or ~/.bashrc, enter your API keys info:
export OPENAI_API_KEY='Your OpenAI API Key'

export TAVILY_API_KEY='Your Tavily API Key'
  1. Install all packages
uv sync
  1. Run the demo in the notebook
uv run jupyter notebook thinkdepthai_deepresearch.ipynb

Experiments

ThinkDepth.ai deep research was ranked #1 and established a new state-of-art result on DeepResearch Bench on Nov 17th, 2025. It is #1 open source deep research agent on DeepResearch Bench.

  • It outperformed Google Gemini 2.5 pro deep research by 2.78%.
  • It outperformed OpenAI deep research by 6.04%.
  • It outperformed Anthropic Claude deep research by 7.45%.
benchmark

DeepResearch Bench Leaderboard Screenshot

huggingface_leaderboard

DeepResearch Bench Leaderboard

Example Generated Report

For the task "Write a paper to discuss the influence of AI interaction on interpersonal relations, considering AI's potential to fundamentally change how and why individuals relate to each other.", a snapshot of the generated report is shared below:

report

About

ThinkDepth.ai Deep Research

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published