Skip to content

Agent Tools

The agent has access to built-in tools organized by category. Tool availability depends on the current mode.

Planning Tools

ToolDescriptionPlanExecute
ask_userAsk structured questions (2-4 options + free text per question)yesno
plan_toolCreate/update task plans, track resources, generate completion reports. In Execute mode, create and add operations require user approval.yesyes

Research Tools

ToolDescriptionPlanExecute
papersSearch papers, read ArXiv, get citations, find code/datasetsyesyes
web_searchBrave web searchyesyes
researchSpawn an independent research sub-agent with its own contextnoyes
github_read_fileRead files from GitHub reposyesyes
github_list_reposList repos for a user/orgyesyes
github_find_examplesFind code examples by filename patternyesyes
github_search_reposSearch GitHub repositories by queryyesyes
github_get_readmeGet README from a repositoryyesyes

Papers Operations

OperationSourceDescription
searchOpenAlexFull-text search with year/citation filters (primary)
arxiv_searcharXivSearch arXiv directly — best for ML/CS/Physics preprints
semantic_searchSemantic ScholarSearch with abstracts, good for recent papers
trendingOpenAlexRecent highly-cited papers
detailsOpenAlex + CrossRefFull metadata, abstract, OA links
read_paperarXiv (ar5iv)Parse HTML into sections, read by name/number
citationsOpenAlexReferences + cited-by with batch fetching
recommendOpenAlexRelated works
author_papersOpenAlexPapers by a specific author
find_codePapers With CodeGitHub repos linked to papers
find_datasetsPapers With CodeDatasets linked to papers

Search source recommendations

  • arXiv search: Latest ML/AI preprints, CS research, physics papers
  • OpenAlex search: Broad academic coverage across all fields, great for citation data
  • Semantic Scholar: When you need abstracts or Semantic Scholar IDs

Research Sub-Agent

The research tool spawns an independent sub-agent with its own context window. The parent agent sees nested tool calls streamed in real-time. Useful for deep dives that would consume too much of the main conversation's context.

Writing Tool

ToolDescriptionPlanExecute
writingPaper authoring — manage outline, write sections, update bibliographynoyes

The writing tool manages a writing project stored in the database:

  • Outline: Define paper structure (sections, subsections)
  • Sections: Write/update individual sections with auto-save
  • Bibliography: Manage citations and references
  • Auto-save: All changes persist to the database immediately, surviving across workers and restarts

Paper preview and client-side export (Markdown/LaTeX) are available in the Paper tab in the UI.

Filesystem Tools

ToolDescriptionPlanExecute
readRead files with line numbersyesyes
writeCreate/overwrite filesnoyes
editFind-and-replace in filesnoyes
list_dirList directory contentsyesyes
glob_filesFind files by glob patternyesyes
grep_searchSearch file contentsyesyes

In Plan mode, only read-only filesystem tools are available.

Execution Tools

ToolDescriptionPlanExecute
bashExecute shell commands (Docker-isolated when available)noyes
sandboxRun code in Docker containers, SSH remotes, or Modal cloudnoyes

Sandbox Types

TypeDescription
Local (Docker)Docker container on the host machine
SSHRemote machine via SSH
ModalCloud sandbox via Modal

Workspace Tools

ToolDescriptionPlanExecute
workspaceProject workspace operations — knowledge graph, notes, search, failure logsyesyes

Workspace Operations

OperationDescription
statusView workspace summary (file counts, knowledge graph size, recent failures)
searchSearch files by name or content
noteSave a research note with topic and content
knowledge_addAdd entity to the knowledge graph
knowledge_relateAdd relationship between entities
knowledge_querySearch entities in the knowledge graph
knowledge_summaryGet full knowledge graph context for the conversation
recent_failuresView recent tool/API failure logs

See Projects & Workspaces for details on the knowledge graph entity and relationship types.

Mode Restrictions

Tools are filtered based on the current mode before being sent to the LLM. See Modes for details on the enforcement layers.

In summary:

  • Plan mode: ask_user, plan_tool, workspace, read-only filesystem, web search, papers, GitHub
  • Execute mode: Everything except ask_user