Schrute

Teach your AI a website once. After that, it can repeat the job much faster.

Schrute is for repeated website tasks.

It supports both paths: you can teach it by having it watch what happens in a real browser, or you can let it discover useful backend structure on its own when a site exposes it. Schrute can do either well, and it is up to you which path to use for a given site or task. Either way, it turns what it finds into reusable tools. That means the first run can happen in the browser, or you can start from direct discovery when that fits better, and later runs can often skip the UI and go straight to the site's backend.

If you keep asking an AI to do the same website task over and over, Schrute is the layer that helps it stop starting from scratch every time.

Learn from a real browser session
Reuse your logged-in state
Replay repeatable tasks without brittle click scripts
Fall back to the browser when direct replay is not possible
Use it from MCP, CLI, REST, Python, or TypeScript

Why People Use It

Without Schrute:

An agent opens the site again
Clicks through the UI again
Waits for the page again
Pays the same latency again

With Schrute:

You teach it the task once
Schrute learns the request pattern behind the page
The next run can often call the learned action directly

That is especially useful for things like:

pulling the same dashboard data every day
checking prices or market pages repeatedly
searching a site with the same flow many times
reusing internal tools that only work when you are already logged in

Quick Start

npm install -g schrute
schrute setup

Install

Choose the install path that fits your environment:

npm CLI (recommended)
```
npm install -g schrute
schrute setup
```

Homebrew

brew install sheeki03/tap/schrute
schrute setup

Docker

Pull the image:

docker pull ghcr.io/sheeki03/schrute:latest

Then run Schrute with persistent data and an auth token:

docker run --rm \
  -p 3000:3000 \
  -p 3001:3001 \
  -e SCHRUTE_AUTH_TOKEN=my-secret \
  -v schrute-data:/data \
  ghcr.io/sheeki03/schrute:latest

A Docker Hub mirror can also be published when enabled for the repository.

Standalone binaries

Download the latest archive for Linux, macOS, or Windows from GitHub Releases, unpack it, and run schrute.

If you want to use Schrute from an AI client over MCP:

{
  "mcpServers": {
    "schrute": {
      "command": "npx",
      "args": ["-y", "schrute", "serve"]
    }
  }
}

Ways To Use Schrute

You can use the same learned skills in different ways depending on your workflow:

MCP Best when you want Claude Code, Cursor, Cline, Windsurf, or another MCP client to call learned website actions as tools.
CLI Best when you want to explore, record, inspect, and run skills manually from the terminal.
REST API Best when you want another app, script, or backend service to call Schrute over HTTP.
Python and TypeScript clients Best when you want a lightweight client package instead of calling raw HTTP endpoints yourself.

So Schrute is not tied to one interface. You can teach it a task once, then reuse that same learned task from the interface that fits your workflow.

First Run In 2 Minutes

# 1. Start Schrute
schrute serve

# 2. Open a site in a browser session
schrute explore https://httpbin.org

# 3. Start recording a task
schrute record --name get_ip

# 4. In the opened browser, go to:
#    https://httpbin.org/ip

# 5. Stop recording
schrute stop

# 6. Poll the background pipeline job until skill generation completes
schrute pipeline <job-id>

# 7. Run the learned skill
schrute execute httpbin_org.get_ip.v1 --yes

What just happened:

Schrute watched the browser traffic for that action.
It found the real request behind the page.
It saved that request as a reusable skill.
You can now run that learned action again without manually driving the page.

Commands Most People Will Use

schrute explore https://example.com
schrute record --name my_action
schrute stop
schrute pipeline <job-id>
schrute execute my_skill.v1

schrute skills list --status active
schrute skills search "bitcoin price"
schrute skills show <skill-id>

schrute workflow create --site example.com --name summary --spec '{"steps":[...]}'
schrute workflow run example_com.summary.v1

schrute discover https://api.example.com
schrute doctor
schrute trust

What Schrute Can Do Today

Schrute is no longer just "record and replay." Here is what the current product does, in practical terms:

Learns reusable skills from real browsing You do the task once in a browser. Schrute turns what it learned into named actions you can run again later.
Generates skills in the background When you run schrute stop, Schrute does not make you wait for all processing to finish in the foreground. It gives you a pipeline job and keeps building the skills in the background. You can check progress with schrute pipeline <job-id>.
Searches and explains what it has already learned Once you have multiple skills, Schrute helps you find the right one with skills search, inspect it with skills show, validate it, export it, and manage it without digging through raw data.
Builds workflows from multiple skills If one reusable action is not enough, Schrute can chain several read-only skills together into a larger workflow. That is useful for multi-step tasks like "get account info, then fetch usage, then return a summary."
Discovers APIs even before you record Schrute can scan a site for useful backend clues such as OpenAPI specs, GraphQL endpoints, sitemaps, platform fingerprints, and WebMCP tools. That helps you start faster on sites that already expose a structured backend.
Reuses the browser session you already trust If you are already logged into Chrome or an Electron app, Schrute can attach to that session instead of forcing you through login again. This is especially useful for internal tools and dashboards.
Supports more than one browser session You are not limited to one browser context. Schrute can manage multiple named sessions so different sites, accounts, or attached browsers do not all get mixed together.
Handles sites that still need a live browser Some sites cannot be cleanly replayed as direct HTTP calls because of Cloudflare, anti-bot checks, or other browser-only behavior. Schrute does not pretend otherwise. It keeps those tasks on a browser-backed path so they still work.
Lets you call the same learned skills from different places The same learned actions can be used from MCP, the CLI, REST, and the Python or TypeScript client packages. That means you do not have to re-teach the task separately for each integration.
Lets you move and maintain what you learned Schrute can export and import learned site bundles, run health checks with doctor, show a trust posture report with trust, and keep an audit trail of executions.
Can improve and maintain learned actions over time Schrute can validate skills, track amendments, run optimization on degraded skills, and keep using safer fallback paths when a direct path stops being reliable.
Can work with site-declared tools as well as learned traffic On some sites, Schrute can discover useful backend structure such as WebMCP tools, OpenAPI specs, or GraphQL endpoints in addition to what it learns from browser traffic.

Feature Overview

If you are trying to understand "what is actually included here?", this is the practical feature map:

Explore and record Open a site, perform an action, and let Schrute watch the traffic behind it.
Background processing Generate skills after recording without blocking the terminal.
Skill catalog List, search, inspect, validate, export, revoke, delete, and manage learned skills.
Execution Run learned skills directly from CLI, MCP, REST, or client SDKs.
Workflow building Combine multiple read-only skills into one higher-level reusable flow.
Discovery Scan a site for OpenAPI, GraphQL, sitemaps, platform patterns, WebMCP tools, and other useful backend signals.
Browser session reuse Attach to a browser you already have open and logged into.
Multi-session support Keep separate browser sessions for different sites, accounts, or experiments.
Fallback execution Keep browser-backed execution for sites that cannot safely or reliably use direct replay.
Import and export Move learned site bundles between environments without shipping credentials.
Operational tools Use doctor, trust reporting, audit logs, and pipeline status to understand what Schrute is doing.
Client access Use the same learned actions through MCP, CLI, REST, Python, and TypeScript.

How Schrute Runs A Task

Schrute tries to use the simplest reliable path:

Browser first while the task is still being learned
Direct replay later when the request is stable and safe to reuse
Browser fallback when the site truly requires a live browser

So the goal is not "force everything into direct HTTP." The goal is "use the fastest safe execution mode that actually works."

That is why sites behind Cloudflare or other anti-bot systems can still be useful in Schrute. If direct replay is blocked, Schrute keeps them on a browser-backed path instead of pretending they should work the same way as a public API.

Tested Workflows

Real examples recorded and tested. These show what Schrute does on actual sites — not hypothetical scenarios.

httpbin.org — Public API learning

Task: Get my public IP address

Schrute learned 4 clean REST endpoints with zero noise and no auth.

Pipeline: 4 requests captured, 4 signal, 0 noise, 4 skills generated
Skills: httpbin_org.get_ip.v1, get_get.v1, get_headers.v1, get_user_agent.v1

Run	Latency	Tier
1	1,029ms	Browser-proxied (Tier 3)
3	777ms	Browser-proxied (Tier 3)
5	273ms	Browser-proxied (Tier 3)
After promotion	~5-50ms	Direct HTTP (Tier 1)

After 5+ consecutive successful validations, the skill promoted to Tier 1 — a 20x latency improvement with zero LLM cost.

en.wikipedia.org — Parameterized API discovery

Task: Search Wikipedia for articles about artificial intelligence

Schrute automatically discovered which query parameters vary (the search term) and which are constants (action, format, list type).

Pipeline: 4 requests, 4 signal, 0 noise, 2 skills generated
Learned: en_wikipedia_org.get_api_php.v1 — GET /w/api.php
- Discovered input: query.srsearch (varies between requests)
- Baked-in constants: action=query, list=search, format=json, origin=*
Latency: 1,033ms (browser-proxied)

One skill takes a search query and returns structured Wikipedia results.

dog.ceo — Noise filtering

Task: List all dog breeds and get a random dog image

Schrute separated real API calls from page chrome (CSS, images, scripts).

Pipeline: 6 requests, 3 signal, 3 noise, 2 skills generated
Skills: dog_ceo.get_all.v1 (breeds list), dog_ceo.get_random.v1 (random image)
Latency: 472-558ms (browser-proxied), promoted to ~5-50ms (Tier 1)

The 3 noise requests (CSS, favicon, scripts) were discarded automatically.

www.coingecko.com — Cloudflare-protected site

Task: Get Bitcoin 24-hour price data

CoinGecko is protected by Cloudflare Turnstile. Schrute detects the challenge, applies a permanent browser_required lock, and uses live Chrome for execution.

Pipeline: 16 requests, 7 noise filtered, 7 signal, 3 skills generated
Key skill: www_coingecko_com.get_24_hours_json.v1
Tier lock: browser_required (permanent — direct HTTP blocked by Cloudflare)

Run	Latency	Method
1	310ms	Browser-proxied with live Chrome bootstrap
2	73ms	Browser-proxied (cookies warm)
3	165ms	Browser-proxied (different skill, same site)
4	63ms	Browser-proxied (warm)

63-310ms for a Cloudflare-protected site — faster than any approach requiring LLM inference per action.

news.ycombinator.com — Server-rendered HTML

Task: Get the front page of Hacker News

Hacker News is fully server-rendered HTML with no JSON APIs.

Pipeline: 12 requests, 0 signal, 10 noise, 2 document navigations
0 skills generated (correct behavior)

With HTML extraction (Primitive 4), Schrute can now also generate skills for HTML-only sites using CSS selectors to extract structured data from the response.

Benchmarks

Site	Skill	First Run	Warm	Tier Ceiling	Auth	Noise Filtered
httpbin.org	`get_ip`	1,029ms	~5-50ms	Tier 1 (direct)	None	0/4
dog.ceo	`get_all`	551ms	~5-50ms	Tier 1 (direct)	None	3/6
en.wikipedia.org	`get_api_php`	1,033ms	—	Tier 3	None	0/4
www.coingecko.com	`get_24_hours_json`	310ms	63ms	Tier 3 (locked)	CF cookies	7/16

Where It Fits Best

Schrute is a strong fit when:

the site makes predictable HTTP or JSON requests behind the UI
the task is repeated often
you already have the right browser auth state
you want reusable tools instead of one-off browser scripts

Schrute is a weaker fit when:

the site is mostly server-rendered HTML with no meaningful backend calls to learn
the workflow depends heavily on canvas, WebSockets, or visual-only interactions
the task is truly one-time and not worth teaching

Common Use Cases

Schrute is especially useful for:

Repeated internal dashboard checks Example: pull the same account, usage, or reporting view every day without re-clicking the whole UI.
Logged-in business tools Example: use your existing browser session to access an internal admin panel, support tool, CMS, or analytics product.
Price, market, and listing lookups Example: repeatedly fetch the same market page or structured data endpoint after teaching the browser path once.
Search and lookup workflows Example: teach a site search flow once, then reuse it with different inputs.
Agent tool creation Example: turn a repeated browser task into a reusable MCP tool for an AI coding or operations workflow.
Multi-step read-only automations Example: fetch one piece of data, use it in a second call, and return a final combined answer through a workflow skill.
Sites with a mix of easy and hard paths Example: let Schrute use direct replay where it works, but keep a live-browser fallback for the parts that truly need it.

Reusing Your Logged-In Browser

If you already have a browser session with the right login state, Schrute can attach to it instead of making you sign in again.

Typical pattern:

chrome --remote-debugging-port=9222

After that, Schrute can connect to the running browser through CDP using its MCP or REST surfaces.

This is especially useful for:

internal dashboards
admin tools
sites with multi-step login flows
flows where the browser already has the right cookies and session state

REST API And SDKs

If you want to call Schrute from scripts or apps:

schrute config set server.authToken my-secret
schrute serve --http --port 3000

Then call it over HTTP:

curl -X POST http://127.0.0.1:3000/api/v1/execute \
  -H "Authorization: Bearer my-secret" \
  -H "Content-Type: application/json" \
  -d '{"skillId":"httpbin_org.get_ip.v1","params":{}}'

Client packages:

TypeScript: npm install @schrute/client
Python: pip install schrute-client

MCP HTTP is also available at:

http://127.0.0.1:3001/mcp

Safety And Storage

Schrute does not blindly replay everything it sees.

Before a learned skill runs, Schrute applies safeguards such as:

domain allowlists
redirect validation
method and path checks
approval for first execution when needed
audit logging
rate limiting

Credentials are not exported with skill bundles, and dangerous raw browser execution tools are blocked.

For the full security model, see SECURITY.md.

Development

Prerequisites: Node.js >= 22

git clone https://github.com/sheeki03/schrute.git
cd schrute
npm install
npm run build

Useful commands:

npm run build
npm test
npm run dev

Release Channels

The primary release surfaces are:

npm CLI: schrute
Docker image: GHCR, with an optional Docker Hub mirror
GitHub Releases: Linux, macOS, and Windows standalone binaries
Homebrew: sheeki03/tap/schrute

The devcontainer feature remains in this repository for local development, but it is not published from this repository. That keeps GitHub Packages focused on the main runtime image instead of showing a second package entry for the feature.

More

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
.claude-plugin		.claude-plugin
.devcontainer/features/cli		.devcontainer/features/cli
.github		.github
Formula		Formula
agents		agents
assets		assets
bin		bin
commands		commands
docs		docs
hooks		hooks
integrations/hermes/skills/web-automation/schrute-web-skills		integrations/hermes/skills/web-automation/schrute-web-skills
native		native
prompts		prompts
scripts		scripts
skills		skills
src		src
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.mcp.json		.mcp.json
.mise.toml		.mise.toml
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
package.json		package.json
pkg.config.json		pkg.config.json
pkgx.yaml		pkgx.yaml
smithery.yaml		smithery.yaml
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts
vitest.live.config.ts		vitest.live.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Schrute

Why People Use It

Quick Start

Install

Ways To Use Schrute

First Run In 2 Minutes

Commands Most People Will Use

What Schrute Can Do Today

Feature Overview

How Schrute Runs A Task

Tested Workflows

httpbin.org — Public API learning

en.wikipedia.org — Parameterized API discovery

dog.ceo — Noise filtering

www.coingecko.com — Cloudflare-protected site

news.ycombinator.com — Server-rendered HTML

Benchmarks

Where It Fits Best

Common Use Cases

Reusing Your Logged-In Browser

REST API And SDKs

Safety And Storage

Development

Release Channels

More

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Schrute

Why People Use It

Quick Start

Install

Ways To Use Schrute

First Run In 2 Minutes

Commands Most People Will Use

What Schrute Can Do Today

Feature Overview

How Schrute Runs A Task

Tested Workflows

httpbin.org — Public API learning

en.wikipedia.org — Parameterized API discovery

dog.ceo — Noise filtering

www.coingecko.com — Cloudflare-protected site

news.ycombinator.com — Server-rendered HTML

Benchmarks

Where It Fits Best

Common Use Cases

Reusing Your Logged-In Browser

REST API And SDKs

Safety And Storage

Development

Release Channels

More

License

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages