Installation

browsy is available as a Rust library, a CLI binary, a Python package, and an MCP server.

Rust library

Add browsy-core to your project:

cargo add browsy-core

This enables the fetch feature by default, which includes HTTP fetching, session management, and web search via reqwest.

Without networking

To use browsy as a pure HTML-to-Spatial-DOM parser with no network dependencies:

cargo add browsy-core --no-default-features

This disables the fetch feature. You get browsy_core::parse(html, width, height) and nothing else -- no Session, no HTTP, no reqwest. Useful for embedding browsy in contexts where you handle fetching yourself.

#![allow(unused)]
fn main() {
// Available without fetch feature
let dom = browsy_core::parse(html, 1920.0, 1080.0);

// Requires fetch feature (enabled by default)
use browsy_core::fetch::Session;
let mut session = Session::new()?;
}

Feature flags

Feature	Default	Description
`fetch`	Yes	HTTP fetching, `Session` API, web search, cookie persistence

CLI

Install the browsy CLI binary:

cargo install browsy

Usage:

# Fetch and parse a live page
browsy fetch https://example.com

# Parse local HTML from stdin
cat page.html | browsy parse

# JSON output
browsy fetch https://example.com --format json

REST API server

The CLI includes a built-in REST API + A2A server:

browsy serve
browsy serve --port 8080
browsy serve --allow-private-network

See REST API for endpoint documentation and A2A Protocol for agent-to-agent integration.

Python

browsy has PyO3 bindings published as the browsy-ai package:

pip install browsy-ai

import browsy

# Parse HTML directly
dom = browsy.parse(html, 1920.0, 1080.0)
print(dom.page_type)
print(dom.suggested_actions)

# Session-based browsing
session = browsy.Session()
dom = session.goto("https://example.com")
session.type_text(19, "hello")
session.click(34)

The Python bindings expose the same Session API as the Rust library, including login, search, enter_code, and all form interaction methods.

Framework integrations

Install browsy with framework-specific extras:

pip install browsy-ai[langchain]   # LangChain tools
pip install browsy-ai[crewai]      # CrewAI tool
pip install browsy-ai[openai]      # OpenAI function calling
pip install browsy-ai[autogen]     # AutoGen integration
pip install browsy-ai[smolagents]  # HuggingFace smolagents
pip install browsy-ai[all]         # All integrations

See Framework Integrations for usage guides.

Requirements

Python 3.9+
No native dependencies (the compiled extension includes everything)

JavaScript / TypeScript

The browsy-ai npm package provides a TypeScript SDK with integrations for LangChain.js, OpenAI, and Vercel AI SDK:

npm install browsy-ai

import { BrowsyClient, BrowsyContext } from "browsy-ai";       // Core SDK
import { getTools } from "browsy-ai/langchain";                  // LangChain.js
import { getToolDefinitions, handleToolCall } from "browsy-ai/openai";  // OpenAI
import { browsyTools } from "browsy-ai/vercel-ai";               // Vercel AI SDK

Framework dependencies are optional peer dependencies -- install only what you need:

npm install browsy-ai @langchain/core    # LangChain.js
npm install browsy-ai openai             # OpenAI
npm install browsy-ai ai                 # Vercel AI SDK

Requires Node.js 22+ and the browsy CLI (cargo install browsy) for the REST server.

See JavaScript / TypeScript for the full SDK guide.

MCP Server

browsy ships an MCP server that exposes the full Session API as tools. This works with Claude Code, Claude Desktop, and any MCP-compatible client.

Install

cargo install browsy-mcp

Configure for Claude Code

Add to your Claude Code MCP configuration (.claude/mcp.json or equivalent):

{
  "mcpServers": {
    "browsy": {
      "command": "browsy-mcp",
      "args": []
    }
  }
}

Configure for Claude Desktop

Add to your Claude Desktop config (claude_desktop_config.json):

{
  "mcpServers": {
    "browsy": {
      "command": "browsy-mcp",
      "args": []
    }
  }
}

Available MCP tools

The MCP server exposes these tools:

Tool	Description
`browse`	Navigate to a URL, returns Spatial DOM
`click`	Click an element by ID
`type_text`	Type into an input field by ID
`check` / `uncheck`	Toggle checkboxes and radio buttons
`select`	Select a dropdown option
`get_page`	Get the current page DOM with form state
`back`	Go back in navigation history
`search`	Web search via DuckDuckGo or Google
`find`	Find elements by text or ARIA role
`login`	Fill and submit a login form
`enter_code`	Fill and submit a verification code
`tables`	Extract structured table data
`page_info`	Get page metadata, type, and suggested actions

Building from source

git clone https://github.com/GhostPeony/browsy
cd browsy

# Build everything (library + CLI + MCP server)
cargo build --release

# Run tests
cargo test -p browsy-core

# Install CLI and MCP server from local source
cargo install --path crates/cli
cargo install --path crates/mcp

browsy Documentation