browserwing-executor
Control browser automation through HTTP API. Supports page navigation, element interaction (click, type, select), data extraction, accessibility snapshot analysis, screenshot, JavaScript execution, and batch operations.
SKILL.md
| Name | browserwing-executor |
| Description | Control browser automation through HTTP API. Supports page navigation, element interaction (click, type, select), data extraction, accessibility snapshot analysis, screenshot, JavaScript execution, and batch operations. |
https://github.com/user-attachments/assets/e5377892-4b88-433a-8620-43b38a2fb28f
Highlights
Native Browser Automation Platform with AI Integration
- Complete Browser Control: 26+ HTTP API endpoints for full-featured browser automation
- Built-in AI Agent: Direct conversational interface for browser automation tasks
- Universal AI Tool Integration: Native MCP & Skills protocol support - compatible with any AI tool that supports these standards
- Visual Script Recording: Record browser actions, edit visually, and replay with precision
- Flexible Export Options: Convert recorded scripts to MCP commands or Skills files for AI tool integration
- Intelligent Data Extraction: LLM-powered semantic extraction supporting OpenAI, Claude, DeepSeek, and more
- Session Management: Robust cookie and storage handling for stable, authenticated browsing sessions
Requirements
- Google Chrome or Chromium installed and accessible in your environment.
Screenshots
<img width="600" alt="BrowserWing Homepage" src="https://raw.githubusercontent.com/browserwing/browserwing/main/docs/assets/screenshot_homepage.png">Turn Scripts Into Claude Skill
You can now combine any scripts into a SKILL.md.
<img width="600" alt="BrowserWing Skill" src="https://raw.githubusercontent.com/browserwing/browserwing/main/docs/assets/screenshot_skill.png">Quick Start
Let Your AI Agent Install It For You
Simply send the following message to your AI agent (OpenClaw, Cursor, Claude Code, etc.):
"Help me install BrowserWing following https://raw.githubusercontent.com/browserwing/browserwing/main/INSTALL.md"
The agent will read the guide and handle the installation, configuration, Chrome setup, and Skill integration automatically.
Option A — Install via Package Manager (recommended)
Using npm:
npm install -g browserwing
browserwing --port 8080
Using pnpm:
pnpm add -g browserwing
browserwing --port 8080
The npm package automatically tests GitHub and Gitee mirrors during installation and selects the fastest one.
⚠️ macOS Users:
If you encounter a "killed" error when running, fix it with:
xattr -d com.apple.quarantine $(which browserwing)
See the macOS Installation Fix Guide for details.
Using Homebrew (macOS/Linux):
# Coming soon
brew install browserwing
Option B — One-Line Install Script
Linux / macOS:
curl -fsSL https://raw.githubusercontent.com/browserwing/browserwing/main/install.sh | bash
Windows (PowerShell):
iwr -useb https://raw.githubusercontent.com/browserwing/browserwing/main/install.ps1 | iex
The script automatically:
- Detects your OS/architecture
- Tests GitHub and Gitee mirrors, selects the fastest one
- Downloads and extracts the binary
- Adds to PATH
Then start BrowserWing:
browserwing --port 8080
# Open http://localhost:8080 in your browser
Note for users in China: The installation script automatically uses Gitee mirror if GitHub is slow.
Option C — Manual Download
Download the prebuilt binary for your OS from Releases:
# Linux/macOS
chmod +x ./browserwing
./browserwing --port 8080
# Windows (PowerShell)
./browserwing.exe --port 8080
Option D — Build from Source
# Install deps (Go + pnpm required)
make install
# Build integrated binary (frontend embedded)
make build-embedded
./build/browserwing --port 8080
# Or build all targets and packages
make build-all
make package
Quick Integration with AI Tools
Three Ways to Use BrowserWing:
1. MCP Server Integration
Configure BrowserWing as an MCP server in any MCP-compatible AI tool:
{
"mcpServers": {
"browserwing": {
"type": "http",
"url": "http://localhost:8080/api/v1/mcp/message"
}
}
}
Paste this configuration into your AI tool's MCP settings to enable browser automation capabilities.
2. Skills File Integration
Download and import the Skills file into any AI tool that supports the Skills protocol:
- Start BrowserWing
- Download SKILL.md from the repository
- Import into your AI tool's Skills settings
- Start automating with natural language commands
Example:
"Navigate to example.com, search for 'AI tools', and extract the top 5 results"
3. Direct AI Agent Interface
Use BrowserWing's built-in AI Agent for immediate browser automation:
- Open BrowserWing web interface at
http://localhost:8080 - Navigate to "AI Agent" section
- Configure your LLM (OpenAI, Claude, DeepSeek, etc.)
- Start conversational browser automation
Export Custom Scripts:
# Export your recorded scripts as Skills or MCP commands
curl -X POST 'http://localhost:8080/api/v1/scripts/export/skill' \
-H 'Content-Type: application/json' \
-d '{"script_ids": []}' \
-o MY_CUSTOM_SCRIPTS.md
Why BrowserWing
Professional Browser Automation with AI Integration
- Universal Protocol Support: Native MCP & Skills implementation works with any compatible AI tool
- Complete Automation API: 26+ HTTP endpoints providing comprehensive browser control capabilities
- Flexible Integration Options: Use as MCP server, Skills file, or standalone AI Agent
- Visual Workflow Builder: Record, edit, and replay browser actions without writing code
- Token-Efficient Design: Optimized for LLM usage with fast performance and minimal token consumption
- Production-Ready: Stable session management, cookie handling, and error recovery
- Extensible Architecture: Convert recorded scripts to reusable MCP commands or Skills files
- Multi-LLM Support: Works with OpenAI, Anthropic, DeepSeek, and other providers
- Enterprise Use Cases: Data extraction, RPA, testing, monitoring, and agent-driven automation
Usage Guide
Getting Started in Three Steps
-
Choose Integration Method
- Copy MCP server configuration for AI tool integration
- Download Skills file for Skills-compatible AI tools
- Or use built-in AI Agent for immediate access
-
Configure Your AI Tool
- Import MCP configuration or Skills file into your preferred AI tool
- Configure LLM settings (API keys, model selection)
- Verify connection to BrowserWing
-
Start Automating
- Control browser through natural language commands
- Record custom scripts for repeated tasks
- Export scripts as MCP commands or Skills for reuse
Advanced Workflows
For Browser Automation:
- Launch and manage multiple browser instances
- Configure profiles, proxies, and browser settings
- Handle cookies and authentication sessions
- Execute complex interaction sequences
For Script Recording:
- Capture clicks, inputs, navigation, and waits
- Edit actions visually in the script editor
- Test and debug with step-by-step replay
- Add variables and conditional logic
For AI Integration:
- Convert scripts to MCP commands or Skills files
- Integrate with multiple LLM providers
- Use semantic extraction for data parsing
- Build agent-driven automation workflows
HTTP API Reference
BrowserWing exposes 26+ RESTful endpoints for programmatic browser control:
Navigation & Control
- Navigate to URLs, go back/forward, refresh pages
- Manage browser windows and tabs
- Handle page loading and timeouts
Element Interaction
- Click, type, select, and hover actions
- File uploads and form submissions
- Keyboard shortcuts and key presses
Data Extraction
- Extract text, HTML, and attributes
- Semantic content analysis with LLM
- Screenshot capture (full page or element)
Advanced Operations
- Execute custom JavaScript
- Manage cookies and local storage
- Batch operations for efficiency
- Wait conditions and element visibility
Complete Documentation: See docs/EXECUTOR_HTTP_API.md for detailed endpoint specifications
Contributing
- Issues and PRs are welcome. Please include clear steps to reproduce or a concise rationale.
- For feature ideas, open a discussion with use cases and expected outcomes.
Community
Discord: https://discord.gg/BkqcApRj twitter: https://x.com/chg80333
Acknowledgements
- Inspired by modern browser automation, agentic workflows, and MCP.
License
- MIT License. See
LICENSE.
Disclaimer
- Do not use for illegal purposes or to violate site terms.
- Intended for personal learning and legitimate automation only.