Skip to content

browser-act/skills

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

317 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

BrowserAct Logo

BrowserAct Skills

Discord GitHub Stars License

Website X (Twitter) LinkedIn YouTube


Browser automation CLI built for AI agents. Get past anti-bot walls, hand off to humans across platforms when stuck, run parallel tasks without cross-contamination, and isolate multiple accounts in independent browsers.

Why BrowserAct

The browser an AI agent needs has to reach places standard tools can't, let a human seamlessly take over when the agent is stuck, keep parallel tasks from cross-contaminating, and be designed for LLM reasoning β€” not human-written scripts. A browser for agents must get four things right.

1. Break through blocks β€” three progressive layers

  1. Environment layer β€” stealth fingerprint spoofing, TLS rotation, proxy switching. The vast majority of blocks never trigger.
  2. Execution layer β€” solve-captcha auto-solves CAPTCHAs; stealth-extract pulls protected pages in one command.
  3. Human layer β€” remote-assist generates a live URL; the user takes over from any device, and the agent continues seamlessly when done.

2. Three browser modes β€” by real-world scenario

Mode Scenario Key trait
chrome Reuse local Chrome login state Profile import or CDP attach
stealth privacy mode Frictionless batch scraping without login Fresh fingerprint per session + proxy rotation, zero residue
stealth fixed identity Logged-in accounts Β· multi-browser parallel Stable fingerprint + stable IP, stable account identity, not flagged as bots

3. Zero-interference concurrency β€” every agent in its own lane

  • Cross-browser parallel β€” independent cookies, fingerprints, proxies. Sites cannot correlate them.
  • Same-browser multi-session β€” shared login state, independent execution, tasks don't block each other.
  • Privacy mode β€” fresh fingerprint and empty profile per session, zero residue when done.

4. Designed for agent reasoning β€” not human scripts

  • Compact text output β€” indexed text format, several times more token-efficient than JSON or HTML.
  • Indexed interaction β€” state returns an indexed list; click 3 / input 2 "...". No DOM parsing required.
  • Semantic memory β€” every browser carries a desc, matched to tasks by meaning.
  • Concurrency-safe β€” session ownership + explicit naming. Multi-agent operation never conflicts.

Security: confirmation gating β€” sensitive operations (browser create / delete, Profile import, proxy changes, security and privacy toggles) require explicit user approval. Prior approvals do not carry over. Enforced at the Skill layer, not a configuration toggle.


And More

  • Better headless β€” Default headless without disrupting users; stealth headless that isn't detected.
  • Cross-platform remote handoff β€” Any device opens the link to take over, and the agent continues seamlessly.

Install

Tell your AI agent:

Install browser-act. Skill source: https://github.com/browser-act/skills/tree/main/browser-act . Verify it works after installation.

Installation details β†’


Quick Start

# Extract protected page content (zero config)
browser-act stealth-extract https://example.com

# Full browser automation
browser-act --session my-task browser open <id> https://example.com
browser-act --session my-task state          # See clickable elements
browser-act --session my-task click 3        # Click by index
browser-act --session my-task input 2 "hi"   # Type into a field

More examples and workflows β†’

The agent runs get-skills at the start of each session β€” gets environment state, browser list, and commands in one call:

browser-act get-skills core --skill-version 2.0.2

How agents discover and use BrowserAct β†’


Compatibility

OS: Windows, macOS, Linux

Agents: Claude Code Β· Cursor Β· VS Code Β· OpenCode Β· OpenClaw Β· Codex Β· Gemini CLI β€” works with any agent that can execute shell commands and load Skills.


What's Free

Almost everything is free. Only two features require payment: managed proxies (Dynamic / Static), and stealth browsers beyond the first 5.

Feature Free
(NoΒ Signup)
Free
(LoginΒ Only)
Paid
Browser automation, Chrome / Chrome-direct βœ“ βœ“ βœ“
Stealth browser (≀ 5), stealth-extract, solve-captcha, remote-assist, privacy mode, Skill Forge β€” βœ“ βœ“
Stealth browser (> 5), Dynamic / Static proxy β€” β€” βœ“

Documentation

Full documentation covers anti-blocking, browser modes, sessions and concurrency, headless and remote handoff, agent design, the Skills system, and the complete command reference.

Read the full documentation β†’


Also From BrowserAct

Skill Forge β€” Your Personal Scraping Engineer

Need to extract data from the same website repeatedly at scale? Don't write scrapers by hand. Skill Forge explores a site once, discovers its APIs and data patterns, generates a deploy-ready Skill package, then runs reliably without re-exploration β€” 500 or 5,000 records through the same stable path.

Any website. Any data. One command to start:

Install browser-act-skill-forge. Skill source: https://github.com/browser-act/skills/tree/main/browser-act-skill-forge . Verify it works after installation.

Then tell your agent what you need:

"Forge a Skill that extracts job listings from LinkedIn β€” title, company, salary, URL. I'll run 300 keywords later."

Skill Forge documentation β†’

Solutions Catalog

30+ pre-built Skills already generated by Skill Forge, ready to install and run. Covers Amazon, Google Maps, YouTube, Reddit, WeChat, Zhihu, and more.

Browse the full Solutions Catalog β†’

Build Your Own

Can't find what you need above? Generate a custom Skill for any website in minutes β€” no coding required. Just describe what data you want or what action to perform, and Skill Forge handles the rest.


πŸ’– Support the Project

BrowserAct Skills is free and open source. If it saves you time, please give us a ⭐ Star β€” it keeps the project alive and helps us ship more skills.

GitHub Stars

🎁 Bonus: Once you star the repository, you can join our Discord and post in the #claim-500-credits channel to receive 500 free credits!

🀝 Community & Support

Built with ❀️ by the BrowserAct Team

Star History

Star History Chart

About

Browser automation CLI built for AI agents. Break through anti-bot walls, hand off to humans across platforms when stuck. Parallel multi-task execution, independent multi-session operation, isolated multi-account browsing.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages