mcp-native
Connect from any MCP client in one line of config. Claude Desktop, Claude Code, Cursor, or anything that speaks MCP. Supports stdio, streamable HTTP, and SSE transports.
real chromium
Not headless. Not Playwright. A real Chromium browser with DevTools and extensions. Plus real desktop apps — file managers, terminals, office tools. Your agent works in a full desktop environment, not a browser sandbox.
semantic page map
Structured DOM map with element coordinates. Headings, links, forms, buttons — all with viewport positions. Faster and cheaper than vision-only agents. No AI model needed to find a button.
docker isolation
Every desktop is a throwaway container. Memory-limited, network-isolated, no bind mounts. When the task is done, the sandbox disappears. Nothing leaks to the host.
full automation
Screenshot, OCR, click, type, key combos, shell, file I/O, clipboard. 8 core tools + 4 dispatchers covering 100+ actions. Batch multiple operations in a single call to reduce round-trips.
multi-agent
Agent registration, API keys, desktop ownership. Each agent sees only its own desktops. Admin sees everything. Built for teams running multiple agents in parallel.
human oversight
Watch agents work live via RDP or noVNC. Take mouse and keyboard control at any moment. Help the agent, then release. Share desktop links with external viewers.
snapshot/restore
Save full desktop state — files, browser sessions, everything. Restore later. Clone desktops from templates. Perfect for repeatable workflows and environment provisioning.
dashboard
Web UI for managing everything. Live desktop view, agent management, snapshot controls, knowledge browser, session logs. Pure frontend — all operations proxy through the MCP API.
auto-management
Idle desktops pause automatically. Acquired desktops release after configurable TTL. Resources freed when not in use. No babysitting required.
cross-platform
Linux, macOS, Windows. Native Docker on Linux. Docker Desktop on macOS. WSL2 on Windows. ~2 GB RAM per desktop, no GPU needed.