Open Source
Explore the latest AI open-source projects from GitHub and HuggingFace.
Explore the latest AI open-source projects from GitHub and HuggingFace.
## Introduction Chrome DevTools MCP is an official Google project that exposes Chrome browser debugging and inspection capabilities to AI coding agents through the Model Context Protocol. With nearly 30,000 GitHub stars, this server bridges the gap between AI agents and real browser workflows, enabling agents to control, inspect, debug, and performance-test live Chrome instances. Whether you're using Gemini, Claude, Cursor, or GitHub Copilot, Chrome DevTools MCP gives your AI assistant the same browser superpowers that developers use daily through the DevTools panel. The project represents a significant move by Google to make browser automation a first-class capability for AI agents. Rather than relying on fragile screenshot-based approaches or limited DOM parsing, Chrome DevTools MCP provides structured access to 29 browser tools covering everything from navigation and input automation to network inspection, performance profiling, and memory analysis. ## Architecture and Design Chrome DevTools MCP is built on a clean layered architecture: | Layer | Purpose | Technology | |-------|---------|--------| | MCP Server | Protocol handler for agent communication | Model Context Protocol | | Tool Registry | 29 categorized browser tools with JSON schemas | TypeScript | | Chrome Bridge | Browser connection and lifecycle management | Puppeteer | | Result Pipeline | Automatic wait, capture, and formatting of tool outputs | Custom middleware | The server manages Chrome browser instances through Puppeteer, translating MCP tool calls into DevTools Protocol commands. A key architectural decision is the automatic result waiting system -- when an agent triggers a navigation or interaction, the server automatically waits for the page to settle before returning results, eliminating common timing issues that plague browser automation. Tools are organized into functional categories (navigation, interaction, inspection, performance, debugging, etc.) with consistent JSON input/output schemas. Each tool returns structured data rather than raw protocol responses, making it easy for LLMs to interpret results. ## Key Capabilities **Navigation and Page Control**: Navigate to URLs, go back/forward, reload pages, and manage browser history. The server handles redirect chains, SPA navigation events, and authentication flows transparently. **Input Automation**: Click elements, type text, scroll, hover, drag-and-drop, and interact with form controls. Element targeting supports CSS selectors, XPath, ARIA attributes, and coordinate-based interaction. The system automatically scrolls elements into view and waits for interactability. **DOM Inspection**: Query the DOM tree, read element properties, compute styles, and extract text content. Agents can inspect shadow DOM, iframes, and web component internals. Results include accessibility properties for a11y testing. **Network Inspection**: Monitor network requests and responses, inspect headers, view request/response bodies, and analyze loading waterfall data. Supports filtering by resource type, URL pattern, and status code. **Performance Profiling**: Run Lighthouse audits, capture Performance traces, measure Core Web Vitals, and identify layout shifts and long tasks. Results are structured for easy comparison across runs. **Screenshot and Visual Capture**: Full-page screenshots, element-specific captures, and viewport snapshots. Supports device emulation for responsive testing. **Console and Debugging**: Read console logs, evaluate JavaScript expressions, set breakpoints, and inspect variable values. Agents can debug runtime errors by examining stack traces and scope chains. **Memory Analysis**: Capture heap snapshots, identify memory leaks, and track allocation timelines. This enables agents to diagnose memory-related performance issues. ## Developer Integration Setup requires Node.js and Chrome: ```bash npx @anthropic-ai/chrome-devtools-mcp ``` For Claude Code integration: ```json { "mcpServers": { "chrome-devtools": { "command": "npx", "args": ["@anthropic-ai/chrome-devtools-mcp"] } } } ``` Once connected, agents can issue commands like: ``` Navigate to https://example.com Click the "Sign In" button Run a Lighthouse performance audit Capture a screenshot of the hero section ``` All 29 tools are automatically available through the MCP protocol with full schema documentation. ## Limitations Chrome DevTools MCP requires a local Chrome installation, limiting use in headless CI environments without additional setup. The 29-tool surface area means agents need clear instructions about which tools to use for specific tasks, as LLMs may not always select the optimal tool. Cross-origin restrictions still apply -- the server cannot bypass CORS or access cross-origin iframe content. Performance profiling generates large data payloads that may consume significant context window space. The project currently supports only Chromium-based browsers, excluding Firefox and Safari. Some advanced DevTools features like the Coverage panel and Application panel are not yet exposed as MCP tools. ## Who Should Use This Chrome DevTools MCP is essential for developers who want their AI coding agents to help with frontend debugging, performance optimization, and browser testing. QA engineers building AI-assisted test automation workflows gain structured browser interaction capabilities. Web performance consultants can leverage AI agents to run and interpret Lighthouse audits at scale. Full-stack developers who frequently switch between code and browser can keep their AI agent informed about runtime behavior without manual copy-pasting of console output.