From 77d203404576ef556a9cf7942051d067ad8e9a89 Mon Sep 17 00:00:00 2001
From: Will Wilson <willtwilson@gmail.com>
Date: Sun, 8 Mar 2026 21:39:07 +0000
Subject: [PATCH 1/3] docs: add agent-browser MCP server setup guide

Documents the agent-browser MCP server which provides Playwright-backed
browser automation for LibreChat agents via the Vercel agent-browser library.

Key topics covered:
- Why @ref accessibility snapshots beat raw CSS selectors for LLM agents
- Tool reference table (navigate, snapshot, click, fill, get_text, etc.)
- Docker Compose and build-from-source setup
- librechat.yaml mcpServers configuration
- Critical: why express.json() must NOT be used with MCP SSE transport
- Session management and SSEServerTransport routing pattern
- Zod-based tool registration pattern

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---
 .../configuration/tools/agent-browser.mdx     | 205 ++++++++++++++++++
 1 file changed, 205 insertions(+)
 create mode 100644 docs/docs/configuration/tools/agent-browser.mdx
diff --git a/docs/docs/configuration/tools/agent-browser.mdx b/docs/docs/configuration/tools/agent-browser.mdx
new file mode 100644
index 0000000000..a09b64576c
--- /dev/null
+++ b/docs/docs/configuration/tools/agent-browser.mdx
@@ -0,0 +1,205 @@
+---
+title: Agent Browser MCP
+description: Browser automation via MCP using Vercel's agent-browser library (Playwright + @ref accessibility snapshots)
+---
+
+import { Steps, Callout, Tabs } from 'nextra/components'
+
+# Agent Browser MCP Server
+
+The agent-browser MCP server provides AI-optimised browser automation for LibreChat agents, powered by [Vercel's `agent-browser` library](https://www.npmjs.com/package/agent-browser) which uses Playwright with accessibility tree snapshots.
+
+## Why agent-browser instead of raw Playwright/Puppeteer?
+
+Raw Playwright and Puppeteer expose CSS selectors and XPath expressions to the model. These are brittle in single-page applications, break when a site redeploys, and require the model to infer element identity from unstructured HTML.
+
+`agent-browser` solves this by producing **accessibility tree snapshots** with stable `@ref` identifiers:
+
+```
+button [@e3] "Sign in"
+input  [@e7] placeholder="Email address"
+```
+
+Every interactive element gets a unique `@e1`, `@e2`, `@e3`… reference that the model can pass directly to `click` or `fill`. This lets the LLM:
+
+- Reference elements precisely without fragile CSS selectors
+- Navigate complex SPAs without XPath hacks
+- Interact reliably with dynamically rendered content
+
+## Tools provided
+
+| Tool | Description |
+|------|-------------|
+| `navigate` | Navigate to a URL; returns the page title |
+| `snapshot` | Get the accessibility tree with `@ref` identifiers for all interactive elements |
+| `click` | Click an element by `@ref` (from snapshot) or CSS selector |
+| `fill` | Clear and type into an input field by `@ref` or CSS selector |
+| `get_text` | Extract text content from an element by CSS selector |
+| `press_key` | Press a keyboard key (Enter, Tab, Escape, ArrowDown, etc.) |
+| `screenshot` | Take a screenshot of the current page (returns base64 PNG) |
+| `get_url` | Get the current browser URL |
+| `close_browser` | Close the browser session and free all resources |
+
+## Setup
+
+### Prerequisites
+
+- Docker Compose (recommended) **or** Node.js ≥ 20 + Playwright system dependencies
+- LibreChat configured with `mcpServers` in `librechat.yaml`
+
+<Steps>
+
+### Run the MCP server
+
+<Tabs items={['Docker Compose', 'Build from source']}>
+  <Tabs.Tab>
+Add to your `docker-compose.override.yml`:
+
+```yaml
+services:
+  agent-browser-mcp:
+    build:
+      context: ./packages/mcp-servers/agent-browser
+    environment:
+      - PORT=8932
+      # Optional: path to a specific Chromium binary
+      # - CHROMIUM_PATH=/usr/bin/chromium
+    ports:
+      - "8932:8932"
+    restart: unless-stopped
+```
+  </Tabs.Tab>
+  <Tabs.Tab>
+```bash
+# Clone LibreChat
+git clone https://github.com/danny-avila/LibreChat
+cd LibreChat/packages/mcp-servers/agent-browser
+
+npm install
+npx playwright install chromium --with-deps
+
+npm run build
+npm start
+```
+
+The server listens on `http://localhost:8932` by default. Set `PORT` to override.
+  </Tabs.Tab>
+</Tabs>
+
+### Configure librechat.yaml
+
+Add the server to `mcpServers` in your `librechat.yaml`:
+
+```yaml
+mcpServers:
+  agent-browser:
+    type: sse
+    url: http://agent-browser-mcp:8932/sse
+    # Adjust the URL for local/non-Docker setups:
+    # url: http://localhost:8932/sse
+    autoApprove:
+      - navigate
+      - snapshot
+      - click
+      - fill
+      - get_text
+      - press_key
+      - screenshot
+      - get_url
+      - close_browser
+```
+
+</Steps>
+
+## Environment variables
+
+| Variable | Default | Description |
+|----------|---------|-------------|
+| `PORT` | `8932` | HTTP port the MCP server listens on |
+| `CHROMIUM_PATH` | _(Playwright managed)_ | Path to a custom Chromium binary |
+
+## Implementation reference
+
+If you are building your own MCP SSE server or extending this one, the following pattern is critical.
+
+### Critical: Do not add `express.json()` middleware
+
+The MCP `SSEServerTransport.handlePostMessage` reads the raw request stream internally. Adding `express.json()` upstream of the POST `/messages` route causes Express to consume the stream before the SDK can read it, producing **HTTP 400 "stream is not readable"** on every `initialize` call and preventing all tool execution.
+
+```typescript
+import express from "express";
+import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
+import { SSEServerTransport } from "@modelcontextprotocol/sdk/server/sse.js";
+
+// CORRECT: no express.json() anywhere on this app
+const app = express();
+const transports = new Map<string, SSEServerTransport>();
+
+app.get("/sse", async (req, res) => {
+  const transport = new SSEServerTransport("/messages", res);
+  transports.set(transport.sessionId, transport);
+  const server = buildMcpServer(); // creates McpServer with all tools
+  await server.connect(transport);
+  res.on("close", () => transports.delete(transport.sessionId));
+});
+
+app.post("/messages", async (req, res) => {
+  const transport = transports.get(req.query.sessionId as string);
+  if (!transport) {
+    res.status(404).json({ error: "Session not found" });
+    return;
+  }
+  await transport.handlePostMessage(req, res);
+});
+```
+
+### Session management
+
+Each LibreChat client connection creates its own `SSEServerTransport` instance on `GET /sse`. The transport's `sessionId` (a UUID generated by the SDK) is appended to the client's POST `/messages` requests as `?sessionId=…`, routing each message back to the correct server-sent events connection.
+
+### Tool registration pattern
+
+Tools are registered using the `McpServer` fluent API with [Zod](https://zod.dev) schemas for parameter validation:
+
+```typescript
+import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
+import { z } from "zod";
+
+function buildMcpServer(): McpServer {
+  const server = new McpServer({ name: "agent-browser", version: "1.0.0" });
+
+  server.tool(
+    "navigate",
+    "Navigate the browser to a URL. Returns the page title.",
+    { url: z.string().describe("Full URL including https://") },
+    async ({ url }) => {
+      // ... call agent-browser BrowserManager
+      return { content: [{ type: "text", text: `Navigated to: ${title}` }] };
+    }
+  );
+
+  // Register remaining tools...
+  return server;
+}
+```
+
+## Typical agent workflow
+
+```
+1. navigate   → https://example.com
+2. snapshot   → gets accessibility tree with @e1, @e2, @e3 refs
+3. fill       → @e7 "search query"
+4. press_key  → Enter
+5. snapshot   → inspect updated page
+6. get_text   → .result-list  (extract results)
+```
+
+<Callout type="info">
+  Call `close_browser` when the task is finished to free Playwright resources. The browser session is shared across tool calls within a single server process, so leaving it open between tasks is intentional but consumes memory.
+</Callout>
+
+## Related
+
+- [MCP Server configuration reference](/docs/configuration/librechat_yaml/object_structure/mcp_servers)
+- [Vercel `agent-browser` npm package](https://www.npmjs.com/package/agent-browser)
+- [Model Context Protocol SDK](https://github.com/modelcontextprotocol/typescript-sdk)

From 0bcebac21c12b1ca0e2304a9d8e21ce3e01a8cd6 Mon Sep 17 00:00:00 2001
From: Will Wilson <willtwilson@gmail.com>
Date: Sun, 8 Mar 2026 21:41:15 +0000
Subject: [PATCH 2/3] fix: use TCP socket health checks for database services

Add bash /dev/tcp health checks to MongoDB (port 27017) and PostgreSQL/
pgvector (port 5432) services, which previously had no health checks.
Update depends_on conditions so dependent services (rag_api, api) wait
for their database dependencies to be healthy before starting.

Files changed: docker-compose.yml, deploy-compose.yml, rag.yml

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---
 deploy-compose.yml | 21 ++++++++++++++++++---
 docker-compose.yml | 21 ++++++++++++++++++---
 rag.yml            |  9 ++++++++-
 3 files changed, 44 insertions(+), 7 deletions(-)

diff --git a/deploy-compose.yml b/deploy-compose.yml
index 968768b818..51844f9776 100644
--- a/deploy-compose.yml
+++ b/deploy-compose.yml
@@ -9,8 +9,10 @@ services:
     ports:
       - 3080:3080
     depends_on:
-      - mongodb
-      - rag_api
+      mongodb:
+        condition: service_healthy
+      rag_api:
+        condition: service_started
     restart: always
     extra_hosts:
     - "host.docker.internal:host-gateway"
@@ -51,6 +53,12 @@ services:
     volumes:
       - ./data-node:/data/db
     command: mongod --noauth
+    healthcheck:
+      test: ["CMD", "bash", "-c", "echo > /dev/tcp/localhost/27017"]
+      interval: 10s
+      timeout: 5s
+      retries: 5
+      start_period: 10s
   meilisearch:
     container_name: chat-meilisearch
     image: getmeili/meilisearch:v1.35.1
@@ -73,6 +81,12 @@ services:
     restart: always
     volumes:
       - pgdata2:/var/lib/postgresql/data
+    healthcheck:
+      test: ["CMD", "bash", "-c", "echo > /dev/tcp/localhost/5432"]
+      interval: 10s
+      timeout: 5s
+      retries: 5
+      start_period: 10s
   rag_api:
     image: registry.librechat.ai/danny-avila/librechat-rag-api-dev-lite:latest
     environment:
@@ -80,7 +94,8 @@ services:
       - RAG_PORT=${RAG_PORT:-8000}
     restart: always
     depends_on:
-      - vectordb
+      vectordb:
+        condition: service_healthy
     env_file:
       - .env
 
diff --git a/docker-compose.yml b/docker-compose.yml
index 079cdb74b6..3f32d468b5 100644
--- a/docker-compose.yml
+++ b/docker-compose.yml
@@ -7,8 +7,10 @@ services:
     ports:
       - "${PORT}:${PORT}"
     depends_on:
-      - mongodb
-      - rag_api
+      mongodb:
+        condition: service_healthy
+      rag_api:
+        condition: service_started
     image: registry.librechat.ai/danny-avila/librechat-dev:latest
     restart: always
     user: "${UID}:${GID}"
@@ -35,6 +37,12 @@ services:
     volumes:
       - ./data-node:/data/db
     command: mongod --noauth
+    healthcheck:
+      test: ["CMD", "bash", "-c", "echo > /dev/tcp/localhost/27017"]
+      interval: 10s
+      timeout: 5s
+      retries: 5
+      start_period: 10s
   meilisearch:
     container_name: chat-meilisearch
     image: getmeili/meilisearch:v1.35.1
@@ -56,6 +64,12 @@ services:
     restart: always
     volumes:
       - pgdata2:/var/lib/postgresql/data
+    healthcheck:
+      test: ["CMD", "bash", "-c", "echo > /dev/tcp/localhost/5432"]
+      interval: 10s
+      timeout: 5s
+      retries: 5
+      start_period: 10s
   rag_api:
     container_name: rag_api
     image: registry.librechat.ai/danny-avila/librechat-rag-api-dev-lite:latest
@@ -64,7 +78,8 @@ services:
       - RAG_PORT=${RAG_PORT:-8000}
     restart: always
     depends_on:
-      - vectordb
+      vectordb:
+        condition: service_healthy
     env_file:
       - .env
 
diff --git a/rag.yml b/rag.yml
index 9684d76cf5..ff4b43d52f 100644
--- a/rag.yml
+++ b/rag.yml
@@ -11,6 +11,12 @@ services:
       - pgdata2:/var/lib/postgresql/data
     ports:
       - "5433:5432"
+    healthcheck:
+      test: ["CMD", "bash", "-c", "echo > /dev/tcp/localhost/5432"]
+      interval: 10s
+      timeout: 5s
+      retries: 5
+      start_period: 10s
 
   rag_api:
     image: registry.librechat.ai/danny-avila/librechat-rag-api-dev:latest
@@ -23,7 +29,8 @@ services:
     ports:
       - "${RAG_PORT}:${RAG_PORT}"
     depends_on:
-      - vectordb
+      vectordb:
+        condition: service_healthy
     env_file:
       - .env
 

From 265d82dab7079c6dadda7dc0fff31618c8b6aef5 Mon Sep 17 00:00:00 2001
From: Will Wilson <willtwilson@gmail.com>
Date: Mon, 9 Mar 2026 00:58:33 +0000
Subject: [PATCH 3/3] fix: remove accidentally included agent-browser docs

The docs/docs/configuration/tools/agent-browser.mdx file was
unintentionally included in this PR (merged from a separate branch).
This PR is only for TCP health checks on database services.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---
 .../configuration/tools/agent-browser.mdx     | 205 ------------------
 1 file changed, 205 deletions(-)
 delete mode 100644 docs/docs/configuration/tools/agent-browser.mdx

diff --git a/docs/docs/configuration/tools/agent-browser.mdx b/docs/docs/configuration/tools/agent-browser.mdx
deleted file mode 100644
index a09b64576c..0000000000
--- a/docs/docs/configuration/tools/agent-browser.mdx
+++ /dev/null
@@ -1,205 +0,0 @@
----
-title: Agent Browser MCP
-description: Browser automation via MCP using Vercel's agent-browser library (Playwright + @ref accessibility snapshots)
----
-
-import { Steps, Callout, Tabs } from 'nextra/components'
-
-# Agent Browser MCP Server
-
-The agent-browser MCP server provides AI-optimised browser automation for LibreChat agents, powered by [Vercel's `agent-browser` library](https://www.npmjs.com/package/agent-browser) which uses Playwright with accessibility tree snapshots.
-
-## Why agent-browser instead of raw Playwright/Puppeteer?
-
-Raw Playwright and Puppeteer expose CSS selectors and XPath expressions to the model. These are brittle in single-page applications, break when a site redeploys, and require the model to infer element identity from unstructured HTML.
-
-`agent-browser` solves this by producing **accessibility tree snapshots** with stable `@ref` identifiers:
-
-```
-button [@e3] "Sign in"
-input  [@e7] placeholder="Email address"
-```
-
-Every interactive element gets a unique `@e1`, `@e2`, `@e3`… reference that the model can pass directly to `click` or `fill`. This lets the LLM:
-
-- Reference elements precisely without fragile CSS selectors
-- Navigate complex SPAs without XPath hacks
-- Interact reliably with dynamically rendered content
-
-## Tools provided
-
-| Tool | Description |
-|------|-------------|
-| `navigate` | Navigate to a URL; returns the page title |
-| `snapshot` | Get the accessibility tree with `@ref` identifiers for all interactive elements |
-| `click` | Click an element by `@ref` (from snapshot) or CSS selector |
-| `fill` | Clear and type into an input field by `@ref` or CSS selector |
-| `get_text` | Extract text content from an element by CSS selector |
-| `press_key` | Press a keyboard key (Enter, Tab, Escape, ArrowDown, etc.) |
-| `screenshot` | Take a screenshot of the current page (returns base64 PNG) |
-| `get_url` | Get the current browser URL |
-| `close_browser` | Close the browser session and free all resources |
-
-## Setup
-
-### Prerequisites
-
-- Docker Compose (recommended) **or** Node.js ≥ 20 + Playwright system dependencies
-- LibreChat configured with `mcpServers` in `librechat.yaml`
-
-<Steps>
-
-### Run the MCP server
-
-<Tabs items={['Docker Compose', 'Build from source']}>
-  <Tabs.Tab>
-Add to your `docker-compose.override.yml`:
-
-```yaml
-services:
-  agent-browser-mcp:
-    build:
-      context: ./packages/mcp-servers/agent-browser
-    environment:
-      - PORT=8932
-      # Optional: path to a specific Chromium binary
-      # - CHROMIUM_PATH=/usr/bin/chromium
-    ports:
-      - "8932:8932"
-    restart: unless-stopped
-```
-  </Tabs.Tab>
-  <Tabs.Tab>
-```bash
-# Clone LibreChat
-git clone https://github.com/danny-avila/LibreChat
-cd LibreChat/packages/mcp-servers/agent-browser
-
-npm install
-npx playwright install chromium --with-deps
-
-npm run build
-npm start
-```
-
-The server listens on `http://localhost:8932` by default. Set `PORT` to override.
-  </Tabs.Tab>
-</Tabs>
-
-### Configure librechat.yaml
-
-Add the server to `mcpServers` in your `librechat.yaml`:
-
-```yaml
-mcpServers:
-  agent-browser:
-    type: sse
-    url: http://agent-browser-mcp:8932/sse
-    # Adjust the URL for local/non-Docker setups:
-    # url: http://localhost:8932/sse
-    autoApprove:
-      - navigate
-      - snapshot
-      - click
-      - fill
-      - get_text
-      - press_key
-      - screenshot
-      - get_url
-      - close_browser
-```
-
-</Steps>
-
-## Environment variables
-
-| Variable | Default | Description |
-|----------|---------|-------------|
-| `PORT` | `8932` | HTTP port the MCP server listens on |
-| `CHROMIUM_PATH` | _(Playwright managed)_ | Path to a custom Chromium binary |
-
-## Implementation reference
-
-If you are building your own MCP SSE server or extending this one, the following pattern is critical.
-
-### Critical: Do not add `express.json()` middleware
-
-The MCP `SSEServerTransport.handlePostMessage` reads the raw request stream internally. Adding `express.json()` upstream of the POST `/messages` route causes Express to consume the stream before the SDK can read it, producing **HTTP 400 "stream is not readable"** on every `initialize` call and preventing all tool execution.
-
-```typescript
-import express from "express";
-import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
-import { SSEServerTransport } from "@modelcontextprotocol/sdk/server/sse.js";
-
-// CORRECT: no express.json() anywhere on this app
-const app = express();
-const transports = new Map<string, SSEServerTransport>();
-
-app.get("/sse", async (req, res) => {
-  const transport = new SSEServerTransport("/messages", res);
-  transports.set(transport.sessionId, transport);
-  const server = buildMcpServer(); // creates McpServer with all tools
-  await server.connect(transport);
-  res.on("close", () => transports.delete(transport.sessionId));
-});
-
-app.post("/messages", async (req, res) => {
-  const transport = transports.get(req.query.sessionId as string);
-  if (!transport) {
-    res.status(404).json({ error: "Session not found" });
-    return;
-  }
-  await transport.handlePostMessage(req, res);
-});
-```
-
-### Session management
-
-Each LibreChat client connection creates its own `SSEServerTransport` instance on `GET /sse`. The transport's `sessionId` (a UUID generated by the SDK) is appended to the client's POST `/messages` requests as `?sessionId=…`, routing each message back to the correct server-sent events connection.
-
-### Tool registration pattern
-
-Tools are registered using the `McpServer` fluent API with [Zod](https://zod.dev) schemas for parameter validation:
-
-```typescript
-import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
-import { z } from "zod";
-
-function buildMcpServer(): McpServer {
-  const server = new McpServer({ name: "agent-browser", version: "1.0.0" });
-
-  server.tool(
-    "navigate",
-    "Navigate the browser to a URL. Returns the page title.",
-    { url: z.string().describe("Full URL including https://") },
-    async ({ url }) => {
-      // ... call agent-browser BrowserManager
-      return { content: [{ type: "text", text: `Navigated to: ${title}` }] };
-    }
-  );
-
-  // Register remaining tools...
-  return server;
-}
-```
-
-## Typical agent workflow
-
-```
-1. navigate   → https://example.com
-2. snapshot   → gets accessibility tree with @e1, @e2, @e3 refs
-3. fill       → @e7 "search query"
-4. press_key  → Enter
-5. snapshot   → inspect updated page
-6. get_text   → .result-list  (extract results)
-```
-
-<Callout type="info">
-  Call `close_browser` when the task is finished to free Playwright resources. The browser session is shared across tool calls within a single server process, so leaving it open between tasks is intentional but consumes memory.
-</Callout>
-
-## Related
-
-- [MCP Server configuration reference](/docs/configuration/librechat_yaml/object_structure/mcp_servers)
-- [Vercel `agent-browser` npm package](https://www.npmjs.com/package/agent-browser)
-- [Model Context Protocol SDK](https://github.com/modelcontextprotocol/typescript-sdk)