Home/AI & LLMs/screen-monitor

screen-monitor

Safe
AI & LLMs

Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL).

SKILL.md

# Screen Monitor This skill provides two ways for the agent to see and interact with your screen. ## 🟢 Path A: Fast Share (WebRTC) *Best for: Quick visual checks, restricted browsers, or non-technical environments.* ### Tools - **`screen_share_link`**: Generates a local WebRTC portal URL. - **`screen_analyze`**: Captures the current frame from the portal and analyzes it with vision. **Usage:** ```bash # Get the link bash command:"{baseDir}/references/get-share-url.sh" # Analyze bash command:"{baseDir}/references/screen-analyze.sh" ``` --- ## 🔵 Path B: Full Control (Browser Relay) *Best for: Deep debugging, UI automation, and clicking/typing in tabs.* ### Setup 1. Run `clawdbot browser extension install`. 2. Load the unpacked extension from `clawdbot browser extension path`. 3. Click the Clawdbot icon in your Chrome toolbar to **Attach**. ### Tools - **`browser action:snapshot`**: Take a precise screenshot of the attached tab. - **`browser action:click`**: Interact with elements (requires `profile="chrome"`). --- ## Technical Details - **Port**: 18795 (WebRTC Backend) - **Files**: - `web/screen-share.html`: The sharing portal. - `references/backend-endpoint.js`: Frame storage server.

More in AI & LLMs