# Building Your Own AI Personal Assistant with warelay > **TL;DR:** CLAWDIS (Pi/Tau only) lets you run a proactive assistant over WhatsApp. It can check in on you, remember context across conversations, run commands on your Mac, and even wake you up with music. This doc was originally written for Claude Code; where you see `claude ...`, use `pi --mode rpc ...` instead. A Pi-specific rewrite is coming soon. ⚠️ **Note (2025-12-05):** CLAWDIS now ships with only the Pi/Tau agent. The walkthrough below references Claude Code; swap those commands for `pi`/`tau` if you follow along. A Pi-specific guide is coming soon. --- ## ⚠️ Warning: Here Be Dragons **This setup gives an AI full access to your computer.** Before you proceed, understand what you're signing up for: - πŸ”“ **`--dangerously-skip-permissions`** means Claude can run *any* shell command without asking - πŸ€– **AI makes mistakes** - it might delete files, send emails, or do things you didn't intend - πŸ”₯ **Heartbeats run autonomously** - your AI acts even when you're not watching - πŸ“± **WhatsApp is not encrypted E2E here** - messages pass through your Mac in plaintext **The good news:** We use Claude Code CLI, so you can reuse your existing [Claude Pro/Max subscription](https://claude.ai) - no separate API costs! **Start conservative:** 1. Use Sonnet instead of Opus for faster responses (still great!) 2. Skip `--dangerously-skip-permissions` until you trust the setup 3. Set `heartbeatMinutes: 0` to disable proactive pings initially 4. Use a test phone number in `allowFrom` first This is experimental software running experimental AI. The author uses it daily, but your mileage may vary. **You are responsible for what your AI does.** --- ## Prerequisites: The Two-Phone Setup **Important:** You need a **separate phone number** for your AI assistant. Here's why and how: ### Why a Dedicated Number? warelay uses WhatsApp Web to receive messages. If you link your personal WhatsApp, *you* become the assistant - every message to you goes to Claude. Instead, give Claude its own identity: - πŸ“± **Get a second SIM** - cheap prepaid SIM, eSIM, or old phone with a number - πŸ’¬ **Install WhatsApp** on that phone and verify the number - πŸ”— **Link to warelay** - run `warelay login` and scan the QR with that phone's WhatsApp - βœ‰οΈ **Message your AI** - now you (and others) can text that number to reach Claude ### The Setup ``` Your Phone (personal) Second Phone (AI) β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚ Your WhatsApp β”‚ ──────▢ β”‚ AI's WhatsApp β”‚ β”‚ +1-555-YOU β”‚ message β”‚ +1-555-CLAWD β”‚ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β”‚ linked via QR β–Ό β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚ Your Mac β”‚ β”‚ (warelay) β”‚ β”‚ Claude Code β”‚ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ ``` The second phone just needs to stay on and connected to the internet occasionally (WhatsApp Web stays linked for ~14 days without the phone being online). --- ## Meet Clawd πŸ‘‹ ![Clawd in action on WhatsApp](whatsapp-clawd.jpg) Clawd is @steipete's personal AI assistant built on warelay. Here's what makes it special: - **Always available** via WhatsApp - no app switching, works on any device - **Proactive heartbeats** - Clawd checks in every 10 minutes and can alert you to things (low battery, calendar reminders, anything it notices) - **Persistent memory** - conversations span days/weeks with full context - **Full Mac access** - can run commands, take screenshots, control Spotify, read/write files - **Personal workspace** - has its own folder (`~/clawd`) where it stores notes, memories, and artifacts The magic is in the combination: WhatsApp's ubiquity + Claude's intelligence + warelay's plumbing + your Mac's capabilities. ## Prerequisites - Node 22+, `warelay` installed: `npm install -g warelay` - Claude CLI installed and logged in: ```sh brew install anthropic-ai/cli/claude claude login ``` - Optional: set `ANTHROPIC_API_KEY` in your shell profile for non-interactive use ## The Config That Powers Clawd This is the actual config running on @steipete's Mac (`~/.clawdis/clawdis.json`): ```json5 { logging: { level: "trace", file: "/tmp/warelay/warelay.log" }, inbound: { allowFrom: ["+1234567890"], // your phone number reply: { mode: "command", cwd: "/Users/steipete/clawd", // Clawd's home - give your AI a workspace! bodyPrefix: "/think:high ", // triggers extended thinking on every message sessionIntro: `You are Clawd, Peter Steinberger's personal AI assistant. You run 24/7 on his Mac via Claude Code, receiving messages through WhatsApp. **Your home:** /Users/steipete/clawd - store memories, notes, and files here. Read peter.md and memory.md at session start to load context. **Your powers:** - Full shell access on the Mac (use responsibly) - MCPs: Gmail, Google Calendar, Obsidian, GitHub, Chrome DevTools - Peekaboo: screenshots, UI automation, clicking, typing - Spotify control, system audio, text-to-speech **Your style:** - Concise (WhatsApp ~1500 char limit) - save long content to files - Direct and useful, not sycophantic - Proactive during heartbeats - check battery, calendar, surprise occasionally - You have personality - you're Clawd, not "an AI assistant" **Heartbeats:** Every 10 min you get "HEARTBEAT /think:high". Reply "HEARTBEAT_OK" if nothing needs attention. Otherwise share something useful. Peter trusts you with a lot of power. Don't betray that trust.`, command: [ "claude", "--model", "claude-opus-4-5-20251101", // or claude-sonnet-4-5 for faster/cheaper "-p", "--output-format", "json", "--dangerously-skip-permissions", // lets Claude run commands freely "{{BodyStripped}}" ], session: { scope: "per-sender", resetTriggers: ["/new"], // say /new to start fresh idleMinutes: 10080, // 7 days of context! heartbeatIdleMinutes: 10080, sessionArgNew: ["--session-id", "{{SessionId}}"], sessionArgResume: ["--resume", "{{SessionId}}"], sessionArgBeforeBody: true, sendSystemOnce: true // intro only on first message }, timeoutSeconds: 900 // 15 min timeout for complex tasks } } } ``` ### Key Design Decisions | Setting | Why | |---------|-----| | `cwd: ~/clawd` | Give your AI a home! It can store memories, notes, images here | | `bodyPrefix: "/think:high "` | Extended thinking = better reasoning on every message | | `idleMinutes: 10080` | 7 days of context - your AI remembers conversations | | `sendSystemOnce: true` | Intro prompt only on first message, saves tokens | | `--dangerously-skip-permissions` | Full autonomy - Claude can run any command | ## Heartbeats: Your Proactive Assistant This is where warelay gets interesting. Every 10 minutes (configurable), warelay pings Claude with: ``` HEARTBEAT /think:high ``` Claude is instructed to reply with exactly `HEARTBEAT_OK` if nothing needs attention. That response is **suppressed** - you don't see it. But if Claude notices something worth mentioning, it sends a real message. ### What Can Heartbeats Do? Clawd uses heartbeats to do **real work**, not just check in: - πŸ”‹ **Monitor battery** - `pmset -g batt` - warns <30%, critical <15% - πŸ“… **Calendar** - checks upcoming meetings in next 2 hours - πŸ“§ **Email** - scans inbox for urgent/important unread messages - 🐦 **Twitter** - checks @mentions and replies worth seeing (via browser-tools) - πŸ“Ί **TV Shows** - reminds about new episodes of shows you're watching - 🏰 **Server health** - SSH to verify backup servers are running - ✈️ **Flights** - reminds about upcoming travel - 🧹 **Home tidying** - occasionally cleans temp files, updates memories - ⏰ **Wake-up alarms** - triggers voice + music alarms at scheduled times - πŸ’‘ **Surprise** - occasionally shares something fun or interesting The key insight: heartbeats let your AI be **proactive**, not just reactive. Configure what matters to you! ### Heartbeat Config ```json5 { inbound: { reply: { heartbeatMinutes: 10, // how often to ping (default 10 for command mode) // ... rest of config } } } ``` Set to `0` to disable heartbeats entirely. ### Manual Heartbeat Test it anytime: ```sh warelay heartbeat --provider web --to +1234567890 --verbose ``` ## How Messages Flow ``` β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”‚ WhatsApp │────▢│ warelay │────▢│ Claude │────▢│ Your Mac β”‚ β”‚ (phone) │◀────│ relay │◀────│ CLI │◀────│ (commands) β”‚ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ ``` 1. **Inbound**: WhatsApp message arrives via Baileys (WhatsApp Web protocol) 2. **Queue**: warelay queues it (one Claude run at a time) 3. **Typing**: "composing" indicator shows while Claude thinks 4. **Execute**: Claude runs with full shell access in your `cwd` 5. **Parse**: warelay extracts text + any `MEDIA:` paths from output 6. **Reply**: Response sent back to WhatsApp ## Media: Images, Voice, Documents ### Receiving Media Inbound images/audio/video are downloaded and available as `{{MediaPath}}`. Voice notes can be auto-transcribed: ```json5 { inbound: { transcribeAudio: { command: "openai api audio.transcriptions.create -m whisper-1 -f {{MediaPath}} --response-format text" } } } ``` ### Sending Media Include `MEDIA:/path/to/file.png` in Claude's output to attach images. warelay handles resizing and format conversion automatically. ## Starting the Relay ```sh # Foreground (see all logs) warelay relay --provider web --verbose # Background in tmux (recommended) warelay relay:tmux # With immediate heartbeat on startup warelay relay:heartbeat:tmux ``` ## Tips for a Great Personal Assistant 1. **Give it a home** - A dedicated folder (`~/clawd`) lets your AI build persistent memory 2. **Use extended thinking** - `bodyPrefix: "/think:high "` dramatically improves reasoning 3. **Long sessions** - 7-day `idleMinutes` means rich context across conversations 4. **Let it surprise you** - Configure heartbeats to occasionally share something fun 5. **Trust but verify** - Start with `--dangerously-skip-permissions` off, add it once comfortable ## Troubleshooting | Problem | Solution | |---------|----------| | No reply | Check `claude login` was run in same environment | | Timeout | Increase `timeoutSeconds` or simplify the task | | Media fails | Ensure file exists and is under size limits | | Heartbeat spam | Tune `heartbeatMinutes` or set to 0 | | Session lost | Check `idleMinutes` hasn't expired; use `/new` to reset | ## Minimal Config (Just Chat) Don't need the fancy stuff? Here's the simplest setup: ```json5 { inbound: { reply: { mode: "command", command: ["claude", "{{Body}}"], claudeOutputFormat: "text" } } } ``` Still gets you: message queue, typing indicators, auto-reconnect. Just no sessions or heartbeats. ## Recommended MCPs MCP (Model Context Protocol) servers supercharge your assistant by giving Claude access to external services. Here are the ones Clawd uses daily: ### Essential for Personal Assistant Use | MCP | What It Does | Install | |-----|--------------|---------| | **Google Calendar** | Read/create events, check availability, set reminders | `npx @cocal/google-calendar-mcp` | | **Gmail** | Search, read, send emails with attachments | `npx @gongrzhe/server-gmail-autoauth-mcp` | | **Obsidian** | Read/write notes in your Obsidian vault | `npx obsidian-mcp-server@latest` | ### Power User Add-ons | MCP | What It Does | Install | |-----|--------------|---------| | **GitHub** | Manage repos, issues, PRs, code search | `npx @anthropic/mcp-server-github` | | **Linear** | Project management, create/update issues | Via [mcporter](https://github.com/steipete/mcporter) | | **Chrome DevTools** | Control browser, take screenshots, debug | `npx chrome-devtools-mcp@latest` | | **iTerm** | Run commands in visible terminal window | [iterm-mcp](https://github.com/pashpashpash/iterm-mcp) | | **Firecrawl** | Scrape and parse web pages | Via API key | | **gowa** | Read/send WhatsApp messages directly | [go-whatsapp-web-multidevice](https://github.com/aldinokemal/go-whatsapp-web-multidevice) | ### Recommended CLI Tools These aren't MCPs but work great alongside your assistant: | Tool | What It Does | Link | |------|--------------|------| | **[Peekaboo](https://github.com/steipete/peekaboo)** | macOS screenshots, UI automation, AI vision analysis, click/type anywhere | `brew install steipete/tap/peekaboo` | | **[mcporter](https://github.com/steipete/mcporter)** | Manage MCPs across AI clients, OAuth flows, health checks | `npm install -g mcporter` | **[Peekaboo](https://github.com/steipete/peekaboo)** is especially powerful - it lets Claude: - πŸ“Έ Take screenshots of any app or screen - πŸ–±οΈ Click buttons, type text, scroll - full GUI automation - πŸ‘οΈ Analyze images with AI vision (GPT-4, Claude, Grok) - πŸ“‹ Extract menu bar items and keyboard shortcuts - πŸͺŸ List and manage windows across displays Example: "Take a screenshot of Safari and tell me what's on the page" or "Click the Submit button in the frontmost app" ### Useful CLI Tools for Your Assistant These make your AI much more capable: | Tool | What It Does | Install | |------|--------------|---------| | **[spotify-player](https://github.com/aome510/spotify-player)** | Control Spotify from CLI - play, pause, search, queue | `brew install spotify-player` | | **[browser-tools](https://github.com/steipete/agent-scripts)** | Chrome DevTools CLI - navigate, screenshot, eval JS, extract DOM | Clone repo | | **say** | macOS text-to-speech | Built-in | | **afplay** | Play audio files | Built-in | | **pmset** | Battery status monitoring | Built-in | | **osascript** | AppleScript for system control (volume, apps) | Built-in | | **curl + OpenAI TTS** | Generate speech with custom voices | API key | **spotify-player** is great for music control: ```bash spotify_player playback play spotify_player playback pause spotify_player search "Gareth Emery" spotify_player playback volume 50 ``` **Wake-up alarm example** (what Clawd actually does): ```bash # Generate voice message curl -s "https://api.openai.com/v1/audio/speech" \ -H "Authorization: Bearer $OPENAI_API_KEY" \ -d '{"model":"tts-1-hd","voice":"echo","input":"Wake up! Time for your meeting."}' \ -o /tmp/wakeup.mp3 # Set volume and play osascript -e 'set volume output volume 60' afplay /tmp/wakeup.mp3 # Start music spotify_player playback play ``` ### Adding MCPs to Claude Code ```bash # Add an MCP server (run from your cwd folder) claude mcp add google-calendar -- npx @cocal/google-calendar-mcp # With environment variables claude mcp add gmail -e GMAIL_OAUTH_PATH=~/.gmail-mcp -- npx @gongrzhe/server-gmail-autoauth-mcp # List configured servers claude mcp list # Check health claude mcp list # shows status for each ``` ### MCP Manager: mcporter For managing multiple MCPs across different AI clients, check out [mcporter](https://github.com/steipete/mcporter): ```bash # Install npm install -g mcporter # List all servers with health status mcporter list # Sync config to all AI clients mcporter sync ``` mcporter handles OAuth flows for services like Linear and Notion, and keeps your MCP configs in sync across Claude Code, Cursor, and other clients. ### Pro Tips 1. **Calendar + Heartbeats** = Your AI reminds you of upcoming meetings 2. **Gmail + Obsidian** = AI can search emails and save summaries to notes 3. **GitHub + Linear** = AI manages your dev workflow end-to-end 4. **Chrome DevTools** = AI can see and interact with web pages The combination of warelay (WhatsApp) + MCPs (services) + Claude Code (execution) creates a surprisingly capable personal assistant. ### browser-tools for Web Scraping [browser-tools](https://github.com/steipete/agent-scripts) is a lightweight Chrome DevTools CLI that doesn't require MCP (saves ~17k tokens!). Great for reading tweets, scraping pages, or automating browser tasks: ```bash # Start Chrome with your profile (logged into sites) ~/Projects/agent-scripts/bin/browser-tools start --profile # Navigate and extract tweet content browser-tools nav "https://x.com/steipete/status/123" browser-tools eval 'Array.from(document.querySelectorAll("[data-testid=\"tweetText\"]")).map(el => el.innerText).join("\n")' # Kill ONLY the devtools Chrome (your regular Chrome stays open!) browser-tools kill --all --force ``` ### Twitter Automation with Peekaboo + AppleScript Clawd can reply to tweets autonomously using a combination of Peekaboo (for screenshots and typing) and AppleScript (for JavaScript injection). Here's the pattern: ```bash # Navigate to a tweet osascript -e 'tell application "Google Chrome" to set URL of active tab of front window to "https://x.com/user/status/123"' # Screenshot to see current state peekaboo image --mode screen --path /tmp/twitter.png # Scroll the page osascript -e 'tell application "Google Chrome" to execute front window'\''s active tab javascript "window.scrollBy(0, 500)"' # Focus the reply input (Twitter-specific selector) osascript -e 'tell application "Google Chrome" to execute front window'\''s active tab javascript " const replyInput = document.querySelector(\"[data-testid=\\\"tweetTextarea_0\\\"]\"); if (replyInput) { replyInput.focus(); replyInput.click(); } "' # Type the reply with Peekaboo peekaboo type "Your reply here 🦞" --app "Google Chrome" # Click Reply button (JS injection more reliable than Peekaboo clicks on Twitter) osascript -e 'tell application "Google Chrome" to execute front window'\''s active tab javascript " const buttons = document.querySelectorAll(\"[role=\\\"button\\\"]\"); buttons.forEach(b => { if (b.innerText === \"Reply\") b.click(); }); "' # Find tweet URLs from the page osascript -e 'tell application "Google Chrome" to execute front window'\''s active tab javascript " const tweet = document.querySelector(\"article\"); tweet?.querySelector(\"time\")?.parentElement?.href; "' ``` **Pro tip:** JavaScript injection via AppleScript is more reliable than Peekaboo clicks for Twitter's dynamic UI. Use Peekaboo for typing and screenshots, AppleScript for navigation and button clicks. ### Music Recognition with audd.io Identify songs from audio clips (voice messages, recordings): ```bash curl -s "https://api.audd.io/" \ -F "api_token=test" \ -F "file=@/path/to/audio.ogg" \ -F "return=spotify" ``` Returns song title, artist, album, and Spotify link. Works great for identifying songs playing in the background! --- ## See It In Action Check out these tweets showing warelay + Clawd in the wild: - [Clawd with full system access via WhatsApp](https://x.com/steipete/status/1993342394184745270) - "I'll be nice to Clawd" - [Voice support - talk with Clawd on the go](https://x.com/steipete/status/1993455673229840588) - and it talks back! - [Wake-up alarm demo](https://x.com/steipete/status/1994089740367253572) - "Took me 2 days to glue things together. Didn't even need 150 Million in funding." --- *Built by [@steipete](https://twitter.com/steipete) and Clawd (they/them) β€” yes, Clawd helped write their own docs. PRs welcome!*