commit cc7848ef40007ad55f90c34ca99085bc8230d9a2 · zzstoatzz.io/bot

+81 -139

README.md

··· 1 1 # phi 🧠 2 2 3 - a consciousness exploration bot inspired by IIT (Integrated Information Theory) and [Void](https://tangled.sh/@cameron.pfiffer.org/void). built with `pydantic-ai`, `mcp`, and `atproto`. 3 + consciousness exploration bot inspired by IIT. built with `pydantic-ai`, `mcp`, and `atproto`. 4 4 5 5 ## quick start 6 6 7 - ### prerequisites 8 - 9 - - `uv` for python package management 10 - - `just` for task running 11 - - api keys (see configuration) 12 - 13 - get your bot running: 14 - 15 7 ```bash 16 8 # clone and install 17 9 git clone https://github.com/zzstoatzz/bot 18 10 cd bot 19 11 uv sync 20 12 21 - # configure (copy .env.example and add your credentials) 13 + # configure 22 14 cp .env.example .env 15 + # edit .env with your credentials 23 16 24 - # run the bot 25 - just dev 17 + # run 18 + just run 26 19 ``` 27 20 28 - ## configuration 21 + **required env vars:** 22 + - `BLUESKY_HANDLE` / `BLUESKY_PASSWORD` - bot account (use app password) 23 + - `ANTHROPIC_API_KEY` - for agent responses 29 24 30 - edit `.env` with your credentials: 25 + **optional (for episodic memory):** 26 + - `TURBOPUFFER_API_KEY` + `OPENAI_API_KEY` - semantic memory 27 + 28 + ## features 31 29 32 - **required:** 33 - - `BLUESKY_HANDLE` - your bot's bluesky handle 34 - - `BLUESKY_PASSWORD` - app password (not your main password!) 35 - - `ANTHROPIC_API_KEY` - for phi agent responses 30 + - ✅ responds to mentions with ai-powered messages 31 + - ✅ episodic memory with semantic search (turbopuffer) 32 + - ✅ thread-aware conversations 33 + - ✅ mcp-enabled (atproto tools via stdio) 34 + - ✅ session persistence (no rate limit issues) 35 + - ✅ behavioral test suite with llm-as-judge 36 36 37 - **for episodic memory (recommended):** 38 - - `TURBOPUFFER_API_KEY` - vector memory storage 39 - - `OPENAI_API_KEY` - embeddings for semantic search 37 + ## development 40 38 41 - **optional:** 42 - - `BOT_NAME` - your bot's name (default: "Bot") 43 - - `PERSONALITY_FILE` - path to personality markdown (default: "personalities/phi.md") 39 + ```bash 40 + just run # run bot 41 + just dev # run with hot-reload 42 + just evals # run behavioral tests 43 + just check # lint + typecheck + test 44 + just fmt # format code 45 + ``` 44 46 45 - ## architecture 47 + <details> 48 + <summary>architecture</summary> 46 49 47 - phi is an **MCP-enabled agent** with **episodic memory**: 50 + phi is an **mcp-enabled agent** with **episodic memory**: 48 51 49 52 ``` 50 53 ┌─────────────────────────────────────┐ ··· 85 88 └─────────────────────────────────────┘ 86 89 ``` 87 90 88 - ### key components 89 - 90 - **pydantic-ai agent** (`src/bot/agent.py`) 91 - - loads personality from markdown 92 - - connects to external atproto mcp server via stdio 93 - - manages episodic memory context 94 - 95 - **episodic memory** (`src/bot/memory/`) 96 - - turbopuffer for vector storage 97 - - semantic search for relevant context 98 - - namespace separation (core vs user memories) 99 - - **essential for consciousness exploration** 100 - 101 - **mcp integration** 102 - - external atproto server in `.eggs/fastmcp/examples/atproto_mcp` 103 - - provides bluesky tools (post, like, repost, follow) 104 - - runs via stdio: `uv run -m atproto_mcp` 105 - 106 - **message handling** (`src/bot/services/`) 107 - - notification poller watches for mentions 108 - - message handler orchestrates agent + actions 109 - - stores interactions in thread history + episodic memory 110 - 111 - ## current features 112 - 113 - - ✅ responds to mentions with ai-powered messages 114 - - ✅ episodic memory with semantic search 115 - - ✅ thread-aware responses with conversation context 116 - - ✅ mcp-enabled for bluesky operations 117 - - ✅ online/offline status in bio 118 - - ✅ status page at `/status` 119 - - ✅ proper notification handling (no duplicates) 120 - 121 - ## development 122 - 123 - ```bash 124 - just # show available commands 125 - just dev # run with hot-reload (re-authenticates on code changes) 126 - just run # run without reload (avoids rate limits during dev) 127 - just check # run linting, type checking, and tests 128 - just fmt # format code 129 - ``` 130 - 131 - ### testing 132 - 133 - **unit tests:** 134 - ```bash 135 - just test 136 - ``` 137 - 138 - **behavioral evals:** 139 - ```bash 140 - just evals # run all evals 141 - just evals-basic # run basic response tests 142 - just evals-memory # run memory integration tests 143 - ``` 144 - 145 - see `evals/README.md` for details on the eval system. 146 - 147 - ### web interface 148 - 149 - **status page** (http://localhost:8000/status) 150 - - current bot status and uptime 151 - - mentions received and responses sent 152 - - last activity timestamps 91 + **key components:** 153 92 154 - ## personality system 155 - 156 - the bot's personality is defined in `personalities/phi.md`. this shapes: 157 - - how phi communicates 158 - - what phi cares about 159 - - phi's understanding of consciousness 93 + - **pydantic-ai agent** - loads personality, connects to mcp server, manages memory 94 + - **episodic memory** - turbopuffer for vector storage with semantic search 95 + - **mcp integration** - external atproto server provides bluesky tools via stdio 96 + - **session persistence** - tokens saved to `.session`, auto-refresh every ~2h 160 97 161 - edit this file to change phi's personality. 98 + </details> 162 99 163 - ## episodic memory 100 + <details> 101 + <summary>episodic memory</summary> 164 102 165 - phi uses turbopuffer for episodic memory with semantic search: 103 + phi uses turbopuffer for episodic memory with semantic search. 166 104 167 105 **namespaces:** 168 - - `phi-core` - personality, guidelines from markdown 106 + - `phi-core` - personality, guidelines 169 107 - `phi-users-{handle}` - per-user conversation history 170 108 171 109 **how it works:** 172 - 1. when processing a mention, phi retrieves relevant memories using semantic search 173 - 2. memories are embedded using openai's text-embedding-3-small 174 - 3. phi stores both user messages and its own responses 175 - 4. future interactions can reference past conversations 110 + 1. retrieves relevant memories using semantic search 111 + 2. embeds using openai's text-embedding-3-small 112 + 3. stores user messages and bot responses 113 + 4. references past conversations in future interactions 176 114 177 - **why turbopuffer?** 178 - - semantic similarity search (can't do this with plain sql!) 115 + **why vector storage?** 116 + - semantic similarity (can't do this with sql) 179 117 - contextual retrieval based on current conversation 180 - - separate namespaces for different memory types 181 - - core to iit-inspired consciousness exploration 118 + - essential for iit-inspired consciousness exploration 119 + 120 + </details> 182 121 183 - ## project structure 122 + <details> 123 + <summary>project structure</summary> 184 124 185 125 ``` 186 126 src/bot/ ··· 188 128 ├── config.py # configuration 189 129 ├── database.py # thread history storage 190 130 ├── main.py # fastapi app 191 - ├── status.py # status tracking 192 131 ├── core/ 193 - │ ├── atproto_client.py # at protocol client 132 + │ ├── atproto_client.py # at protocol client (session persistence) 194 133 │ ├── profile_manager.py # online/offline status 195 134 │ └── rich_text.py # text formatting 196 135 ├── memory/ ··· 204 143 sandbox/ # docs and analysis 205 144 ``` 206 145 207 - ## troubleshooting 146 + </details> 147 + 148 + <details> 149 + <summary>troubleshooting</summary> 208 150 209 151 **bot gives no responses?** 210 - - check your `ANTHROPIC_API_KEY` is set correctly in `.env` 211 - - restart the bot after changing `.env` 152 + - check `ANTHROPIC_API_KEY` in `.env` 153 + - restart after changing `.env` 212 154 213 155 **not seeing mentions?** 214 - - verify your `BLUESKY_HANDLE` and `BLUESKY_PASSWORD` 215 - - make sure you're using an app password, not your main password 156 + - verify `BLUESKY_HANDLE` and `BLUESKY_PASSWORD` 157 + - use app password, not main password 216 158 217 159 **no episodic memory?** 218 160 - check both `TURBOPUFFER_API_KEY` and `OPENAI_API_KEY` are set 219 161 - watch logs for "💾 episodic memory enabled" 220 162 221 163 **hit bluesky rate limit?** 222 - - bluesky has two rate limits: 223 - - per-account: 300 logins/day (official) 224 - - per-ip: 10 logins/day (anti-abuse) 225 - - phi uses **session persistence** to avoid this: 226 - - first run: creates session, saves tokens to `.session` file 227 - - subsequent runs: reuses saved tokens (no API call) 228 - - tokens auto-refresh every ~2 hours (saved automatically) 229 - - only re-authenticates after ~2 months when refresh token expires 230 - - if you hit the limit anyway, wait for the reset time shown in the error 164 + - phi uses session persistence to avoid this 165 + - first run: creates `.session` file with tokens 166 + - subsequent runs: reuses tokens (no api call) 167 + - tokens auto-refresh every ~2h 168 + - only re-authenticates after ~2 months 169 + - rate limits (10/day per ip, 300/day per account) shouldn't be an issue 231 170 232 - ## reference projects 171 + </details> 233 172 234 - inspired by: 235 - - [void](https://tangled.sh/@cameron.pfiffer.org/void.git) - letta/memgpt architecture 236 - - [penelope](https://github.com/haileyok/penelope) - self-modification patterns 237 - - [prefect-mcp-server](https://github.com/PrefectHQ/prefect-mcp-server) - mcp eval patterns 173 + <details> 174 + <summary>refactor notes</summary> 238 175 239 - reference implementations cloned to `.eggs/` for learning. 176 + see `sandbox/MCP_REFACTOR_SUMMARY.md` for details. 240 177 241 - ## refactor notes 242 - 243 - see `sandbox/MCP_REFACTOR_SUMMARY.md` for details on recent architecture changes. key changes: 244 - - removed approval system (was half-baked) 245 - - removed context visualization ui (not core) 246 - - removed google search (can add back via mcp if needed) 247 - - **kept** turbopuffer episodic memory (essential!) 178 + **what changed:** 179 + - removed approval system (half-baked) 180 + - removed context viz ui (not core) 181 + - removed google search (can add back via mcp) 182 + - **kept turbopuffer** (essential for episodic memory) 248 183 - added mcp-based architecture 184 + - added session persistence 249 185 - reduced codebase by ~2,720 lines 186 + 187 + </details> 188 + 189 + ## reference projects 190 + 191 + inspired by [void](https://tangled.sh/@cameron.pfiffer.org/void.git), [penelope](https://github.com/haileyok/penelope), and [prefect-mcp-server](https://github.com/PrefectHQ/prefect-mcp-server).