fix: increase connection pool resilience for Neon cold starts (#422)
after 5 minutes idle, Neon scales down, and the next cold start takes 5-10s.
the first requests after an idle period would exhaust the pool (5 connections),
causing all subsequent requests to fail with 500 errors.
changes:
- pool_size: 5 → 10 (room for more concurrent requests during a cold start)
- max_overflow: 0 → 5 (burst capacity to 15 connections)
- connection_timeout: 3s → 10s (wait for Neon wake-up)
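
a minimal sketch of the new values and the arithmetic behind "burst capacity to 15". the `PoolSettings` class and `engine_kwargs` helper are hypothetical names, not code from this repo, and mapping `connection_timeout` onto SQLAlchemy's `pool_timeout` argument is an assumption about how the setting is wired up:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PoolSettings:
    """Pool values from this commit (hypothetical container, for illustration)."""

    pool_size: int = 10               # was 5
    max_overflow: int = 5             # was 0
    connection_timeout: float = 10.0  # seconds, was 3

    @property
    def max_connections(self) -> int:
        # Persistent pool connections plus temporary overflow connections.
        return self.pool_size + self.max_overflow

    def engine_kwargs(self) -> dict:
        # Assumption: connection_timeout is passed to SQLAlchemy as pool_timeout,
        # i.e. how long a request waits for a free connection before giving up.
        return {
            "pool_size": self.pool_size,
            "max_overflow": self.max_overflow,
            "pool_timeout": self.connection_timeout,
        }


settings = PoolSettings()
old_settings = PoolSettings(pool_size=5, max_overflow=0, connection_timeout=3.0)

print(old_settings.max_connections, settings.max_connections)
```

under that assumption, the result of `settings.engine_kwargs()` could be unpacked into `sqlalchemy.create_engine(url, **kwargs)`, which accepts `pool_size`, `max_overflow`, and `pool_timeout` for its default queue pool.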
this is a recurrence of the Nov 17 incident. that fix addressed the
queue listener's asyncpg connection but not the SQLAlchemy pool.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-authored-by: Claude <noreply@anthropic.com>
authored by zzstoatzz.io and Claude, committed by GitHub
d4b6d70e
124a1fc6