Skip to content

Scaling Recommendations

Chunk 31 does not claim final production sizing. It establishes a repository-owned baseline and identifies where future scaling work should focus.

Current recommendations

  • treat the current runtime as a bootstrap control plane, not final enterprise throughput capacity
  • scale API and worker processes independently because read traffic and queue pressure fail differently
  • keep Redis external and durable for cache and queue coordination before increasing worker concurrency
  • move high-write subsystems off file-backed state before attempting aggressive horizontal scaling

Near-term priorities

  • add live Redis-backed queue and cache load runs in environments that match expected deployment topology
  • add engine-specific database connection pool validation for PostgreSQL, MSSQL, and MariaDB
  • profile worker saturation under real queued execution rather than enqueue-only pressure
  • capture trend baselines over time instead of relying on single-run point measurements

Expected bottlenecks in the current stage

  • file-backed inventory and auth state under concurrent writes
  • job enqueue pressure before worker throughput becomes the dominant limit
  • session issuance churn when many users authenticate at once