Python Infrastructure System-reliability concerns for Python services, grouped because real code uses them together: a task you queue (background-jobs) needs retries (resilience) and instrumentation (observability) on the same call path. Scope routing | If you need to… | Read | |---|---| | Design a task queue, schedule recurring jobs, or run async workers (Celery, RQ, asyncio task pools) | | | Decide what to retry, with what backoff, and when to stop (tenacity patterns, idempotency, circuit breakers) | | | Instrument a service with structured logs, metrics, and traces (structlog, OpenTelemetr…