Skip to content

Job Retry And Failure Rules

InfraLynx job execution must fail predictably and avoid uncontrolled retry loops.

Retry rules

  • retry limits are explicit and job-scoped
  • workers requeue failed jobs only while retry allowance remains
  • retry count is recorded on the job record
  • repeated failure does not create a duplicate job record

Failure rules

  • terminal failure moves the job to failed
  • failure details are stored in job result metadata
  • job logs must include the failure reason
  • audit summaries must record the final failure state

Operational rules

  • handlers must be designed for idempotent re-execution
  • failure alerting is a follow-up concern and remains outside the baseline runtime
  • queue backends must preserve job status and logs across worker restarts