[us-commits] [ehb54/us3lims_gridctl] 2843ec: Fix race condition causing intermittent "Failed da...

emre brookes noreply at github.com
Sat Jun 20 14:11:36 MDT 2026


  Branch: refs/heads/ehb54-issue-914
  Home:   https://github.com/ehb54/us3lims_gridctl
  Commit: 2843ec5521e3abaca87929751db9d635958a789e
      https://github.com/ehb54/us3lims_gridctl/commit/2843ec5521e3abaca87929751db9d635958a789e
  Author: ehb54 <brookes at uthscsa.edu>
  Date:   2026-06-20 (Sat, 20 Jun 2026)

  Changed paths:
    M cleanup.php
    M jobmonitor/cleanup.php

  Log Message:
  -----------
  Fix race condition causing intermittent "Failed data fetch"

The analysis-results.tar scp had no retry, and the only wait/backoff
loop polled for stderr's arrival, not the tar itself. If the remote
job finished writing stderr before closing the tar, the tar fetch
silently failed and was never retried, leaving gfac.analysis.tarfile
empty even though the job succeeded.

Add a retry to the tar scp matching stdout/stderr, and a dedicated
backoff loop that re-checks for and re-fetches the tar before giving
up. Applied identically to cleanup.php and jobmonitor/cleanup.php.

Fixes ehb54/ultrascan-tickets#914



To unsubscribe from these emails, change your notification settings at https://github.com/ehb54/us3lims_gridctl/settings/notifications


More information about the us-commits mailing list