[us-commits] [ehb54/us3lims_gridctl] 2843ec: Fix race condition causing intermittent "Failed da...
emre brookes
noreply at github.com
Sat Jun 20 14:11:36 MDT 2026
Branch: refs/heads/ehb54-issue-914
Home: https://github.com/ehb54/us3lims_gridctl
Commit: 2843ec5521e3abaca87929751db9d635958a789e
https://github.com/ehb54/us3lims_gridctl/commit/2843ec5521e3abaca87929751db9d635958a789e
Author: ehb54 <brookes at uthscsa.edu>
Date: 2026-06-20 (Sat, 20 Jun 2026)
Changed paths:
M cleanup.php
M jobmonitor/cleanup.php
Log Message:
-----------
Fix race condition causing intermittent "Failed data fetch"
The analysis-results.tar scp had no retry, and the only wait/backoff
loop polled for stderr's arrival, not the tar itself. If the remote
job finished writing stderr before closing the tar, the tar fetch
silently failed and was never retried, leaving gfac.analysis.tarfile
empty even though the job succeeded.
Add a retry to the tar scp matching stdout/stderr, and a dedicated
backoff loop that re-checks for and re-fetches the tar before giving
up. Applied identically to cleanup.php and jobmonitor/cleanup.php.
Fixes ehb54/ultrascan-tickets#914
To unsubscribe from these emails, change your notification settings at https://github.com/ehb54/us3lims_gridctl/settings/notifications
More information about the us-commits
mailing list