Done
Pinned fields
Click on the next to a field label to start pinning.
Details
Assignee
UnassignedUnassignedReporter
Huan LinHuan LinLabels
RubinTeam
Ops MiddlewareComponents
Details
Details
Assignee
Unassigned
UnassignedReporter
Huan Lin
Huan LinLabels
RubinTeam
Ops Middleware
Components
Checklist
Checklist
Checklist
Created March 22, 2022 at 4:31 PM
Updated January 9, 2023 at 10:06 PM
Resolved January 9, 2023 at 10:06 PM
Rescue submissions for DP0.2 step3 production, done using the RSP on data-int.lsst.cloud using stack v23_0_1_rc4, have sometimes become stuck during quantum graph generation, where normally the submission process would succeed after 1.5 to 2 hours. In particular, identical submissions can be stuck one day but succeed on a different day. An example submission was initiated with more logging, using the command
and turning on sqlalchemy debug logging by adding the following to the bps submit yaml file:
The submission became stuck on a complicated SQL query during quantum graph generation, as shown in the attached log file (which has the last 1000 lines of the much larger 3.6 GB full log file). Also attached are the submission yaml file and 2 other yaml files used for the submission.