GitRefScanJob fails to back off and retry when hosting backend returns 502/503

Bug #1797532 reported by Tom Haddon
This bug affects 1 person
Affects            Status    Importance  Assigned to  Milestone
Launchpad itself   Triaged   Critical    Unassigned

Bug Description

Over the last few weeks we've seen a number of merge proposals against Git branches that don't include a commit that is present in the repository in question. The branch scan fails and is never retried.

Tags: git lp-code oops
Colin Watson (cjwatson) wrote:

This bug report is too general to be actionable on its own. This comment is an attempt to make it actionable.

1) Ceph rebalancing activity means that git.launchpad.net has very little I/O capacity to spare. Many of the problems are due to this.
2) I've filed https://bugs.launchpad.net/turnip/+bug/1797534 for scaling issues on the turnip side.
3) Other issues with Launchpad's job runners (https://bugs.launchpad.net/launchpad/+bug/1783315 and https://bugs.launchpad.net/launchpad/+bug/1792920) can cause scan jobs to fail or be delayed.
4) Launchpad should back off and retry when the Git hosting backend returns 502/503 in response to its requests. That would make it more robust against scaling/resource issues on the hosting backend.

I suggest that we confine this bug to 4).
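
To illustrate the behaviour proposed in 4), here is a minimal sketch of retrying with exponential backoff when a request comes back with 502 or 503. It is not Launchpad's actual code: `call_with_backoff`, `HostingUnavailable`, the `request` callable, and the delay parameters are all hypothetical names and values chosen for the example.

```python
import time

# Status codes treated as transient, per item 4) above.
RETRYABLE_STATUSES = {502, 503}


class HostingUnavailable(Exception):
    """Hypothetical error: the backend kept returning 502/503."""


def call_with_backoff(request, max_attempts=5, base_delay=1.0,
                      max_delay=60.0, sleep=time.sleep):
    """Call request() until it returns a non-retryable status.

    request is a hypothetical zero-argument callable that returns a
    (status_code, body) tuple. The delay doubles after each retryable
    response, capped at max_delay.
    """
    for attempt in range(max_attempts):
        status, body = request()
        if status not in RETRYABLE_STATUSES:
            return status, body
        # Only sleep if another attempt will follow.
        if attempt < max_attempts - 1:
            sleep(min(max_delay, base_delay * 2 ** attempt))
    raise HostingUnavailable(
        "backend returned %d after %d attempts" % (status, max_attempts))
```

A real job runner would more likely reschedule the job with a delay than sleep in-process, so the sleep here only stands in for "wait before the next attempt".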

summary: - git branches don't always get correctly updated with changes
+ GitRefScanJob fails to back off and retry when hosting backend returns 502/503
tags: added: git lp-code oops
Changed in launchpad:
status: New → Triaged
importance: Undecided → Critical