fullstack job failing to create a namespace, hitting kernel deadlock
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
BaGPipe | Confirmed | High | Unassigned |
Linux | New | Undecided | Unassigned |
neutron | New | Undecided | Unassigned |
Bug Description
The networking-bagpipe fullstack job hits the following kernel issue when new tests that use more network namespaces (netns) are added:
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
Sep 01 14:48:40 ubuntu-
(The command that is blocked is an "ip netns add ..." command.)
This happens in the OpenStack CI on Ubuntu kernel 4.4.0-93-generic.
On another box (not openstack CI, ubuntu kernel 4.8.0-49), this issue seems correlated with a lot of "unregister_
description: updated
Changed in networking-bagpipe:
status: New → Confirmed
importance: Undecided → High
The "unregister_netdevice: waiting for lo to become free" logs are also present in the OpenStack CI in many kolla jobs, although these jobs don't have the kernel "task ... blocked for more than 120 seconds" message. Kolla may be hitting a different issue, or the same issue with the deadlock resolving before the 120-second limit.
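To compare jobs on this point, one option is to count both kinds of message in each job's kernel log. The following is only a rough sketch: the function name `summarize_kernel_log` and the regular expressions are assumptions derived from the two message fragments quoted in this report, not an existing tool, and the hung-task pattern assumes a task name without spaces.

```python
import re
from collections import Counter

# Patterns are assumptions based on the two kernel messages quoted in this
# report; adjust them if the local kernel formats the messages differently.
HUNG_TASK = re.compile(r"task (\S+?):(\d+) blocked for more than (\d+) seconds")
UNREGISTER_WAIT = re.compile(r"unregister_netdevice: waiting for (\S+) to become free")


def summarize_kernel_log(lines):
    """Count hung-task and unregister_netdevice messages in log lines.

    Returns a dict with two mappings:
      hung_tasks:      task name   -> number of "blocked for more than" messages
      waiting_devices: device name -> number of "waiting for ... to become free" messages
    """
    hung_tasks = Counter()
    waiting_devices = Counter()
    for line in lines:
        match = HUNG_TASK.search(line)
        if match:
            hung_tasks[match.group(1)] += 1
            continue
        match = UNREGISTER_WAIT.search(line)
        if match:
            waiting_devices[match.group(1)] += 1
    return {
        "hung_tasks": dict(hung_tasks),
        "waiting_devices": dict(waiting_devices),
    }
```

The function accepts any iterable of lines, so it can be fed an open log file or the output of `journalctl -k`. A job that reports waiting devices but no hung tasks would match the kolla pattern described above, while a job reporting both would match the networking-bagpipe one.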