Node powering on but not deploying
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS | Fix Released | Critical | Unassigned | | |
Bug Description
MAAS 1.7.0~beta4+
When attempting to deploy using Juju I hit a situation where a node powered on, but then booted into the "poweroff" system.
The event log is https:/
Note the power off "44 mins ago" was from a previous environment that was "juju destroy-
Relevant chunk of /var/log/
The bootstrap started at 02:08, with the add-machine calls happening around 02:13.
The machine which misbehaved is planck, which seems to be missing a "Stopping monitor" line (which the other machines have).
The node was left Allocated, but note the inconsistency in the event log which shows it should be Deploying.
If I hit the Check Power State button then MAAS tells me that the node is powered on, but it's not responding to pings.
Related branches
- Raphaël Badin (community): Approve
- Jason Hobbs (community): Approve
Diff: 81 lines (+32/-0), 2 files modified
src/maasserver/models/node.py (+12/-0)
src/maasserver/models/tests/test_node.py (+20/-0)
description: updated
description: updated
description: updated
tags: added: cloud-installer landscape
Changed in maas:
importance: Undecided → High
status: New → Triaged
Changed in maas:
milestone: none → 1.7.0
importance: High → Medium
importance: Medium → Critical
Changed in maas:
status: Triaged → Fix Committed
Changed in maas:
status: Fix Committed → Fix Released
When deploying machines, Juju does two things: first it allocates a machine, then it starts it. My guess is that the second step only partly succeeded: powering on the node (which is part of that step) worked, but the status transition itself (from Allocated to Deploying) failed. Can you look at the django-maas.log and pserv.log files to see whether they contain a stack trace? Apache's log might also tell you whether the second API call issued by Juju (the one that should have moved the node from Allocated to Deploying) failed.
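To make the suspected failure mode concrete, here is a minimal sketch in Python. It is not MAAS code and not the fix from the linked branch: the Node class, the fail_transition flag, and the start_unsafe/start_safe helpers are all hypothetical names invented for illustration. It only shows how powering on before the status change can leave a node powered on but still Allocated if the transition fails, and how reversing the order avoids that state.

```python
from enum import Enum


class Status(Enum):
    ALLOCATED = "Allocated"
    DEPLOYING = "Deploying"


class TransitionError(Exception):
    """Raised when a status change cannot be applied."""


class Node:
    """Hypothetical stand-in for a node; not the real MAAS model."""

    def __init__(self, hostname, fail_transition=False):
        self.hostname = hostname
        self.status = Status.ALLOCATED
        self.powered_on = False
        # Assumption for the demo: lets us simulate a failing transition.
        self._fail_transition = fail_transition

    def power_on(self):
        self.powered_on = True

    def set_status(self, status):
        if self._fail_transition:
            raise TransitionError(
                f"could not move {self.hostname} to {status.value}"
            )
        self.status = status


def start_unsafe(node):
    """Power on first, then change status.

    If the status change fails, the node stays powered on but Allocated.
    """
    node.power_on()
    node.set_status(Status.DEPLOYING)


def start_safe(node):
    """Change status first, so a failed transition never powers the node on."""
    node.set_status(Status.DEPLOYING)
    node.power_on()


def is_inconsistent(node):
    """True for the symptom described here: powered on, yet still Allocated."""
    return node.powered_on and node.status is Status.ALLOCATED


def try_start(start, node):
    """Run a start function and swallow the simulated transition failure."""
    try:
        start(node)
    except TransitionError:
        pass
    return node


unsafe = try_start(start_unsafe, Node("planck", fail_transition=True))
safe = try_start(start_safe, Node("planck", fail_transition=True))
print(is_inconsistent(unsafe))  # True
print(is_inconsistent(safe))    # False
```

If the logs do show a failed second API call, a check along the lines of is_inconsistent would match what Check Power State reported: the node is on, but its recorded status never left Allocated.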