Timeout leads to inconsistency between maas and real world state, can't commission or start nodes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS |
Fix Released
|
Critical
|
Graham Binns |
Bug Description
maas 1.7.0~beta4+
While viewing a specific node registered with maas, I clicked on "Commission". That took a while, and eventually it showed a bland Internal server error" in a white unstyled page. The logs had several timeout errors and this was the last one:
ERROR 2014-09-30 19:17:27,675 django.request Internal Server Error: /MAAS/nodes/
Traceback (most recent call last):
File "/usr/lib/
response = wrapped_
File "/usr/lib/
return self.dispatch(
File "/usr/lib/
return super(NodeView, self).dispatch(
File "/usr/lib/
return handler(request, *args, **kwargs)
File "/usr/lib/
return super(BaseUpdat
File "/usr/lib/
return self.form_
File "/usr/lib/
self.object = form.save()
File "/usr/lib/
message = action.
File "/usr/lib/
self.
File "/usr/lib/
[self.
File "/usr/lib/
wait_
File "/usr/lib/
results = block_until_
File "/usr/lib/
return func_in_
File "/usr/lib/
result = self._result(
File "/usr/lib/
raise TimeoutError()
TimeoutError
ERROR 2014-09-30 19:17:27,686 twisted [(True, {})]
ERROR 2014-09-30 19:18:25,360 maasserver Unable to identify boot image for (ubuntu/
I then reloaded the node view page, and to my surprise it says the node is "Ready", while in fact the poweron command did work and the node is powered up and sitting at a boot prompt now:
Intel(R) Boot Agent GE v1.4.02
Copyright (C) 1997-2012, Intel Corporation
CLIENT MAC ADDR: 2C 59 E5 4A ED 90 GUID: 37313636 3735 4D58 5133 343230514459
CLIENT IP: 10.96.2.146 MASK: 255.255.0.0 DHCP IP: 10.96.0.10
GATEWAY IP: 10.96.0.1
!PXE entry point found (we hope) at 93DA:0106 via plan A
UNDI code segment at 93DA len 56A0
UNDI data segment at 8D9D len 63D0
Getting cached packet 01 02 03
My IP address seems to be 0A600292 10.96.2.146
ip=10.96.
BOOTIF=
SYSUUID=
TFTP prefix:
Trying to load: pxelinux.
APM not present.
boot:
I'm attaching maas-django.log.
The "commission" action was this one:
10.112.0.10 - - [30/Sep/
Related branches
- Gavin Panella (community): Approve
-
Diff: 79 lines (+30/-0)2 files modifiedsrc/maasserver/middleware.py (+3/-0)
src/maasserver/tests/test_middleware.py (+27/-0)
- Raphaël Badin (community): Approve
- Gavin Panella (community): Approve
-
Diff: 508 lines (+368/-17)2 files modifiedsrc/maasserver/models/node.py (+76/-13)
src/maasserver/models/tests/test_node.py (+292/-4)
Changed in maas: | |
milestone: | none → 1.7.0 |
importance: | Undecided → Critical |
summary: |
- Timeout leads to inconsistency between maas and real world state + Timeout leads to inconsistency between maas and real world state, can't + commission or start nodes |
Changed in maas: | |
assignee: | nobody → Graham Binns (gmb) |
Changed in maas: | |
status: | Incomplete → In Progress |
Changed in maas: | |
status: | In Progress → Fix Committed |
Changed in maas: | |
status: | Fix Committed → Fix Released |
Sequence of events when I click on "commission": http:// pastebin. ubuntu. com/8472925/
And the node is powered up afterwards.