Machines fail to PXE Boot

Bug #1373710 reported by Andres Rodriguez
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
MAAS
Fix Released
Critical
Newell Jensen

Bug Description

Machines fail to PXE boot with the following error, I'm using latest trunk (rev 3082)

2014-09-24 21:44:03-0500 [-] Logged OOPS id OOPS-673689a6dae717f76772cc687997feb5: No exception type: No exception value
2014-09-24 21:44:03-0500 [-] Logged OOPS id OOPS-1698ffdf6cab9fdd75c5e0912bdfb11a: NoConnectionsAvailable:
2014-09-24 21:44:03-0500 [-] Logged OOPS id OOPS-552ac55de393079083137f1b967cba25: NoConnectionsAvailable:
2014-09-24 21:44:03-0500 [-] Logged OOPS id OOPS-61c83845800ff648377960c51c92ff66: No exception type: No exception value
2014-09-24 21:44:06-0500 [TFTP (UDP)] Datagram received from ('192.168.122.101', 50011): <RRQDatagram(filename=pxelinux.0, mode=octet, options={'blksize': '1432', 'tsize': '0'})>
2014-09-24 21:44:06-0500 [TFTP (UDP)] Datagram received from ('192.168.122.101', 50011): <RRQDatagram(filename=pxelinux.0, mode=octet, options={'blksize': '1432', 'tsize': '0'})>
2014-09-24 21:44:06-0500 [-] Unhandled error in Deferred:
2014-09-24 21:44:06-0500 [-] Unhandled Error
 Traceback (most recent call last):
   File "/usr/lib/python2.7/dist-packages/twisted/internet/defer.py", line 139, in maybeDeferred
     result = f(*args, **kw)
   File "/usr/lib/python2.7/dist-packages/provisioningserver/pserv_services/tftp.py", line 215, in get_reader
     description=file_name)
   File "/usr/lib/python2.7/dist-packages/provisioningserver/utils/twisted.py", line 102, in wrapper
     return func(*args, **kwargs)
   File "/usr/lib/python2.7/dist-packages/twisted/internet/defer.py", line 1237, in unwindGenerator
     return _inlineCallbacks(None, gen, Deferred())
 --- <exception caught here> ---
   File "/usr/lib/python2.7/dist-packages/twisted/internet/defer.py", line 1099, in _inlineCallbacks
     result = g.send(result)
   File "/usr/lib/python2.7/dist-packages/provisioningserver/events.py", line 155, in send_event_node_mac_address
     client = getRegionClient()
   File "/usr/lib/python2.7/dist-packages/provisioningserver/rpc/__init__.py", line 39, in getRegionClient
     return rpc_service.getClient()
   File "/usr/lib/python2.7/dist-packages/provisioningserver/rpc/clusterservice.py", line 469, in getClient
     raise exceptions.NoConnectionsAvailable()
 provisioningserver.rpc.exceptions.NoConnectionsAvailable:

2014-09-24 21:44:06-0500 [-] Unhandled error in Deferred:
2014-09-24 21:44:06-0500 [-] Unhandled Error
 Traceback (most recent call last):
   File "/usr/lib/python2.7/dist-packages/twisted/internet/defer.py", line 139, in maybeDeferred
     result = f(*args, **kw)
   File "/usr/lib/python2.7/dist-packages/provisioningserver/boot/windows.py", line 203, in get_node_info
     return request_node_info_by_mac_address(remote_mac)
   File "/usr/lib/python2.7/dist-packages/provisioningserver/utils/twisted.py", line 102, in wrapper
     return func(*args, **kwargs)
   File "/usr/lib/python2.7/dist-packages/twisted/internet/defer.py", line 1237, in unwindGenerator
     return _inlineCallbacks(None, gen, Deferred())
 --- <exception caught here> ---
   File "/usr/lib/python2.7/dist-packages/twisted/internet/defer.py", line 1099, in _inlineCallbacks
     result = g.send(result)
   File "/usr/lib/python2.7/dist-packages/provisioningserver/boot/windows.py", line 99, in request_node_info_by_mac_address
     client = getRegionClient()
   File "/usr/lib/python2.7/dist-packages/provisioningserver/rpc/__init__.py", line 39, in getRegionClient
     return rpc_service.getClient()
   File "/usr/lib/python2.7/dist-packages/provisioningserver/rpc/clusterservice.py", line 469, in getClient
     raise exceptions.NoConnectionsAvailable()
 provisioningserver.rpc.exceptions.NoConnectionsAvailable:

2014-09-24 21:44:06-0500 [-] Logged OOPS id OOPS-fed9a8914f1e0bbb9405406c03865797: No exception type: No exception value
2014-09-24 21:44:06-0500 [-] Logged OOPS id OOPS-0b36b04c5d68d48e2a12596bb3b5da9a: NoConnectionsAvailable:
2014-09-24 21:44:06-0500 [-] Logged OOPS id OOPS-4b36a0351a05fbd63fc10ed097f8e466: No exception type: No exception value
2014-09-24 21:44:06-0500 [-] Logged OOPS id OOPS-967033fdb988e3935ee82f8023707f7f: NoConnectionsAvailable:

Changed in maas:
importance: Undecided → Critical
milestone: none → 1.7.0
Changed in maas:
assignee: nobody → Newell Jensen (newell-jensen)
Revision history for this message
Blake Rouse (blake-rouse) wrote :

This is actually a side effect of bug 1373658. When pserv request the node information from the region, the region fails to handle the exception, so it drops the connections which results in this error on the cluster.

This branch was already up for review: https://code.launchpad.net/~newell-jensen/maas/fix-bug-1373658/+merge/235881

It would have fix the enlistment issue, instead of just reverting the change.

Changed in maas:
status: New → Triaged
Revision history for this message
Newell Jensen (newell-jensen) wrote :

This bug was fixed with branch lp:~newell-jensen/maas/windows-boot-method-request-pxe-once-lets-do-it-again which was merged at revision 3110

Changed in maas:
status: Triaged → Fix Committed
Changed in maas:
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.