Quanta D52B-1U unable to PXE-boot in EFI mode
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS |
Incomplete
|
Undecided
|
Unassigned | ||
grub2-signed (Ubuntu) |
New
|
Undecided
|
Unassigned |
Bug Description
Certification recently received a Quanta D52B-1U server (jehan). This server enlists, commissions, and deploys fine in BIOS/CSM/legacy mode; however, in EFI/UEFI mode, it fails, hanging at the "grub>" prompt; here's a capture from an IPMI SOL session:
>>Checking Media Presence......
>>Media Present......
>>Start PXE over IPv4. Press ESC key to abort PXE boot.
Station IP address is 10.1.10.164
Server IP address is 10.1.10.2
NBP filename is bootx64.efi
NBP filesize is 1196736 Bytes
>>Checking Media Presence......
>>Media Present......
Downloading NBP file...
Succeed to download NBP file.
Fetching Netboot Image
Minimal BASH-like line editing is supported. For the first word, TAB
lists possible command completions. Anywhere else TAB lists possible
device or file completions.
grub> ls
(memdisk) (hd0) (hd1) (hd1,msdos1) (hd2) (hd2,gpt1)
grub> ls (memdisk)/
grub.cfg
grub> less (memdisk)/grub.cfg
error: can't find command `less'.
grub> cat (memdisk)/grub.cfg
if [ -e $prefix/
source $prefix/
else
source $prefix/grub.cfg
fi
grub>
The interaction at the end ("ls" and other commands at the "grub>" prompt) is me trying to identify the GRUB environment; these commands were not entered automatically.
I've tried dozens of combinations of firmware settings (enabling and disabling various options), with no success; the system always seems to fail at the same point.
I've been unable to enlist the node in EFI mode; and once enlisted in BIOS mode, commissioning it in EFI mode fails. Because commissioning sets up partitions, including the critical EFI System Partition (ESP), which are different between BIOS- and EFI-mode boots, I have been unable to test deployment in EFI mode.
Other systems have commissioned and deployed from this MAAS server both shortly before and after discovering the problem with jehan.
This bug is similar in symptoms to several other "hang-at-grub>" bugs, such as bug #1690878 and bug #1437024; however, I have yet to find a workaround, and this problem occurs 100% of the time.
I'm attaching the contents of /var/log/maas/* to this bug report. Here's the MAAS version information:
$ dpkg -l '*maas*'|cat
Desired=
| Status=
|/ Err?=(none)
||/ Name Version Architecture Description
+++-===
ii maas 2.3.0-6434-
ii maas-cert-server 0.2.30-
ii maas-cli 2.3.0-6434-
un maas-cluster-
ii maas-common 2.3.0-6434-
ii maas-dhcp 2.3.0-6434-
ii maas-dns 2.3.0-6434-
ii maas-proxy 2.3.0-6434-
ii maas-rack-
ii maas-region-api 2.3.0-6434-
ii maas-region-
un maas-region-
un python-django-maas <none> <none> (no description available)
un python-maas-client <none> <none> (no description available)
un python-
ii python3-django-maas 2.3.0-6434-
ii python3-maas-client 2.3.0-6434-
ii python3-
Jeff lane has noted that in the logs, the system PXE-boots off a8:1e:84:f2:96:c5 (which MAAS identifies as enp59s0f0), whereas in EFI mode, it uses a8:1e:84:f2:96:c6 (enp59s0f1).