The 2.9 release has coincided nicely for me with adopting some new hardware to expand the cluster.
There are some pain points I’ve seen so far. These may not be 2.9-specific, and could very well be quirks of my infra setup or the new hardware, but I can’t stop to dupe-check against Launchpad right now, and I would like to write them down whilst they’re fresh:
- The existing machines on the cluster redeployed happily with Focal + KVM pods with no fuss, and some sporadic IPMI issues I’d seen seem to have vanished.
- The new hardware is not happy - PXE boot works for enlist/commission, but during deploy (at the final reboot) it bails when trying to boot from local disk with “WARN: No MBR Magic, treating disk as raw”. I’ve seen this before and imagine it needs an EFI/BIOS compatibility tweak, rather than being anything related to the PXE changes in 2.9.
- While debugging this problem I tried deploying Bionic instead, and was blocked with
"Error:hwe_kernel(hwe-18.04-edge) is older than min_hwe_kernel(hwe-20.04-edge).". I checked whether this came from me setting a specific version for commission/deploy, but changing those values didn’t seem to help.
- This one is definitely a me thing, but I hit the gotcha of MAAS clobbering disk superblocks on block devices I’d preseeded with data for use post-deploy. Is there any way I can disable that behavior, given that I know to manually resolve any problems related to unique IDs? It’s not usually a problem in my standard deploy process, but while doing repeated enlist/commission/deploy cycles, at some point I forgot to remove the preseeded disks.
- I manually added a machine to MAAS, and now can’t delete it. In the UI I see this error.