iDRAC IPMI connections stopped working

Does anyone know what happend during the last circa 2 weeks with MAAS → iDRAC connections? We have 3 different MAAS instances running and all of them started failing on running anything via iDRAC. First thing I noticed was that I couldn’t “Release” the hosts anymore. I needed to do that for redeployment. I tried setting the maas -user password in both iDRAC and MAAS, but to no avail. Only way of getting those hosts working again was to delete them from MAAS and PXE boot them as new hosts. But I really wouldn’t like to do that on 100+ hosts since it would require me to reconfigure them all. This has never happened before, and now all MAAS instances had it coming at the same time.

I had something like this recently, where after updating the BMC and BIOS firmware of a bunch of blade servers, IPMI was no longer responding. As a result, MAAS couldn’t control them anymore. I had to switch over to using Redfish instead, and that worked fine.

Don’t know if that helps you, but it’s worth a try.

I can’t use Redfish for everything though. With some big HGX servers, Redfish doesn’t handle the longer power-on time very well, resulting in lots of confusion between Redfish and MaaS on the server’s power state (it cycles between off and on over and over).

1 Like