Hi,
I encounter a very similar issue but not related to libvirt in any way.
I’m deploying a new cloud based on MaaS 3.6 (snap edition) on top of Ubuntu 24.04 nodes (I had one based on MaaS 3.1/Ubuntu 20.04 before).
When deploying machines with a LACP bond with MaaS, once the machine is installed and it reboots to enable the new configuration (hence, the LACP bonded network), it seems the OS don’t has access to the network as if it was unplugged.
This makes downloadind MaaS meta data fail and make cloud-init crashes at the end of the process with the same error that we can see in @mk4umha screenshot.
Worst than that, this makes network services such as SSH fail and there is no way to connect to the machine once booted.
What is even more weird is the fact that L2/L3 network is up since I can ping this newly deployed machine from the MaaS node !
But, as for @mk4umha , there is no feedback to MaaS and deployment for this machine remains stuck in “Rebooting” status.
I tried to re-deploy the same machine with the exact same network configuration except that I changed the bond mode to “active-backup” instead of “802.3ad” and it worked, deployment was a success.
I even reconfigured the network by changing the bond mode afterwards to re-enable LACP and then, after a reboot, it worked !
This is just pure crazyness and definitely a fault, MaaS 3.6 is just unusable like this.
I attach the Netplan configuration that is applied during node installation taken form the IPMI console output.
Any help would be very welcomed
####### Working Active-Backup bond conf ##############
bonds:
bond0:
addresses:
- 10.38.130.16/24
gateway4: 10.38.130.1
interfaces:
- eno12399np0
- eno12409np1
macaddress: 14:23:f2:f4:15:10
mtu: 9000
nameservers:
addresses:
- 10.38.130.11
- 10.38.130.19
- 10.38.130.12
search:
- maas
parameters:
down-delay: 0
gratuitious-arp: 1
mii-monitor-interval: 100
mode: active-backup
transmit-hash-policy: layer2
up-delay: 0
ethernets:
eno12399np0:
match:
macaddress: 14:23:f2:f4:15:10
mtu: 9000
set-name: eno12399np0
eno12409np1:
match:
macaddress: 14:23:f2:f4:15:11
mtu: 9000
set-name: eno12409np1
version: 2
vlans:
bond0.21:
addresses:
- 10.38.135.32/24
id: 21
link: bond0
mtu: 9000
nameservers:
addresses:
- 10.38.130.11
- 10.38.130.12
- 10.38.130.13
- 10.38.130.18
search:
- maas
####### Non Working LACP bond conf ##############
bonds:
bond0:
addresses:
- 10.38.130.16/24
gateway4: 10.38.130.1
interfaces:
- eno12399np0
- eno12409np1
macaddress: 14:23:f2:f4:15:10
mtu: 9000
nameservers:
addresses:
- 10.38.130.11
- 10.38.130.19
- 10.38.130.12
search:
- maas
parameters:
down-delay: 0
lacp-rate: fast
mii-monitor-interval: 100
mode: 802.3ad
transmit-hash-policy: layer3+4
up-delay: 0
ethernets:
eno12399np0:
match:
macaddress: 14:23:f2:f4:15:10
mtu: 9000
set-name: eno12399np0
eno12409np1:
match:
macaddress: 14:23:f2:f4:15:11
mtu: 9000
set-name: eno12409np1
version: 2
vlans:
bond0.21:
addresses:
- 10.38.135.32/24
id: 21
link: bond0
mtu: 9000
nameservers:
addresses:
- 10.38.130.11
- 10.38.130.12
- 10.38.130.13
- 10.38.130.18
search:
- maas
As you can see, no big difference except the bond mode.
I don’t explain also why MaaS don’t use the same DNS servers for the bond and for the vlans added on top of the bond.
And remember what I said, re-using the same LACP conf after a successful Active-Backup deployment but edited by hand works ! So basically, this is not the Netplan config that is faulty, there is “something else”.