I cannot think about anything specific on top of my head with this limited amount of information.
Your environment seems to be a complex production with 3 region in HA and 19 racks. The “problem” behind this behavior could be literally everything and I suspect it’s unfortunately impossible to tackle it at community level.
I’d say you can open a new bug with as much information as you can. For example, you should quantify all the statements like
Because the service restarts a lot
How frequently? Does this happen on all the regions? And so on…
You can also upload an sos report that could help to triage this issue.