Really not sure how to optimize this metric? More arms seem like they need to do so much. Oh and probably reordering the outputs could save a few cycles at the end?