This is the "obvious" wasteless strategy. The area for the last few steps can probably be much lower for this method, but I have a feeling doing some kind of conditional behavior with division and rejection instead of purifying will be superior. Creating enough lead to store a bunch of qs waste also might be viable, even with the output multiplier. Having fewer arms with long track loops is worth considering