My profiling shows, among others, 3 methods whose tpdiff spikes when we have more than 64 registers: processKills(), freeRegisters(), and processBlockStartLocations(). All 3 iterate over a regMaskTP and modify it as they go. The idea here is to work on the 2 regMaskSmalls separately instead of on the regMaskTP as a whole.
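
To illustrate the idea, here is a minimal standalone sketch of draining a wide mask one 64-bit half at a time. It is not the JIT code: `RegMaskSmall`, `RegMaskWide`, `lowestSetBit`, `drainHalf`, and `visitRegisters` are hypothetical names made up for this example, and the real regMaskSmall / regMaskTP definitions may differ. The assumption is that each loop iteration then only tests and updates a single 64-bit word, rather than the whole wide mask.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-ins for the JIT types (assumption, not the real definitions).
using RegMaskSmall = std::uint64_t;

struct RegMaskWide
{
    RegMaskSmall low;  // registers 0..63
    RegMaskSmall high; // registers 64..127
};

// Returns the index of the lowest set bit. The mask must be non-zero.
unsigned lowestSetBit(RegMaskSmall mask)
{
    assert(mask != 0);
    unsigned index = 0;
    while ((mask & 1) == 0)
    {
        mask >>= 1;
        ++index;
    }
    return index;
}

// Drains one 64-bit half: records (base + bit index) for every set bit and
// clears bits as it goes, so each iteration touches a single 64-bit word.
void drainHalf(RegMaskSmall& half, unsigned base, std::vector<unsigned>& visited)
{
    while (half != 0)
    {
        unsigned bit = lowestSetBit(half);
        half &= half - 1; // clear the lowest set bit
        visited.push_back(base + bit);
    }
}

// Visits all registers in the wide mask by processing the two halves separately.
std::vector<unsigned> visitRegisters(RegMaskWide mask)
{
    std::vector<unsigned> visited;
    drainHalf(mask.low, 0, visited);
    drainHalf(mask.high, 64, visited);
    return visited;
}
```

With this shape, a mask with nothing set in the high half costs one extra zero check instead of wide-mask arithmetic on every iteration.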