I stumbled accross this thread when looking for something else, and the issue of making workloads prefer the higher-clocked cores over lower-clocked is easily solved, see:
@numbqq if you can run some tests and confirm back to me, I will send this upstream to the kernel.