Hi, I can't tell what the ultimate result of all these changes is. I see code using the per-CPU structure without indexing it, but I can't tell whether that is uniprocessor-specific. Can you please write up the intended design and behavior? Ultimately, that needs to be in the porting guide.