> internally the processor still has to have separate "8 bit" data paths > and do shifting to reorder the bytes. This is a barrel shifter in the data path that is integrated in the queue and does not take an additional execution cycle. -Michael