[fpc-devel] Current work
J. Gareth Moreton
gareth at moreton-family.com
Sat Aug 24 11:30:16 CEST 2024
Thanks Denis,
I have to be slightly careful though because the example I gave with
"addps" is not actually correct, because (%r8) could be unaligned (it
works fine with "vaddps" though due to a paradigm change in the design
of AVX compared to SSE). In this situation a more correct example would be:
movups (%rdx),%xmm0
movups (%r8),%xmm1
addps %xmm1,%xmm0
movups %xmm0,(%rcx)
ret
This, like pure functions, is another one of those big changes that need
a lot of testing and experimentation, and probably a means to turn off
if it is actually merged into the main branch. It would be good to
gather some input from the moderators and administrators on this.
Gareth aka. Kit
On 23/08/2024 20:59, denisgolovan wrote:
> Nice work, Gareth.
> Vectorization is definitely a nice improvement for the compiler.
>
>
> -- Regards,
> Denis Golovan
>
--
This email has been checked for viruses by Avast antivirus software.
www.avast.com
More information about the fpc-devel
mailing list