<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=utf-8" http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.6001.23588">
<STYLE></STYLE>
</HEAD>
<BODY bgColor=#ffffff>
<DIV><FONT size=2 face=Arial>Looking again at your timings and mine, it appears
that you allocate 10x as many register-size items as I do, so the FillChar you
need to initialize your filter array has to cover 10x as much data.</FONT></DIV>
<DIV><FONT size=2 face=Arial></FONT> </DIV>
<DIV><FONT size=2 face=Arial>My timing is about 80 ms and yours looks like 900
ms for 10x as much register-size data. That is a ratio of roughly 11x, which
looks reasonable, since we may differ in the way we take the timings (my timing
routines may be a bit optimistic).</FONT></DIV>
<DIV><FONT size=2 face=Arial></FONT> </DIV>
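<DIV><FONT size=2 face=Arial>One way to take the difference in timing methods
out of the comparison would be to run the same small harness on both machines.
A minimal sketch, assuming Free Pascal: the program name, ItemCount and the use
of GetTickCount64 are illustrative assumptions, not the code behind the timings
above.</FONT></DIV>
<PRE>
program FillTiming;
{$mode objfpc}
uses
  SysUtils;
const
  ItemCount = 10 * 1000 * 1000;  // hypothetical size, not taken from the thread
var
  Data: array of QWord;          // register-size items on a 64-bit target
  T0: QWord;
begin
  SetLength(Data, ItemCount);    // allocates the array and zero-fills it once

  T0 := GetTickCount64;
  FillChar(Data[0], ItemCount * SizeOf(QWord), 0);  // count is in bytes
  WriteLn('FillChar:  ', GetTickCount64 - T0, ' ms');

  T0 := GetTickCount64;
  FillQWord(Data[0], ItemCount, 0);                 // count is in QWords
  WriteLn('FillQWord: ', GetTickCount64 - T0, ' ms');
end.
</PRE>
<DIV><FONT size=2 face=Arial>GetTickCount64 has a coarse resolution (often
around 10 to 16 ms on Windows), so ItemCount should be large enough that each
fill takes well above that.</FONT></DIV>
<DIV><FONT size=2 face=Arial></FONT>&nbsp;</DIV>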
<DIV><FONT size=2 face=Arial>Here I think the speed limit is the time it takes
to actually transfer the data to RAM, whether you use FillChar or FillQWord,
which end up being STOSB or STOSQ. The fill itself runs fully cached inside the
CPU, so the limiting factor is the time it takes to move the initialized
cached data out to RAM.</FONT></DIV></BODY></HTML>