mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   Hardware (https://www.mersenneforum.org/forumdisplay.php?f=9)
-   -   AlderLake anyone? (https://www.mersenneforum.org/showthread.php?t=27112)

kriesel 2022-01-03 22:45

So, how's the AVX512 support in affordable hardware over at the house of AMD?

LordJulius 2022-01-03 23:36

[QUOTE=kriesel;597062]So, how's the AVX512 support in affordable hardware over at the house of AMD?[/QUOTE]
Look to Zen4 having AVX512 - nothing before that.

NookieN 2022-01-05 07:00

That is disappointing. From a strictly p95 perspective, my experience with AVX512 on Cascade Lake is that it's going to saturate the (4-channel) memory long before you'll realize more throughput vs AVX2.

kruoli 2022-01-05 09:48

It depends on your FFT size. For really small work, AVX-512 will still be useful. I was impressed when I saw the improvements for small FFTs between 10th Gen and 11th Gen. But I have not seen an FFT timings or throughput benchmark of Prime95 of Alder Lake in this or the benchmark thread. Is anyone able and eager to run one and share, please? If possible, with AVX-512 enabled (as long as we can test it)?

kriesel 2022-01-05 10:31

And, above exponent ~920M, the throughput on AVX2 is zero since there's no fft >50M words coded in prime95 / mprime for that, while AVX512 reaches up to ~1169M exponent via up to 64M words fft. (Mlucas is not so limited.)

Xyzzy 2022-02-02 22:57

[url]https://www.techpowerup.com/291559/msi-partially-reenables-avx-512-support-for-alder-lake-s-processors[/url]

:mike:

Xyzzy 2022-02-08 00:55

[url]https://www.mersenneforum.org/showpost.php?p=599622&postcount=28[/url]

:mike:

ixfd64 2022-03-08 18:56

Intel is going to support ECC on consumer CPUs soon: [url]https://tomshardware.com/news/intel-enables-ecc-on-12th-gen-core-cpus[/url]

However, the catch is you'll need a W680 motherboard.

Magellan3s 2022-03-27 17:39

12900k Corsair Vengeance DDR5 5200 Mhz




[code]Machine#0 (total=65555544KB, DMIProductName="System Product Name", DMIProductVersion="System Version", DMIBoardVendor="ASUSTeK COMPUTER INC.", DMIBoardName="ProArt Z690-CREATOR WIFI", DMIBoardVersion="Rev 1.xx", DMIBoardAssetTag="Default string", DMIChassisVendor="Default string", DMIChassisType=3, DMIChassisVersion="Default string", DMIChassisAssetTag="Default string", DMIBIOSVendor="American Megatrends Inc.", DMIBIOSVersion=0811, DMIBIOSDate=12/15/2021, DMISysVendor=ASUS, Backend=Linux, LinuxCgroup=/, OSName=Linux, OSRelease=5.13.0-37-generic, OSVersion="#42-Ubuntu SMP Tue Mar 15 14:34:06 UTC 2022", HostName=Magellan, Architecture=x86_64, hwlocVersion=2.4.1, ProcessName=mprime)
Package#0 (total=65555544KB, CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=151, CPUModel="12th Gen Intel(R) Core(TM) i9-12900K", CPUStepping=2)
L3 (size=30720KB, linesize=64, ways=12, Inclusive=0)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#0 (cpuset: 0x00000003)
PU#0 (cpuset: 0x00000001)
PU#1 (cpuset: 0x00000002)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#4 (cpuset: 0x0000000c)
PU#2 (cpuset: 0x00000004)
PU#3 (cpuset: 0x00000008)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#8 (cpuset: 0x00000030)
PU#4 (cpuset: 0x00000010)
PU#5 (cpuset: 0x00000020)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#12 (cpuset: 0x000000c0)
PU#6 (cpuset: 0x00000040)
PU#7 (cpuset: 0x00000080)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#16 (cpuset: 0x00000300)
PU#8 (cpuset: 0x00000100)
PU#9 (cpuset: 0x00000200)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#20 (cpuset: 0x00000c00)
PU#10 (cpuset: 0x00000400)
PU#11 (cpuset: 0x00000800)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#24 (cpuset: 0x00003000)
PU#12 (cpuset: 0x00001000)
PU#13 (cpuset: 0x00002000)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#28 (cpuset: 0x0000c000)
PU#14 (cpuset: 0x00004000)
PU#15 (cpuset: 0x00008000)
Prime95 64-bit version 30.7, RdtscTiming=1
Timings for 280K FFT length (8 cores, 1 worker): 0.11 ms. Throughput: 9386.98 iter/sec.
Timings for 280K FFT length (8 cores hyperthreaded, 1 worker): 0.11 ms. Throughput: 9246.01 iter/sec.
Timings for 288K FFT length (8 cores, 1 worker): 0.10 ms. Throughput: 9557.78 iter/sec.
Timings for 288K FFT length (8 cores hyperthreaded, 1 worker): 0.11 ms. Throughput: 9379.33 iter/sec.
Timings for 300K FFT length (8 cores, 1 worker): 0.17 ms. Throughput: 5944.67 iter/sec.
Timings for 300K FFT length (8 cores hyperthreaded, 1 worker): 0.18 ms. Throughput: 5500.07 iter/sec.
Timings for 320K FFT length (8 cores, 1 worker): 0.12 ms. Throughput: 8530.83 iter/sec.
Timings for 320K FFT length (8 cores hyperthreaded, 1 worker): 0.12 ms. Throughput: 8481.67 iter/sec.
Timings for 336K FFT length (8 cores, 1 worker): 0.17 ms. Throughput: 5945.75 iter/sec.
Timings for 336K FFT length (8 cores hyperthreaded, 1 worker): 0.18 ms. Throughput: 5645.57 iter/sec.
Timings for 360K FFT length (8 cores, 1 worker): 0.17 ms. Throughput: 5769.20 iter/sec.
Timings for 360K FFT length (8 cores hyperthreaded, 1 worker): 0.19 ms. Throughput: 5359.68 iter/sec.
Timings for 384K FFT length (8 cores, 1 worker): 0.18 ms. Throughput: 5580.35 iter/sec.
Timings for 384K FFT length (8 cores hyperthreaded, 1 worker): 0.20 ms. Throughput: 5096.85 iter/sec.
Timings for 392K FFT length (8 cores, 1 worker): 0.18 ms. Throughput: 5672.10 iter/sec.
Timings for 392K FFT length (8 cores hyperthreaded, 1 worker): 0.19 ms. Throughput: 5273.72 iter/sec.
Timings for 400K FFT length (8 cores, 1 worker): 0.16 ms. Throughput: 6278.55 iter/sec.
Timings for 400K FFT length (8 cores hyperthreaded, 1 worker): 0.16 ms. Throughput: 6234.71 iter/sec.
Timings for 420K FFT length (8 cores, 1 worker): 0.19 ms. Throughput: 5261.85 iter/sec.
Timings for 420K FFT length (8 cores hyperthreaded, 1 worker): 0.20 ms. Throughput: 5017.30 iter/sec.
Timings for 432K FFT length (8 cores, 1 worker): 0.21 ms. Throughput: 4742.76 iter/sec.
Timings for 432K FFT length (8 cores hyperthreaded, 1 worker): 0.22 ms. Throughput: 4475.20 iter/sec.
Timings for 448K FFT length (8 cores, 1 worker): 0.19 ms. Throughput: 5129.40 iter/sec.
Timings for 448K FFT length (8 cores hyperthreaded, 1 worker): 0.21 ms. Throughput: 4720.66 iter/sec.
Timings for 480K FFT length (8 cores, 1 worker): 0.21 ms. Throughput: 4719.08 iter/sec.
Timings for 480K FFT length (8 cores hyperthreaded, 1 worker): 0.21 ms. Throughput: 4740.41 iter/sec.
Timings for 504K FFT length (8 cores, 1 worker): 0.25 ms. Throughput: 4012.29 iter/sec.
Timings for 504K FFT length (8 cores hyperthreaded, 1 worker): 0.26 ms. Throughput: 3816.60 iter/sec.
Timings for 512K FFT length (8 cores, 1 worker): 0.23 ms. Throughput: 4360.39 iter/sec.
Timings for 512K FFT length (8 cores hyperthreaded, 1 worker): 0.23 ms. Throughput: 4263.01 iter/sec.
Timings for 560K FFT length (8 cores, 1 worker): 0.23 ms. Throughput: 4425.69 iter/sec.
Timings for 560K FFT length (8 cores hyperthreaded, 1 worker): 0.22 ms. Throughput: 4465.82 iter/sec.
Timings for 576K FFT length (8 cores, 1 worker): 0.25 ms. Throughput: 3973.33 iter/sec.
Timings for 576K FFT length (8 cores hyperthreaded, 1 worker): 0.26 ms. Throughput: 3866.14 iter/sec.
Timings for 588K FFT length (8 cores, 1 worker): 0.26 ms. Throughput: 3785.77 iter/sec.
Timings for 588K FFT length (8 cores hyperthreaded, 1 worker): 0.28 ms. Throughput: 3591.11 iter/sec.
Timings for 600K FFT length (8 cores, 1 worker): 0.24 ms. Throughput: 4162.44 iter/sec.
Timings for 600K FFT length (8 cores hyperthreaded, 1 worker): 0.23 ms. Throughput: 4296.47 iter/sec.
Timings for 640K FFT length (8 cores, 1 worker): 0.24 ms. Throughput: 4137.38 iter/sec.
Timings for 640K FFT length (8 cores hyperthreaded, 1 worker): 0.24 ms. Throughput: 4242.48 iter/sec.
Timings for 672K FFT length (8 cores, 1 worker): 0.30 ms. Throughput: 3328.62 iter/sec.
Timings for 672K FFT length (8 cores hyperthreaded, 1 worker): 0.30 ms. Throughput: 3356.60 iter/sec.
[Sun Mar 27 12:17:11 2022]
Timings for 720K FFT length (8 cores, 1 worker): 0.28 ms. Throughput: 3519.86 iter/sec.
Timings for 720K FFT length (8 cores hyperthreaded, 1 worker): 0.28 ms. Throughput: 3631.91 iter/sec.
Timings for 768K FFT length (8 cores, 1 worker): 0.25 ms. Throughput: 3943.02 iter/sec.
Timings for 768K FFT length (8 cores hyperthreaded, 1 worker): 0.24 ms. Throughput: 4185.08 iter/sec.
Timings for 800K FFT length (8 cores, 1 worker): 0.27 ms. Throughput: 3756.21 iter/sec.
Timings for 800K FFT length (8 cores hyperthreaded, 1 worker): 0.26 ms. Throughput: 3866.58 iter/sec.
Timings for 840K FFT length (8 cores, 1 worker): 0.33 ms. Throughput: 3047.84 iter/sec.
Timings for 840K FFT length (8 cores hyperthreaded, 1 worker): 0.32 ms. Throughput: 3110.11 iter/sec.
Timings for 864K FFT length (8 cores, 1 worker): 0.29 ms. Throughput: 3418.21 iter/sec.
Timings for 864K FFT length (8 cores hyperthreaded, 1 worker): 0.27 ms. Throughput: 3644.02 iter/sec.
Timings for 896K FFT length (8 cores, 1 worker): 0.39 ms. Throughput: 2573.33 iter/sec.
Timings for 896K FFT length (8 cores hyperthreaded, 1 worker): 0.44 ms. Throughput: 2264.96 iter/sec.
Timings for 960K FFT length (8 cores, 1 worker): 0.37 ms. Throughput: 2671.63 iter/sec.
Timings for 960K FFT length (8 cores hyperthreaded, 1 worker): 0.36 ms. Throughput: 2775.67 iter/sec.
Timings for 1000K FFT length (8 cores, 1 worker): 0.39 ms. Throughput: 2555.33 iter/sec.
Timings for 1000K FFT length (8 cores hyperthreaded, 1 worker): 0.36 ms. Throughput: 2749.40 iter/sec.
Timings for 1008K FFT length (8 cores, 1 worker): 0.32 ms. Throughput: 3142.33 iter/sec.
Timings for 1008K FFT length (8 cores hyperthreaded, 1 worker): 0.32 ms. Throughput: 3131.49 iter/sec.
Timings for 1024K FFT length (8 cores, 1 worker): 0.32 ms. Throughput: 3139.38 iter/sec.
Timings for 1024K FFT length (8 cores hyperthreaded, 1 worker): 0.31 ms. Throughput: 3243.13 iter/sec.
Timings for 1152K FFT length (8 cores, 1 worker): 0.35 ms. Throughput: 2829.56 iter/sec.
Timings for 1152K FFT length (8 cores hyperthreaded, 1 worker): 0.35 ms. Throughput: 2826.23 iter/sec.
Timings for 1200K FFT length (8 cores, 1 worker): 0.45 ms. Throughput: 2214.72 iter/sec.
Timings for 1200K FFT length (8 cores hyperthreaded, 1 worker): 0.40 ms. Throughput: 2522.22 iter/sec.
Timings for 1280K FFT length (8 cores, 1 worker): 0.41 ms. Throughput: 2462.77 iter/sec.
Timings for 1280K FFT length (8 cores hyperthreaded, 1 worker): 0.40 ms. Throughput: 2524.76 iter/sec.
Timings for 1344K FFT length (8 cores, 1 worker): 0.43 ms. Throughput: 2340.35 iter/sec.
Timings for 1344K FFT length (8 cores hyperthreaded, 1 worker): 0.42 ms. Throughput: 2400.07 iter/sec.
Timings for 1400K FFT length (8 cores, 1 worker): 0.53 ms. Throughput: 1885.26 iter/sec.
Timings for 1400K FFT length (8 cores hyperthreaded, 1 worker): 0.50 ms. Throughput: 1989.80 iter/sec.
Timings for 1440K FFT length (8 cores, 1 worker): 0.45 ms. Throughput: 2218.84 iter/sec.
Timings for 1440K FFT length (8 cores hyperthreaded, 1 worker): 0.44 ms. Throughput: 2269.81 iter/sec.
Timings for 1500K FFT length (8 cores, 1 worker): 0.57 ms. Throughput: 1741.10 iter/sec.
Timings for 1500K FFT length (8 cores hyperthreaded, 1 worker): 0.54 ms. Throughput: 1849.75 iter/sec.
Timings for 1536K FFT length (8 cores, 1 worker): 0.46 ms. Throughput: 2154.13 iter/sec.
Timings for 1536K FFT length (8 cores hyperthreaded, 1 worker): 0.47 ms. Throughput: 2130.51 iter/sec.
Timings for 1600K FFT length (8 cores, 1 worker): 0.59 ms. Throughput: 1695.36 iter/sec.
Timings for 1600K FFT length (8 cores hyperthreaded, 1 worker): 0.53 ms. Throughput: 1889.56 iter/sec.
Timings for 1680K FFT length (8 cores, 1 worker): 0.63 ms. Throughput: 1596.03 iter/sec.
Timings for 1680K FFT length (8 cores hyperthreaded, 1 worker): 0.57 ms. Throughput: 1764.23 iter/sec.
Timings for 1728K FFT length (8 cores, 1 worker): 0.62 ms. Throughput: 1606.69 iter/sec.
Timings for 1728K FFT length (8 cores hyperthreaded, 1 worker): 0.56 ms. Throughput: 1785.40 iter/sec.
Timings for 1800K FFT length (8 cores, 1 worker): 0.68 ms. Throughput: 1480.07 iter/sec.
Timings for 1800K FFT length (8 cores hyperthreaded, 1 worker): 0.61 ms. Throughput: 1633.83 iter/sec.
Timings for 1920K FFT length (8 cores, 1 worker): 0.65 ms. Throughput: 1542.94 iter/sec.
Timings for 1920K FFT length (8 cores hyperthreaded, 1 worker): 0.64 ms. Throughput: 1569.51 iter/sec.
Timings for 1960K FFT length (8 cores, 1 worker): 0.70 ms. Throughput: 1437.34 iter/sec.
Timings for 1960K FFT length (8 cores hyperthreaded, 1 worker): 0.68 ms. Throughput: 1467.12 iter/sec.
Timings for 2048K FFT length (8 cores, 1 worker): 0.62 ms. Throughput: 1602.22 iter/sec.
Timings for 2048K FFT length (8 cores hyperthreaded, 1 worker): 0.62 ms. Throughput: 1613.19 iter/sec.
Timings for 2100K FFT length (8 cores, 1 worker): 0.77 ms. Throughput: 1300.60 iter/sec.
Timings for 2100K FFT length (8 cores hyperthreaded, 1 worker): 0.72 ms. Throughput: 1386.48 iter/sec.
Timings for 2160K FFT length (8 cores, 1 worker): 0.78 ms. Throughput: 1283.86 iter/sec.
Timings for 2160K FFT length (8 cores hyperthreaded, 1 worker): 0.74 ms. Throughput: 1347.50 iter/sec.
Timings for 2240K FFT length (8 cores, 1 worker): 0.77 ms. Throughput: 1292.93 iter/sec.
Timings for 2240K FFT length (8 cores hyperthreaded, 1 worker): 0.77 ms. Throughput: 1305.98 iter/sec.
Timings for 2304K FFT length (8 cores, 1 worker): 0.79 ms. Throughput: 1270.77 iter/sec.
Timings for 2304K FFT length (8 cores hyperthreaded, 1 worker): 0.75 ms. Throughput: 1326.39 iter/sec.
Timings for 2400K FFT length (8 cores, 1 worker): 0.87 ms. Throughput: 1154.32 iter/sec.
Timings for 2400K FFT length (8 cores hyperthreaded, 1 worker): 0.78 ms. Throughput: 1276.22 iter/sec.
[Sun Mar 27 12:22:17 2022]
Timings for 2520K FFT length (8 cores, 1 worker): 0.93 ms. Throughput: 1074.85 iter/sec.
Timings for 2520K FFT length (8 cores hyperthreaded, 1 worker): 0.83 ms. Throughput: 1207.03 iter/sec.
Timings for 2560K FFT length (8 cores, 1 worker): 0.89 ms. Throughput: 1126.80 iter/sec.
Timings for 2560K FFT length (8 cores hyperthreaded, 1 worker): 0.83 ms. Throughput: 1209.37 iter/sec.
Timings for 2592K FFT length (8 cores, 1 worker): 0.93 ms. Throughput: 1078.50 iter/sec.
Timings for 2592K FFT length (8 cores hyperthreaded, 1 worker): 0.86 ms. Throughput: 1164.72 iter/sec.
Timings for 2688K FFT length (8 cores, 1 worker): 0.94 ms. Throughput: 1067.16 iter/sec.
Timings for 2688K FFT length (8 cores hyperthreaded, 1 worker): 0.89 ms. Throughput: 1122.67 iter/sec.
Timings for 2880K FFT length (8 cores, 1 worker): 0.99 ms. Throughput: 1012.53 iter/sec.
Timings for 2880K FFT length (8 cores hyperthreaded, 1 worker): 0.93 ms. Throughput: 1071.49 iter/sec.
Timings for 2940K FFT length (8 cores, 1 worker): 1.08 ms. Throughput: 929.35 iter/sec.
Timings for 2940K FFT length (8 cores hyperthreaded, 1 worker): 1.09 ms. Throughput: 916.47 iter/sec.
Timings for 3000K FFT length (8 cores, 1 worker): 1.10 ms. Throughput: 912.10 iter/sec.
Timings for 3000K FFT length (8 cores hyperthreaded, 1 worker): 1.06 ms. Throughput: 945.39 iter/sec.
Timings for 3072K FFT length (8 cores, 1 worker): 0.95 ms. Throughput: 1057.48 iter/sec.
Timings for 3072K FFT length (8 cores hyperthreaded, 1 worker): 0.90 ms. Throughput: 1114.23 iter/sec.
Timings for 3136K FFT length (8 cores, 1 worker): 1.09 ms. Throughput: 913.95 iter/sec.
Timings for 3136K FFT length (8 cores hyperthreaded, 1 worker): 1.13 ms. Throughput: 887.51 iter/sec.
Timings for 3200K FFT length (8 cores, 1 worker): 1.16 ms. Throughput: 865.61 iter/sec.
Timings for 3200K FFT length (8 cores hyperthreaded, 1 worker): 1.12 ms. Throughput: 890.03 iter/sec.
Timings for 3360K FFT length (8 cores, 1 worker): 1.17 ms. Throughput: 851.09 iter/sec.
Timings for 3360K FFT length (8 cores hyperthreaded, 1 worker): 1.09 ms. Throughput: 916.23 iter/sec.
Timings for 3456K FFT length (8 cores, 1 worker): 1.19 ms. Throughput: 837.42 iter/sec.
Timings for 3456K FFT length (8 cores hyperthreaded, 1 worker): 1.14 ms. Throughput: 877.30 iter/sec.
Timings for 3600K FFT length (8 cores, 1 worker): 1.37 ms. Throughput: 731.53 iter/sec.
Timings for 3600K FFT length (8 cores hyperthreaded, 1 worker): 1.36 ms. Throughput: 737.57 iter/sec.
Timings for 3840K FFT length (8 cores, 1 worker): 1.39 ms. Throughput: 721.38 iter/sec.
Timings for 3840K FFT length (8 cores hyperthreaded, 1 worker): 1.36 ms. Throughput: 733.49 iter/sec.
Timings for 3920K FFT length (8 cores, 1 worker): 1.44 ms. Throughput: 695.09 iter/sec.
Timings for 3920K FFT length (8 cores hyperthreaded, 1 worker): 1.45 ms. Throughput: 689.96 iter/sec.
Timings for 4032K FFT length (8 cores, 1 worker): 1.38 ms. Throughput: 724.98 iter/sec.
Timings for 4032K FFT length (8 cores hyperthreaded, 1 worker): 1.34 ms. Throughput: 744.70 iter/sec.
Timings for 4200K FFT length (8 cores, 1 worker): 1.51 ms. Throughput: 661.83 iter/sec.
Timings for 4200K FFT length (8 cores hyperthreaded, 1 worker): 1.48 ms. Throughput: 675.92 iter/sec.
Timings for 4320K FFT length (8 cores, 1 worker): 1.63 ms. Throughput: 612.39 iter/sec.
Timings for 4320K FFT length (8 cores hyperthreaded, 1 worker): 1.63 ms. Throughput: 611.89 iter/sec.
Timings for 4480K FFT length (8 cores, 1 worker): 1.66 ms. Throughput: 601.79 iter/sec.
Timings for 4480K FFT length (8 cores hyperthreaded, 1 worker): 1.65 ms. Throughput: 604.46 iter/sec.
Timings for 4608K FFT length (8 cores, 1 worker): 1.66 ms. Throughput: 601.95 iter/sec.
Timings for 4608K FFT length (8 cores hyperthreaded, 1 worker): 1.69 ms. Throughput: 590.82 iter/sec.
Timings for 4704K FFT length (8 cores, 1 worker): 1.77 ms. Throughput: 564.45 iter/sec.
Timings for 4704K FFT length (8 cores hyperthreaded, 1 worker): 1.79 ms. Throughput: 559.79 iter/sec.
Timings for 4800K FFT length (8 cores, 1 worker): 1.94 ms. Throughput: 516.56 iter/sec.
Timings for 4800K FFT length (8 cores hyperthreaded, 1 worker): 2.14 ms. Throughput: 466.37 iter/sec.
Timings for 5040K FFT length (8 cores, 1 worker): 1.79 ms. Throughput: 559.48 iter/sec.
Timings for 5040K FFT length (8 cores hyperthreaded, 1 worker): 1.81 ms. Throughput: 552.19 iter/sec.
Timings for 5120K FFT length (8 cores, 1 worker): 1.84 ms. Throughput: 543.16 iter/sec.
Timings for 5120K FFT length (8 cores hyperthreaded, 1 worker): 1.93 ms. Throughput: 519.06 iter/sec.
Timings for 5184K FFT length (8 cores, 1 worker): 1.97 ms. Throughput: 506.66 iter/sec.
Timings for 5184K FFT length (8 cores hyperthreaded, 1 worker): 2.06 ms. Throughput: 486.16 iter/sec.
Timings for 5376K FFT length (8 cores, 1 worker): 1.97 ms. Throughput: 507.73 iter/sec.
Timings for 5376K FFT length (8 cores hyperthreaded, 1 worker): 2.06 ms. Throughput: 485.58 iter/sec.
Timings for 5760K FFT length (8 cores, 1 worker): 2.39 ms. Throughput: 418.50 iter/sec.
Timings for 5760K FFT length (8 cores hyperthreaded, 1 worker): 2.67 ms. Throughput: 375.08 iter/sec.
Timings for 6048K FFT length (8 cores, 1 worker): 2.31 ms. Throughput: 432.38 iter/sec.
Timings for 6048K FFT length (8 cores hyperthreaded, 1 worker): 2.51 ms. Throughput: 397.76 iter/sec.
Timings for 6144K FFT length (8 cores, 1 worker): 2.37 ms. Throughput: 422.80 iter/sec.
Timings for 6144K FFT length (8 cores hyperthreaded, 1 worker): 2.49 ms. Throughput: 402.26 iter/sec.
Timings for 6272K FFT length (8 cores, 1 worker): 2.43 ms. Throughput: 411.26 iter/sec.
[Sun Mar 27 12:27:23 2022]
Timings for 6272K FFT length (8 cores hyperthreaded, 1 worker): 2.52 ms. Throughput: 397.24 iter/sec.
Timings for 6400K FFT length (8 cores, 1 worker): 2.52 ms. Throughput: 396.96 iter/sec.
Timings for 6400K FFT length (8 cores hyperthreaded, 1 worker): 2.87 ms. Throughput: 348.99 iter/sec.
Timings for 6720K FFT length (8 cores, 1 worker): 2.74 ms. Throughput: 364.69 iter/sec.
Timings for 6720K FFT length (8 cores hyperthreaded, 1 worker): 3.08 ms. Throughput: 324.51 iter/sec.
Timings for 7056K FFT length (8 cores, 1 worker): 2.91 ms. Throughput: 343.47 iter/sec.
Timings for 7056K FFT length (8 cores hyperthreaded, 1 worker): 3.24 ms. Throughput: 308.85 iter/sec.
Timings for 7168K FFT length (8 cores, 1 worker): 2.91 ms. Throughput: 343.41 iter/sec.
Timings for 7168K FFT length (8 cores hyperthreaded, 1 worker): 3.13 ms. Throughput: 319.28 iter/sec.
Timings for 7200K FFT length (8 cores, 1 worker): 2.89 ms. Throughput: 346.59 iter/sec.
Timings for 7200K FFT length (8 cores hyperthreaded, 1 worker): 3.03 ms. Throughput: 330.57 iter/sec.
Timings for 7680K FFT length (8 cores, 1 worker): 3.17 ms. Throughput: 315.58 iter/sec.
Timings for 7680K FFT length (8 cores hyperthreaded, 1 worker): 3.26 ms. Throughput: 306.83 iter/sec.
Timings for 8064K FFT length (8 cores, 1 worker): 3.53 ms. Throughput: 283.34 iter/sec.
Timings for 8064K FFT length (8 cores hyperthreaded, 1 worker): 4.13 ms. Throughput: 242.08 iter/sec.
Timings for 8400K FFT length (8 cores, 1 worker): 3.56 ms. Throughput: 280.82 iter/sec.
Timings for 8400K FFT length (8 cores hyperthreaded, 1 worker): 4.06 ms. Throughput: 246.26 iter/sec.
Timings for 8640K FFT length (8 cores, 1 worker): 3.68 ms. Throughput: 272.01 iter/sec.
Timings for 8640K FFT length (8 cores hyperthreaded, 1 worker): 4.06 ms. Throughput: 246.53 iter/sec.
Timings for 8960K FFT length (8 cores, 1 worker): 3.83 ms. Throughput: 260.80 iter/sec.
Timings for 8960K FFT length (8 cores hyperthreaded, 1 worker): 4.12 ms. Throughput: 242.73 iter/sec.
Timings for 9600K FFT length (8 cores, 1 worker): 4.46 ms. Throughput: 224.31 iter/sec.
Timings for 9600K FFT length (8 cores hyperthreaded, 1 worker): 5.17 ms. Throughput: 193.57 iter/sec.
Timings for 10240K FFT length (8 cores, 1 worker): 4.57 ms. Throughput: 219.06 iter/sec.
Timings for 10240K FFT length (8 cores hyperthreaded, 1 worker): 5.61 ms. Throughput: 178.20 iter/sec.
Timings for 10368K FFT length (8 cores, 1 worker): 4.59 ms. Throughput: 217.75 iter/sec.
Timings for 10368K FFT length (8 cores hyperthreaded, 1 worker): 5.07 ms. Throughput: 197.30 iter/sec.
Timings for 11200K FFT length (8 cores, 1 worker): 5.18 ms. Throughput: 192.88 iter/sec.
Timings for 11200K FFT length (8 cores hyperthreaded, 1 worker): 6.30 ms. Throughput: 158.75 iter/sec.
Timings for 11520K FFT length (8 cores, 1 worker): 5.24 ms. Throughput: 191.02 iter/sec.
Timings for 11520K FFT length (8 cores hyperthreaded, 1 worker): 5.83 ms. Throughput: 171.57 iter/sec.
Timings for 12288K FFT length (8 cores, 1 worker): 5.52 ms. Throughput: 181.04 iter/sec.
Timings for 12288K FFT length (8 cores hyperthreaded, 1 worker): 6.35 ms. Throughput: 157.54 iter/sec.
Timings for 12800K FFT length (8 cores, 1 worker): 5.81 ms. Throughput: 172.07 iter/sec.
Timings for 12800K FFT length (8 cores hyperthreaded, 1 worker): 6.76 ms. Throughput: 148.01 iter/sec.
Timings for 13440K FFT length (8 cores, 1 worker): 6.14 ms. Throughput: 162.83 iter/sec.
Timings for 13440K FFT length (8 cores hyperthreaded, 1 worker): 6.92 ms. Throughput: 144.48 iter/sec.
Timings for 14400K FFT length (8 cores, 1 worker): 6.53 ms. Throughput: 153.08 iter/sec.
Timings for 14400K FFT length (8 cores hyperthreaded, 1 worker): 7.37 ms. Throughput: 135.69 iter/sec.
Timings for 15360K FFT length (8 cores, 1 worker): 7.05 ms. Throughput: 141.85 iter/sec.
Timings for 15360K FFT length (8 cores hyperthreaded, 1 worker): 8.03 ms. Throughput: 124.53 iter/sec.
Timings for 15680K FFT length (8 cores, 1 worker): 7.49 ms. Throughput: 133.45 iter/sec.
Timings for 15680K FFT length (8 cores hyperthreaded, 1 worker): 9.94 ms. Throughput: 100.60 iter/sec.
Timings for 16128K FFT length (8 cores, 1 worker): 7.46 ms. Throughput: 133.96 iter/sec.
Timings for 16128K FFT length (8 cores hyperthreaded, 1 worker): 8.80 ms. Throughput: 113.58 iter/sec.
Timings for 16384K FFT length (8 cores, 1 worker): 7.61 ms. Throughput: 131.38 iter/sec.
Timings for 16384K FFT length (8 cores hyperthreaded, 1 worker): 9.33 ms. Throughput: 107.22 iter/sec.
Timings for 16800K FFT length (8 cores, 1 worker): 7.75 ms. Throughput: 128.99 iter/sec.
Timings for 16800K FFT length (8 cores hyperthreaded, 1 worker): 8.91 ms. Throughput: 112.17 iter/sec.
Timings for 17280K FFT length (8 cores, 1 worker): 7.96 ms. Throughput: 125.60 iter/sec.
Timings for 17280K FFT length (8 cores hyperthreaded, 1 worker): 9.18 ms. Throughput: 108.90 iter/sec.
Timings for 17920K FFT length (8 cores, 1 worker): 8.32 ms. Throughput: 120.16 iter/sec.
Timings for 17920K FFT length (8 cores hyperthreaded, 1 worker): 9.81 ms. Throughput: 101.92 iter/sec.
Timings for 18432K FFT length (8 cores, 1 worker): 8.84 ms. Throughput: 113.16 iter/sec.
Timings for 18432K FFT length (8 cores hyperthreaded, 1 worker): 10.82 ms. Throughput: 92.46 iter/sec.
[Sun Mar 27 12:32:28 2022]
Timings for 18816K FFT length (8 cores, 1 worker): 9.23 ms. Throughput: 108.38 iter/sec.
Timings for 18816K FFT length (8 cores hyperthreaded, 1 worker): 12.69 ms. Throughput: 78.79 iter/sec.
Timings for 19200K FFT length (8 cores, 1 worker): 9.05 ms. Throughput: 110.49 iter/sec.
Timings for 19200K FFT length (8 cores hyperthreaded, 1 worker): 10.67 ms. Throughput: 93.75 iter/sec.
Timings for 20160K FFT length (8 cores, 1 worker): 9.37 ms. Throughput: 106.75 iter/sec.
Timings for 20160K FFT length (8 cores hyperthreaded, 1 worker): 11.16 ms. Throughput: 89.63 iter/sec.
Timings for 20480K FFT length (8 cores, 1 worker): 10.25 ms. Throughput: 97.58 iter/sec.
Timings for 20480K FFT length (8 cores hyperthreaded, 1 worker): 12.43 ms. Throughput: 80.42 iter/sec.
Timings for 20736K FFT length (8 cores, 1 worker): 9.75 ms. Throughput: 102.53 iter/sec.
Timings for 20736K FFT length (8 cores hyperthreaded, 1 worker): 11.93 ms. Throughput: 83.80 iter/sec.
Timings for 21504K FFT length (8 cores, 1 worker): 10.67 ms. Throughput: 93.76 iter/sec.
Timings for 21504K FFT length (8 cores hyperthreaded, 1 worker): 13.04 ms. Throughput: 76.66 iter/sec.
Timings for 21952K FFT length (8 cores, 1 worker): 10.48 ms. Throughput: 95.46 iter/sec.
Timings for 21952K FFT length (8 cores hyperthreaded, 1 worker): 13.11 ms. Throughput: 76.26 iter/sec.
Timings for 22400K FFT length (8 cores, 1 worker): 10.62 ms. Throughput: 94.13 iter/sec.
Timings for 22400K FFT length (8 cores hyperthreaded, 1 worker): 13.08 ms. Throughput: 76.44 iter/sec.
Timings for 23520K FFT length (8 cores, 1 worker): 11.02 ms. Throughput: 90.76 iter/sec.
Timings for 23520K FFT length (8 cores hyperthreaded, 1 worker): 13.70 ms. Throughput: 72.99 iter/sec.
Timings for 24192K FFT length (8 cores, 1 worker): 11.52 ms. Throughput: 86.80 iter/sec.
Timings for 24192K FFT length (8 cores hyperthreaded, 1 worker): 14.54 ms. Throughput: 68.78 iter/sec.
Timings for 24576K FFT length (8 cores, 1 worker): 11.81 ms. Throughput: 84.65 iter/sec.
Timings for 24576K FFT length (8 cores hyperthreaded, 1 worker): 15.21 ms. Throughput: 65.74 iter/sec.
Timings for 25088K FFT length (8 cores, 1 worker): 12.05 ms. Throughput: 82.97 iter/sec.
Timings for 25088K FFT length (8 cores hyperthreaded, 1 worker): 15.49 ms. Throughput: 64.58 iter/sec.
Timings for 25600K FFT length (8 cores, 1 worker): 12.39 ms. Throughput: 80.72 iter/sec.
Timings for 25600K FFT length (8 cores hyperthreaded, 1 worker): 15.77 ms. Throughput: 63.42 iter/sec.
Timings for 26880K FFT length (8 cores, 1 worker): 13.05 ms. Throughput: 76.61 iter/sec.
Timings for 26880K FFT length (8 cores hyperthreaded, 1 worker): 16.56 ms. Throughput: 60.39 iter/sec.
Timings for 27648K FFT length (8 cores, 1 worker): 13.37 ms. Throughput: 74.81 iter/sec.
Timings for 27648K FFT length (8 cores hyperthreaded, 1 worker): 17.39 ms. Throughput: 57.49 iter/sec.
Timings for 28224K FFT length (8 cores, 1 worker): 13.64 ms. Throughput: 73.33 iter/sec.
Timings for 28224K FFT length (8 cores hyperthreaded, 1 worker): 17.83 ms. Throughput: 56.10 iter/sec.
Timings for 28800K FFT length (8 cores, 1 worker): 13.63 ms. Throughput: 73.37 iter/sec.
Timings for 28800K FFT length (8 cores hyperthreaded, 1 worker): 15.13 ms. Throughput: 66.08 iter/sec.
Timings for 30720K FFT length (8 cores, 1 worker): 15.26 ms. Throughput: 65.51 iter/sec.
Timings for 30720K FFT length (8 cores hyperthreaded, 1 worker): 20.17 ms. Throughput: 49.59 iter/sec.
Timings for 31360K FFT length (8 cores, 1 worker): 15.46 ms. Throughput: 64.67 iter/sec.
Timings for 31360K FFT length (8 cores hyperthreaded, 1 worker): 20.59 ms. Throughput: 48.56 iter/sec.
Timings for 32256K FFT length (8 cores, 1 worker): 15.74 ms. Throughput: 63.53 iter/sec.
Timings for 32256K FFT length (8 cores hyperthreaded, 1 worker): 21.23 ms. Throughput: 47.11 iter/sec.[/code]

Magellan3s 2022-03-28 01:01

[QUOTE=kruoli;602721]Wonderful, thank you! :smile: Would you mind running some single threaded (1 worker, 1 thread, no HT benchmarking) tests, starting from 1K?[/QUOTE]

Part 1
[code] Machine#0 (total=65555544KB, DMIProductName="System Product Name", DMIProductVersion="System Version", DMIBoardVendor="ASUSTeK COMPUTER INC.", DMIBoardName="ProArt Z690-CREATOR WIFI", DMIBoardVersion="Rev 1.xx", DMIBoardAssetTag="Default string", DMIChassisVendor="Default string", DMIChassisType=3, DMIChassisVersion="Default string", DMIChassisAssetTag="Default string", DMIBIOSVendor="American Megatrends Inc.", DMIBIOSVersion=0811, DMIBIOSDate=12/15/2021, DMISysVendor=ASUS, Backend=Linux, LinuxCgroup=/, OSName=Linux, OSRelease=5.13.0-37-generic, OSVersion="#42-Ubuntu SMP Tue Mar 15 14:34:06 UTC 2022", HostName=Magellan, Architecture=x86_64, hwlocVersion=2.4.1, ProcessName=mprime)
Package#0 (total=65555544KB, CPUVendor=GenuineIntel, CPUFamilyNumber=6, CPUModelNumber=151, CPUModel="12th Gen Intel(R) Core(TM) i9-12900K", CPUStepping=2)
L3 (size=30720KB, linesize=64, ways=12, Inclusive=0)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#0 (cpuset: 0x00000003)
PU#0 (cpuset: 0x00000001)
PU#1 (cpuset: 0x00000002)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#4 (cpuset: 0x0000000c)
PU#2 (cpuset: 0x00000004)
PU#3 (cpuset: 0x00000008)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#8 (cpuset: 0x00000030)
PU#4 (cpuset: 0x00000010)
PU#5 (cpuset: 0x00000020)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#12 (cpuset: 0x000000c0)
PU#6 (cpuset: 0x00000040)
PU#7 (cpuset: 0x00000080)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#16 (cpuset: 0x00000300)
PU#8 (cpuset: 0x00000100)
PU#9 (cpuset: 0x00000200)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#20 (cpuset: 0x00000c00)
PU#10 (cpuset: 0x00000400)
PU#11 (cpuset: 0x00000800)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#24 (cpuset: 0x00003000)
PU#12 (cpuset: 0x00001000)
PU#13 (cpuset: 0x00002000)
L2 (size=1280KB, linesize=64, ways=10, Inclusive=0)
L1d (size=48KB, linesize=64, ways=12, Inclusive=0)
Core#28 (cpuset: 0x0000c000)
PU#14 (cpuset: 0x00004000)
PU#15 (cpuset: 0x00008000)
Prime95 64-bit version 30.7, RdtscTiming=1
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=128, Pass2=8192, clm=4 (1 core, 1 worker): 0.32 ms. Throughput: 3151.93 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=128, Pass2=8192, clm=2 (1 core, 1 worker): 0.32 ms. Throughput: 3087.37 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=128, Pass2=8192, clm=1 (1 core, 1 worker): 0.37 ms. Throughput: 2671.78 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1024, clm=2 (1 core, 1 worker): 0.39 ms. Throughput: 2534.63 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1024, clm=1 (1 core, 1 worker): 0.38 ms. Throughput: 2609.84 iter/sec.
FFTlen=1024K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=512, clm=1 (1 core, 1 worker): 0.45 ms. Throughput: 2222.52 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=128, Pass2=9216, clm=4 (1 core, 1 worker): 0.35 ms. Throughput: 2818.22 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=128, Pass2=9216, clm=2 (1 core, 1 worker): 0.37 ms. Throughput: 2700.36 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=128, Pass2=9216, clm=1 (1 core, 1 worker): 0.40 ms. Throughput: 2474.46 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6144, clm=4 (1 core, 1 worker): 0.37 ms. Throughput: 2696.36 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6144, clm=2 (1 core, 1 worker): 0.38 ms. Throughput: 2662.90 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6144, clm=1 (1 core, 1 worker): 0.38 ms. Throughput: 2649.28 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1024, clm=2 (1 core, 1 worker): 0.44 ms. Throughput: 2281.93 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1024, clm=1 (1 core, 1 worker): 0.44 ms. Throughput: 2272.86 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=768, clm=2 (1 core, 1 worker): 0.43 ms. Throughput: 2323.97 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=768, clm=1 (1 core, 1 worker): 0.42 ms. Throughput: 2381.80 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=512, clm=1 (1 core, 1 worker): 0.51 ms. Throughput: 1947.46 iter/sec.
FFTlen=1152K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=384, clm=1 (1 core, 1 worker): 0.58 ms. Throughput: 1738.65 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6400, clm=4 (1 core, 1 worker): 0.38 ms. Throughput: 2598.30 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6400, clm=2 (1 core, 1 worker): 0.39 ms. Throughput: 2549.45 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=192, Pass2=6400, clm=1 (1 core, 1 worker): 0.41 ms. Throughput: 2448.53 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=1920, clm=4 (1 core, 1 worker): 0.46 ms. Throughput: 2196.62 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=1920, clm=2 (1 core, 1 worker): 0.45 ms. Throughput: 2232.31 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=1920, clm=1 (1 core, 1 worker): 0.44 ms. Throughput: 2265.45 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1600, clm=4 (1 core, 1 worker): 0.47 ms. Throughput: 2141.49 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1600, clm=2 (1 core, 1 worker): 0.47 ms. Throughput: 2125.35 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1600, clm=1 (1 core, 1 worker): 0.46 ms. Throughput: 2179.10 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=640, clm=2 (1 core, 1 worker): 0.49 ms. Throughput: 2061.25 iter/sec.
FFTlen=1200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=640, clm=1 (1 core, 1 worker): 0.48 ms. Throughput: 2102.26 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=128, Pass2=10240, clm=4 (1 core, 1 worker): 0.39 ms. Throughput: 2537.50 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=128, Pass2=10240, clm=2 (1 core, 1 worker): 0.41 ms. Throughput: 2457.71 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=128, Pass2=10240, clm=1 (1 core, 1 worker): 0.45 ms. Throughput: 2218.23 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1024, clm=2 (1 core, 1 worker): 0.48 ms. Throughput: 2075.28 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1024, clm=1 (1 core, 1 worker): 0.49 ms. Throughput: 2044.03 iter/sec.
FFTlen=1280K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=640, clm=1 (1 core, 1 worker): 0.49 ms. Throughput: 2024.64 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7168, clm=4 (1 core, 1 worker): 0.43 ms. Throughput: 2329.84 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7168, clm=2 (1 core, 1 worker): 0.43 ms. Throughput: 2322.22 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7168, clm=1 (1 core, 1 worker): 0.46 ms. Throughput: 2189.27 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1024, clm=2 (1 core, 1 worker): 0.53 ms. Throughput: 1899.91 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1024, clm=1 (1 core, 1 worker): 0.52 ms. Throughput: 1925.47 iter/sec.
FFTlen=1344K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=448, clm=1 (1 core, 1 worker): 0.63 ms. Throughput: 1599.61 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2240, clm=4 (1 core, 1 worker): 0.52 ms. Throughput: 1908.48 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2240, clm=2 (1 core, 1 worker): 0.52 ms. Throughput: 1921.31 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2240, clm=1 (1 core, 1 worker): 0.52 ms. Throughput: 1925.29 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1600, clm=4 (1 core, 1 worker): 0.57 ms. Throughput: 1756.84 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1600, clm=2 (1 core, 1 worker): 0.54 ms. Throughput: 1842.64 iter/sec.
FFTlen=1400K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1600, clm=1 (1 core, 1 worker): 0.54 ms. Throughput: 1841.53 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7680, clm=4 (1 core, 1 worker): 0.45 ms. Throughput: 2215.36 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7680, clm=2 (1 core, 1 worker): 0.46 ms. Throughput: 2173.71 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=192, Pass2=7680, clm=1 (1 core, 1 worker): 0.48 ms. Throughput: 2071.58 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2304, clm=4 (1 core, 1 worker): 0.52 ms. Throughput: 1919.41 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2304, clm=2 (1 core, 1 worker): 0.52 ms. Throughput: 1912.21 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2304, clm=1 (1 core, 1 worker): 0.52 ms. Throughput: 1929.25 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1920, clm=4 (1 core, 1 worker): 0.55 ms. Throughput: 1818.95 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1920, clm=2 (1 core, 1 worker): 0.54 ms. Throughput: 1840.91 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=768, Pass2=1920, clm=1 (1 core, 1 worker): 0.54 ms. Throughput: 1868.83 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=768, clm=2 (1 core, 1 worker): 0.52 ms. Throughput: 1909.66 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=768, clm=1 (1 core, 1 worker): 0.51 ms. Throughput: 1945.57 iter/sec.
FFTlen=1440K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=640, clm=1 (1 core, 1 worker): 0.57 ms. Throughput: 1766.68 iter/sec.
[Sun Mar 27 19:05:54 2022]
FFTlen=1500K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1600, clm=4 (1 core, 1 worker): 0.62 ms. Throughput: 1603.04 iter/sec.
FFTlen=1500K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1600, clm=2 (1 core, 1 worker): 0.58 ms. Throughput: 1718.72 iter/sec.
FFTlen=1500K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1600, clm=1 (1 core, 1 worker): 0.58 ms. Throughput: 1722.72 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12288, clm=4 (1 core, 1 worker): 0.47 ms. Throughput: 2127.05 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12288, clm=2 (1 core, 1 worker): 0.48 ms. Throughput: 2091.64 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12288, clm=1 (1 core, 1 worker): 0.52 ms. Throughput: 1922.43 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=192, Pass2=8192, clm=4 (1 core, 1 worker): 0.48 ms. Throughput: 2087.15 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=192, Pass2=8192, clm=2 (1 core, 1 worker): 0.50 ms. Throughput: 2000.05 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=192, Pass2=8192, clm=1 (1 core, 1 worker): 0.52 ms. Throughput: 1906.82 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1024, clm=2 (1 core, 1 worker): 0.61 ms. Throughput: 1644.07 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1024, clm=1 (1 core, 1 worker): 0.58 ms. Throughput: 1724.51 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=768, clm=1 (1 core, 1 worker): 0.55 ms. Throughput: 1809.02 iter/sec.
FFTlen=1536K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=512, clm=1 (1 core, 1 worker): 0.70 ms. Throughput: 1419.95 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12800, clm=4 (1 core, 1 worker): 0.54 ms. Throughput: 1836.55 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12800, clm=2 (1 core, 1 worker): 0.56 ms. Throughput: 1795.41 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=128, Pass2=12800, clm=1 (1 core, 1 worker): 0.60 ms. Throughput: 1658.66 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2560, clm=4 (1 core, 1 worker): 0.59 ms. Throughput: 1704.47 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2560, clm=2 (1 core, 1 worker): 0.57 ms. Throughput: 1765.59 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2560, clm=1 (1 core, 1 worker): 0.57 ms. Throughput: 1763.10 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1600, clm=2 (1 core, 1 worker): 0.63 ms. Throughput: 1592.88 iter/sec.
FFTlen=1600K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1600, clm=1 (1 core, 1 worker): 0.62 ms. Throughput: 1612.98 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2688, clm=4 (1 core, 1 worker): 0.63 ms. Throughput: 1587.84 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2688, clm=2 (1 core, 1 worker): 0.60 ms. Throughput: 1665.82 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=2688, clm=1 (1 core, 1 worker): 0.60 ms. Throughput: 1661.03 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2240, clm=4 (1 core, 1 worker): 0.64 ms. Throughput: 1553.56 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2240, clm=2 (1 core, 1 worker): 0.63 ms. Throughput: 1599.48 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2240, clm=1 (1 core, 1 worker): 0.61 ms. Throughput: 1636.97 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1920, clm=4 (1 core, 1 worker): 0.65 ms. Throughput: 1549.72 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1920, clm=2 (1 core, 1 worker): 0.63 ms. Throughput: 1587.33 iter/sec.
FFTlen=1680K all-complex, Type=3, Arch=8, Pass1=896, Pass2=1920, clm=1 (1 core, 1 worker): 0.63 ms. Throughput: 1596.50 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=192, Pass2=9216, clm=4 (1 core, 1 worker): 0.53 ms. Throughput: 1878.73 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=192, Pass2=9216, clm=2 (1 core, 1 worker): 0.54 ms. Throughput: 1866.64 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=192, Pass2=9216, clm=1 (1 core, 1 worker): 0.57 ms. Throughput: 1764.81 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2304, clm=4 (1 core, 1 worker): 0.65 ms. Throughput: 1546.72 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2304, clm=2 (1 core, 1 worker): 0.64 ms. Throughput: 1563.21 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2304, clm=1 (1 core, 1 worker): 0.61 ms. Throughput: 1647.41 iter/sec.
FFTlen=1728K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=768, clm=1 (1 core, 1 worker): 0.61 ms. Throughput: 1636.96 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1920, clm=4 (1 core, 1 worker): 0.68 ms. Throughput: 1462.01 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1920, clm=2 (1 core, 1 worker): 0.68 ms. Throughput: 1472.04 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=1920, clm=1 (1 core, 1 worker): 0.66 ms. Throughput: 1510.16 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1600, clm=2 (1 core, 1 worker): 0.68 ms. Throughput: 1475.99 iter/sec.
FFTlen=1800K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1600, clm=1 (1 core, 1 worker): 0.69 ms. Throughput: 1444.21 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=128, Pass2=15360, clm=4 (1 core, 1 worker): 0.65 ms. Throughput: 1531.00 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=128, Pass2=15360, clm=2 (1 core, 1 worker): 0.66 ms. Throughput: 1503.96 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=128, Pass2=15360, clm=1 (1 core, 1 worker): 0.72 ms. Throughput: 1380.27 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=192, Pass2=10240, clm=4 (1 core, 1 worker): 0.61 ms. Throughput: 1647.53 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=192, Pass2=10240, clm=2 (1 core, 1 worker): 0.59 ms. Throughput: 1685.91 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=192, Pass2=10240, clm=1 (1 core, 1 worker): 0.63 ms. Throughput: 1577.13 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3072, clm=4 (1 core, 1 worker): 0.67 ms. Throughput: 1490.67 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3072, clm=2 (1 core, 1 worker): 0.64 ms. Throughput: 1553.91 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3072, clm=1 (1 core, 1 worker): 0.63 ms. Throughput: 1575.76 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2560, clm=4 (1 core, 1 worker): 0.70 ms. Throughput: 1419.14 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2560, clm=2 (1 core, 1 worker): 0.70 ms. Throughput: 1426.63 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2560, clm=1 (1 core, 1 worker): 0.68 ms. Throughput: 1469.57 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1920, clm=2 (1 core, 1 worker): 0.71 ms. Throughput: 1406.58 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=1920, clm=1 (1 core, 1 worker): 0.70 ms. Throughput: 1432.09 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1024, clm=2 (1 core, 1 worker): 0.77 ms. Throughput: 1304.55 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1024, clm=1 (1 core, 1 worker): 0.73 ms. Throughput: 1371.78 iter/sec.
FFTlen=1920K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=640, clm=1 (1 core, 1 worker): 0.77 ms. Throughput: 1295.06 iter/sec.
[Sun Mar 27 19:10:55 2022]
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3136, clm=4 (1 core, 1 worker): 0.72 ms. Throughput: 1384.18 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3136, clm=2 (1 core, 1 worker): 0.71 ms. Throughput: 1416.93 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3136, clm=1 (1 core, 1 worker): 0.72 ms. Throughput: 1390.54 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2240, clm=4 (1 core, 1 worker): 0.77 ms. Throughput: 1304.58 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2240, clm=2 (1 core, 1 worker): 0.73 ms. Throughput: 1366.18 iter/sec.
FFTlen=1960K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2240, clm=1 (1 core, 1 worker): 0.74 ms. Throughput: 1343.55 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3200, clm=4 (1 core, 1 worker): 0.72 ms. Throughput: 1391.21 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3200, clm=2 (1 core, 1 worker): 0.72 ms. Throughput: 1389.51 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3200, clm=1 (1 core, 1 worker): 0.72 ms. Throughput: 1391.36 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1600, clm=2 (1 core, 1 worker): 0.76 ms. Throughput: 1321.36 iter/sec.
FFTlen=2000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1600, clm=1 (1 core, 1 worker): 0.77 ms. Throughput: 1293.76 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2688, clm=4 (1 core, 1 worker): 0.75 ms. Throughput: 1341.15 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2688, clm=2 (1 core, 1 worker): 0.73 ms. Throughput: 1369.98 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=768, Pass2=2688, clm=1 (1 core, 1 worker): 0.72 ms. Throughput: 1380.61 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2304, clm=4 (1 core, 1 worker): 0.76 ms. Throughput: 1318.20 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2304, clm=2 (1 core, 1 worker): 0.75 ms. Throughput: 1341.57 iter/sec.
FFTlen=2016K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2304, clm=1 (1 core, 1 worker): 0.74 ms. Throughput: 1356.18 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=128, Pass2=16384, clm=4 (1 core, 1 worker): 0.62 ms. Throughput: 1624.29 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=128, Pass2=16384, clm=2 (1 core, 1 worker): 0.64 ms. Throughput: 1569.61 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=128, Pass2=16384, clm=1 (1 core, 1 worker): 0.69 ms. Throughput: 1442.22 iter/sec.
FFTlen=2048K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=1024, clm=1 (1 core, 1 worker): 0.75 ms. Throughput: 1335.28 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2240, clm=4 (1 core, 1 worker): 0.83 ms. Throughput: 1203.76 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2240, clm=2 (1 core, 1 worker): 0.78 ms. Throughput: 1282.97 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2240, clm=1 (1 core, 1 worker): 0.80 ms. Throughput: 1253.12 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1600, clm=2 (1 core, 1 worker): 0.82 ms. Throughput: 1216.13 iter/sec.
FFTlen=2100K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1600, clm=1 (1 core, 1 worker): 0.83 ms. Throughput: 1210.09 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2304, clm=4 (1 core, 1 worker): 0.84 ms. Throughput: 1196.10 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2304, clm=2 (1 core, 1 worker): 0.80 ms. Throughput: 1251.21 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2304, clm=1 (1 core, 1 worker): 0.80 ms. Throughput: 1244.86 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1920, clm=2 (1 core, 1 worker): 0.80 ms. Throughput: 1243.83 iter/sec.
FFTlen=2160K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=1920, clm=1 (1 core, 1 worker): 0.79 ms. Throughput: 1271.00 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=128, Pass2=17920, clm=4 (1 core, 1 worker): 0.78 ms. Throughput: 1289.62 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=128, Pass2=17920, clm=2 (1 core, 1 worker): 0.78 ms. Throughput: 1274.68 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=128, Pass2=17920, clm=1 (1 core, 1 worker): 0.85 ms. Throughput: 1174.81 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3584, clm=4 (1 core, 1 worker): 0.78 ms. Throughput: 1279.14 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3584, clm=2 (1 core, 1 worker): 0.75 ms. Throughput: 1329.32 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3584, clm=1 (1 core, 1 worker): 0.76 ms. Throughput: 1312.34 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2560, clm=4 (1 core, 1 worker): 0.81 ms. Throughput: 1241.90 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2560, clm=2 (1 core, 1 worker): 0.80 ms. Throughput: 1248.73 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2560, clm=1 (1 core, 1 worker): 0.80 ms. Throughput: 1243.22 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2240, clm=2 (1 core, 1 worker): 0.82 ms. Throughput: 1213.41 iter/sec.
FFTlen=2240K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2240, clm=1 (1 core, 1 worker): 0.81 ms. Throughput: 1229.12 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=128, Pass2=18432, clm=4 (1 core, 1 worker): 0.78 ms. Throughput: 1277.86 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=128, Pass2=18432, clm=2 (1 core, 1 worker): 0.80 ms. Throughput: 1249.71 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=128, Pass2=18432, clm=1 (1 core, 1 worker): 0.86 ms. Throughput: 1165.50 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12288, clm=4 (1 core, 1 worker): 0.70 ms. Throughput: 1434.76 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12288, clm=2 (1 core, 1 worker): 0.73 ms. Throughput: 1372.91 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12288, clm=1 (1 core, 1 worker): 0.73 ms. Throughput: 1374.91 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3072, clm=4 (1 core, 1 worker): 0.80 ms. Throughput: 1248.85 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3072, clm=2 (1 core, 1 worker): 0.77 ms. Throughput: 1292.23 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3072, clm=1 (1 core, 1 worker): 0.75 ms. Throughput: 1327.86 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2304, clm=2 (1 core, 1 worker): 0.82 ms. Throughput: 1214.71 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2304, clm=1 (1 core, 1 worker): 0.82 ms. Throughput: 1216.94 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=1024, clm=1 (1 core, 1 worker): 0.86 ms. Throughput: 1161.03 iter/sec.
FFTlen=2304K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=768, clm=1 (1 core, 1 worker): 0.82 ms. Throughput: 1221.39 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3136, clm=4 (1 core, 1 worker): 0.86 ms. Throughput: 1159.94 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3136, clm=2 (1 core, 1 worker): 0.87 ms. Throughput: 1150.88 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3136, clm=1 (1 core, 1 worker): 0.83 ms. Throughput: 1201.72 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2688, clm=4 (1 core, 1 worker): 0.86 ms. Throughput: 1168.02 iter/sec.
[Sun Mar 27 19:15:57 2022]
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2688, clm=2 (1 core, 1 worker): 0.87 ms. Throughput: 1151.76 iter/sec.
FFTlen=2352K all-complex, Type=3, Arch=8, Pass1=896, Pass2=2688, clm=1 (1 core, 1 worker): 0.88 ms. Throughput: 1135.27 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12800, clm=4 (1 core, 1 worker): 0.82 ms. Throughput: 1217.39 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12800, clm=2 (1 core, 1 worker): 0.83 ms. Throughput: 1206.57 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=192, Pass2=12800, clm=1 (1 core, 1 worker): 0.83 ms. Throughput: 1202.11 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3840, clm=4 (1 core, 1 worker): 0.79 ms. Throughput: 1257.90 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3840, clm=2 (1 core, 1 worker): 0.79 ms. Throughput: 1272.42 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=3840, clm=1 (1 core, 1 worker): 0.80 ms. Throughput: 1244.00 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3200, clm=4 (1 core, 1 worker): 0.90 ms. Throughput: 1114.88 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3200, clm=2 (1 core, 1 worker): 0.87 ms. Throughput: 1153.50 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3200, clm=1 (1 core, 1 worker): 0.85 ms. Throughput: 1183.31 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2560, clm=4 (1 core, 1 worker): 0.92 ms. Throughput: 1084.00 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2560, clm=2 (1 core, 1 worker): 0.87 ms. Throughput: 1146.49 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2560, clm=1 (1 core, 1 worker): 0.87 ms. Throughput: 1148.81 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1920, clm=2 (1 core, 1 worker): 0.89 ms. Throughput: 1122.56 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=1920, clm=1 (1 core, 1 worker): 0.91 ms. Throughput: 1095.05 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1600, clm=2 (1 core, 1 worker): 0.94 ms. Throughput: 1065.88 iter/sec.
FFTlen=2400K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1600, clm=1 (1 core, 1 worker): 0.92 ms. Throughput: 1090.95 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2688, clm=4 (1 core, 1 worker): 0.96 ms. Throughput: 1039.11 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2688, clm=2 (1 core, 1 worker): 0.94 ms. Throughput: 1069.18 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=960, Pass2=2688, clm=1 (1 core, 1 worker): 0.90 ms. Throughput: 1107.23 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2240, clm=2 (1 core, 1 worker): 0.91 ms. Throughput: 1095.19 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2240, clm=1 (1 core, 1 worker): 0.91 ms. Throughput: 1101.32 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1920, clm=2 (1 core, 1 worker): 0.96 ms. Throughput: 1037.17 iter/sec.
FFTlen=2520K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=1920, clm=1 (1 core, 1 worker): 0.94 ms. Throughput: 1060.17 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=128, Pass2=20480, clm=4 (1 core, 1 worker): 0.86 ms. Throughput: 1160.90 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=128, Pass2=20480, clm=2 (1 core, 1 worker): 0.90 ms. Throughput: 1111.83 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=128, Pass2=20480, clm=1 (1 core, 1 worker): 0.95 ms. Throughput: 1048.01 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4096, clm=4 (1 core, 1 worker): 0.87 ms. Throughput: 1150.13 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4096, clm=2 (1 core, 1 worker): 0.83 ms. Throughput: 1204.47 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4096, clm=1 (1 core, 1 worker): 0.86 ms. Throughput: 1166.05 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2560, clm=2 (1 core, 1 worker): 0.90 ms. Throughput: 1108.28 iter/sec.
FFTlen=2560K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2560, clm=1 (1 core, 1 worker): 0.89 ms. Throughput: 1122.17 iter/sec.
FFTlen=2592K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2304, clm=2 (1 core, 1 worker): 0.94 ms. Throughput: 1063.08 iter/sec.
FFTlen=2592K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2304, clm=1 (1 core, 1 worker): 0.92 ms. Throughput: 1088.26 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=128, Pass2=21504, clm=4 (1 core, 1 worker): 0.91 ms. Throughput: 1101.60 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=128, Pass2=21504, clm=2 (1 core, 1 worker): 0.94 ms. Throughput: 1066.55 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=128, Pass2=21504, clm=1 (1 core, 1 worker): 1.02 ms. Throughput: 979.08 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3584, clm=4 (1 core, 1 worker): 0.94 ms. Throughput: 1066.03 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3584, clm=2 (1 core, 1 worker): 0.90 ms. Throughput: 1106.35 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3584, clm=1 (1 core, 1 worker): 0.92 ms. Throughput: 1082.23 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3072, clm=4 (1 core, 1 worker): 0.92 ms. Throughput: 1088.25 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3072, clm=2 (1 core, 1 worker): 0.91 ms. Throughput: 1098.26 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3072, clm=1 (1 core, 1 worker): 0.93 ms. Throughput: 1076.99 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2688, clm=2 (1 core, 1 worker): 0.98 ms. Throughput: 1023.93 iter/sec.
FFTlen=2688K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=2688, clm=1 (1 core, 1 worker): 0.95 ms. Throughput: 1048.73 iter/sec.
FFTlen=2744K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3136, clm=4 (1 core, 1 worker): 1.02 ms. Throughput: 976.94 iter/sec.
FFTlen=2744K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3136, clm=2 (1 core, 1 worker): 1.01 ms. Throughput: 986.58 iter/sec.
FFTlen=2744K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3136, clm=1 (1 core, 1 worker): 1.02 ms. Throughput: 976.67 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4480, clm=4 (1 core, 1 worker): 1.00 ms. Throughput: 1000.86 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4480, clm=2 (1 core, 1 worker): 0.99 ms. Throughput: 1014.66 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4480, clm=1 (1 core, 1 worker): 0.99 ms. Throughput: 1007.22 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3200, clm=4 (1 core, 1 worker): 1.03 ms. Throughput: 966.31 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3200, clm=2 (1 core, 1 worker): 1.01 ms. Throughput: 992.26 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3200, clm=1 (1 core, 1 worker): 1.03 ms. Throughput: 970.36 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2240, clm=2 (1 core, 1 worker): 1.02 ms. Throughput: 982.69 iter/sec.
FFTlen=2800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2240, clm=1 (1 core, 1 worker): 1.06 ms. Throughput: 943.26 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=192, Pass2=15360, clm=4 (1 core, 1 worker): 0.99 ms. Throughput: 1010.13 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=192, Pass2=15360, clm=2 (1 core, 1 worker): 1.01 ms. Throughput: 994.99 iter/sec.
[Sun Mar 27 19:21:00 2022]
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=192, Pass2=15360, clm=1 (1 core, 1 worker): 1.01 ms. Throughput: 994.10 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4608, clm=4 (1 core, 1 worker): 0.99 ms. Throughput: 1014.18 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4608, clm=2 (1 core, 1 worker): 0.98 ms. Throughput: 1020.38 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=640, Pass2=4608, clm=1 (1 core, 1 worker): 0.98 ms. Throughput: 1023.48 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3840, clm=4 (1 core, 1 worker): 0.97 ms. Throughput: 1031.53 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3840, clm=2 (1 core, 1 worker): 0.98 ms. Throughput: 1018.25 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=768, Pass2=3840, clm=1 (1 core, 1 worker): 0.97 ms. Throughput: 1032.05 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3072, clm=4 (1 core, 1 worker): 1.03 ms. Throughput: 973.05 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3072, clm=2 (1 core, 1 worker): 1.00 ms. Throughput: 1001.04 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3072, clm=1 (1 core, 1 worker): 1.00 ms. Throughput: 1004.08 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2560, clm=2 (1 core, 1 worker): 1.03 ms. Throughput: 972.71 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2560, clm=1 (1 core, 1 worker): 1.01 ms. Throughput: 988.57 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2304, clm=2 (1 core, 1 worker): 1.06 ms. Throughput: 944.73 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2304, clm=1 (1 core, 1 worker): 1.04 ms. Throughput: 959.89 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1920, clm=2 (1 core, 1 worker): 1.11 ms. Throughput: 903.03 iter/sec.
FFTlen=2880K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=1920, clm=1 (1 core, 1 worker): 1.07 ms. Throughput: 930.50 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3136, clm=4 (1 core, 1 worker): 1.10 ms. Throughput: 911.80 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3136, clm=2 (1 core, 1 worker): 1.06 ms. Throughput: 939.18 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3136, clm=1 (1 core, 1 worker): 1.08 ms. Throughput: 925.43 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2240, clm=2 (1 core, 1 worker): 1.11 ms. Throughput: 902.62 iter/sec.
FFTlen=2940K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2240, clm=1 (1 core, 1 worker): 1.11 ms. Throughput: 897.28 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3200, clm=4 (1 core, 1 worker): 1.09 ms. Throughput: 917.64 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3200, clm=2 (1 core, 1 worker): 1.09 ms. Throughput: 918.39 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3200, clm=1 (1 core, 1 worker): 1.10 ms. Throughput: 913.14 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1600, clm=2 (1 core, 1 worker): 1.22 ms. Throughput: 823.02 iter/sec.
FFTlen=3000K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1600, clm=1 (1 core, 1 worker): 1.13 ms. Throughput: 883.72 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2688, clm=2 (1 core, 1 worker): 1.09 ms. Throughput: 918.52 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=2688, clm=1 (1 core, 1 worker): 1.12 ms. Throughput: 895.72 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2304, clm=2 (1 core, 1 worker): 1.12 ms. Throughput: 893.46 iter/sec.
FFTlen=3024K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2304, clm=1 (1 core, 1 worker): 1.13 ms. Throughput: 883.67 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=128, Pass2=24576, clm=4 (1 core, 1 worker): 1.03 ms. Throughput: 968.16 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=128, Pass2=24576, clm=2 (1 core, 1 worker): 1.09 ms. Throughput: 919.66 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=128, Pass2=24576, clm=1 (1 core, 1 worker): 1.16 ms. Throughput: 858.44 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=192, Pass2=16384, clm=4 (1 core, 1 worker): 0.98 ms. Throughput: 1016.52 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=192, Pass2=16384, clm=2 (1 core, 1 worker): 0.97 ms. Throughput: 1036.16 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=192, Pass2=16384, clm=1 (1 core, 1 worker): 1.00 ms. Throughput: 999.56 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4096, clm=4 (1 core, 1 worker): 1.06 ms. Throughput: 940.77 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4096, clm=2 (1 core, 1 worker): 1.03 ms. Throughput: 971.78 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4096, clm=1 (1 core, 1 worker): 1.02 ms. Throughput: 980.02 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3072, clm=2 (1 core, 1 worker): 1.04 ms. Throughput: 964.86 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3072, clm=1 (1 core, 1 worker): 1.04 ms. Throughput: 962.62 iter/sec.
FFTlen=3072K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=1024, clm=1 (1 core, 1 worker): 1.19 ms. Throughput: 843.82 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=128, Pass2=25088, clm=4 (1 core, 1 worker): 1.09 ms. Throughput: 914.89 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=128, Pass2=25088, clm=2 (1 core, 1 worker): 1.12 ms. Throughput: 894.21 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=128, Pass2=25088, clm=1 (1 core, 1 worker): 1.19 ms. Throughput: 840.65 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3584, clm=4 (1 core, 1 worker): 1.13 ms. Throughput: 886.27 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3584, clm=2 (1 core, 1 worker): 1.09 ms. Throughput: 914.10 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3584, clm=1 (1 core, 1 worker): 1.10 ms. Throughput: 912.60 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3136, clm=2 (1 core, 1 worker): 1.15 ms. Throughput: 868.07 iter/sec.
FFTlen=3136K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3136, clm=1 (1 core, 1 worker): 1.12 ms. Throughput: 892.16 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5120, clm=4 (1 core, 1 worker): 1.07 ms. Throughput: 934.44 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5120, clm=2 (1 core, 1 worker): 1.09 ms. Throughput: 915.35 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5120, clm=1 (1 core, 1 worker): 1.09 ms. Throughput: 916.03 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3200, clm=2 (1 core, 1 worker): 1.16 ms. Throughput: 864.52 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3200, clm=1 (1 core, 1 worker): 1.15 ms. Throughput: 872.21 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2560, clm=2 (1 core, 1 worker): 1.16 ms. Throughput: 859.35 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2560, clm=1 (1 core, 1 worker): 1.15 ms. Throughput: 872.60 iter/sec.
FFTlen=3200K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=1600, clm=1 (1 core, 1 worker): 1.19 ms. Throughput: 838.23 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=192, Pass2=17920, clm=4 (1 core, 1 worker): 1.18 ms. Throughput: 848.14 iter/sec.
[Sun Mar 27 19:26:05 2022]
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=192, Pass2=17920, clm=2 (1 core, 1 worker): 1.19 ms. Throughput: 843.30 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=192, Pass2=17920, clm=1 (1 core, 1 worker): 1.21 ms. Throughput: 825.37 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5376, clm=4 (1 core, 1 worker): 1.15 ms. Throughput: 871.80 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5376, clm=2 (1 core, 1 worker): 1.13 ms. Throughput: 881.51 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=640, Pass2=5376, clm=1 (1 core, 1 worker): 1.16 ms. Throughput: 860.93 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4480, clm=4 (1 core, 1 worker): 1.20 ms. Throughput: 835.16 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4480, clm=2 (1 core, 1 worker): 1.19 ms. Throughput: 837.77 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4480, clm=1 (1 core, 1 worker): 1.18 ms. Throughput: 850.72 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3840, clm=4 (1 core, 1 worker): 1.16 ms. Throughput: 865.73 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3840, clm=2 (1 core, 1 worker): 1.15 ms. Throughput: 868.80 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=896, Pass2=3840, clm=1 (1 core, 1 worker): 1.18 ms. Throughput: 846.59 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3584, clm=4 (1 core, 1 worker): 1.22 ms. Throughput: 819.21 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3584, clm=2 (1 core, 1 worker): 1.17 ms. Throughput: 854.76 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3584, clm=1 (1 core, 1 worker): 1.18 ms. Throughput: 848.26 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2688, clm=2 (1 core, 1 worker): 1.19 ms. Throughput: 837.40 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=2688, clm=1 (1 core, 1 worker): 1.23 ms. Throughput: 811.18 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2560, clm=2 (1 core, 1 worker): 1.24 ms. Throughput: 805.15 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2560, clm=1 (1 core, 1 worker): 1.22 ms. Throughput: 820.19 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2240, clm=2 (1 core, 1 worker): 1.25 ms. Throughput: 799.50 iter/sec.
FFTlen=3360K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2240, clm=1 (1 core, 1 worker): 1.22 ms. Throughput: 819.88 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=192, Pass2=18432, clm=4 (1 core, 1 worker): 1.18 ms. Throughput: 844.28 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=192, Pass2=18432, clm=2 (1 core, 1 worker): 1.19 ms. Throughput: 838.91 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=192, Pass2=18432, clm=1 (1 core, 1 worker): 1.23 ms. Throughput: 816.09 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4608, clm=4 (1 core, 1 worker): 1.18 ms. Throughput: 847.55 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4608, clm=2 (1 core, 1 worker): 1.17 ms. Throughput: 855.66 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=768, Pass2=4608, clm=1 (1 core, 1 worker): 1.18 ms. Throughput: 847.56 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3072, clm=2 (1 core, 1 worker): 1.19 ms. Throughput: 841.62 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3072, clm=1 (1 core, 1 worker): 1.18 ms. Throughput: 848.84 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2304, clm=2 (1 core, 1 worker): 1.28 ms. Throughput: 780.70 iter/sec.
FFTlen=3456K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2304, clm=1 (1 core, 1 worker): 1.25 ms. Throughput: 803.03 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3136, clm=2 (1 core, 1 worker): 1.29 ms. Throughput: 776.93 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3136, clm=1 (1 core, 1 worker): 1.29 ms. Throughput: 778.19 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2688, clm=2 (1 core, 1 worker): 1.32 ms. Throughput: 759.09 iter/sec.
FFTlen=3528K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=2688, clm=1 (1 core, 1 worker): 1.29 ms. Throughput: 773.00 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=128, Pass2=28672, clm=4 (1 core, 1 worker): 1.25 ms. Throughput: 802.15 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=128, Pass2=28672, clm=2 (1 core, 1 worker): 1.29 ms. Throughput: 772.34 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=128, Pass2=28672, clm=1 (1 core, 1 worker): 1.34 ms. Throughput: 745.92 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4096, clm=4 (1 core, 1 worker): 1.24 ms. Throughput: 807.41 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4096, clm=2 (1 core, 1 worker): 1.24 ms. Throughput: 808.15 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4096, clm=1 (1 core, 1 worker): 1.25 ms. Throughput: 797.79 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3584, clm=2 (1 core, 1 worker): 1.25 ms. Throughput: 800.30 iter/sec.
FFTlen=3584K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3584, clm=1 (1 core, 1 worker): 1.22 ms. Throughput: 822.68 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3840, clm=4 (1 core, 1 worker): 1.29 ms. Throughput: 776.08 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3840, clm=2 (1 core, 1 worker): 1.25 ms. Throughput: 799.45 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=960, Pass2=3840, clm=1 (1 core, 1 worker): 1.27 ms. Throughput: 789.32 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3200, clm=2 (1 core, 1 worker): 1.31 ms. Throughput: 765.44 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3200, clm=1 (1 core, 1 worker): 1.28 ms. Throughput: 780.14 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1920, clm=2 (1 core, 1 worker): 1.40 ms. Throughput: 713.39 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=1920, clm=1 (1 core, 1 worker): 1.35 ms. Throughput: 741.33 iter/sec.
FFTlen=3600K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=1600, clm=1 (1 core, 1 worker): 1.39 ms. Throughput: 718.64 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=128, Pass2=30720, clm=4 (1 core, 1 worker): 1.35 ms. Throughput: 739.52 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=128, Pass2=30720, clm=2 (1 core, 1 worker): 1.36 ms. Throughput: 737.32 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=128, Pass2=30720, clm=1 (1 core, 1 worker): 1.47 ms. Throughput: 682.10 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=192, Pass2=20480, clm=4 (1 core, 1 worker): 1.35 ms. Throughput: 741.79 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=192, Pass2=20480, clm=2 (1 core, 1 worker): 1.36 ms. Throughput: 735.06 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=192, Pass2=20480, clm=1 (1 core, 1 worker): 1.40 ms. Throughput: 715.40 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6144, clm=4 (1 core, 1 worker): 1.33 ms. Throughput: 749.36 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6144, clm=2 (1 core, 1 worker): 1.27 ms. Throughput: 784.78 iter/sec.
[Sun Mar 27 19:31:06 2022]
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6144, clm=1 (1 core, 1 worker): 1.32 ms. Throughput: 757.66 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5120, clm=4 (1 core, 1 worker): 1.35 ms. Throughput: 742.37 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5120, clm=2 (1 core, 1 worker): 1.33 ms. Throughput: 753.72 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5120, clm=1 (1 core, 1 worker): 1.30 ms. Throughput: 767.07 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4096, clm=4 (1 core, 1 worker): 1.38 ms. Throughput: 722.89 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4096, clm=2 (1 core, 1 worker): 1.34 ms. Throughput: 746.06 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4096, clm=1 (1 core, 1 worker): 1.35 ms. Throughput: 738.36 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3840, clm=2 (1 core, 1 worker): 1.33 ms. Throughput: 751.59 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=3840, clm=1 (1 core, 1 worker): 1.30 ms. Throughput: 766.46 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3072, clm=2 (1 core, 1 worker): 1.32 ms. Throughput: 755.91 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3072, clm=1 (1 core, 1 worker): 1.36 ms. Throughput: 735.65 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2560, clm=2 (1 core, 1 worker): 1.42 ms. Throughput: 704.79 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2560, clm=1 (1 core, 1 worker): 1.39 ms. Throughput: 721.32 iter/sec.
FFTlen=3840K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=1920, clm=1 (1 core, 1 worker): 1.39 ms. Throughput: 718.03 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4480, clm=4 (1 core, 1 worker): 1.44 ms. Throughput: 692.31 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4480, clm=2 (1 core, 1 worker): 1.45 ms. Throughput: 689.97 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4480, clm=1 (1 core, 1 worker): 1.46 ms. Throughput: 683.77 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3136, clm=2 (1 core, 1 worker): 1.44 ms. Throughput: 692.57 iter/sec.
FFTlen=3920K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3136, clm=1 (1 core, 1 worker): 1.46 ms. Throughput: 686.07 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6400, clm=4 (1 core, 1 worker): 1.40 ms. Throughput: 716.06 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6400, clm=2 (1 core, 1 worker): 1.39 ms. Throughput: 720.40 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=6400, clm=1 (1 core, 1 worker): 1.40 ms. Throughput: 716.77 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3200, clm=2 (1 core, 1 worker): 1.45 ms. Throughput: 691.63 iter/sec.
FFTlen=4000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3200, clm=1 (1 core, 1 worker): 1.49 ms. Throughput: 670.92 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=192, Pass2=21504, clm=4 (1 core, 1 worker): 1.44 ms. Throughput: 696.09 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=192, Pass2=21504, clm=2 (1 core, 1 worker): 1.45 ms. Throughput: 688.63 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=192, Pass2=21504, clm=1 (1 core, 1 worker): 1.48 ms. Throughput: 677.13 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5376, clm=4 (1 core, 1 worker): 1.42 ms. Throughput: 705.45 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5376, clm=2 (1 core, 1 worker): 1.42 ms. Throughput: 705.94 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=768, Pass2=5376, clm=1 (1 core, 1 worker): 1.41 ms. Throughput: 707.64 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4608, clm=4 (1 core, 1 worker): 1.43 ms. Throughput: 701.30 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4608, clm=2 (1 core, 1 worker): 1.43 ms. Throughput: 699.97 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=896, Pass2=4608, clm=1 (1 core, 1 worker): 1.42 ms. Throughput: 704.28 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3584, clm=2 (1 core, 1 worker): 1.42 ms. Throughput: 702.09 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3584, clm=1 (1 core, 1 worker): 1.42 ms. Throughput: 705.31 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3072, clm=2 (1 core, 1 worker): 1.43 ms. Throughput: 700.91 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3072, clm=1 (1 core, 1 worker): 1.45 ms. Throughput: 689.72 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2688, clm=2 (1 core, 1 worker): 1.53 ms. Throughput: 652.45 iter/sec.
FFTlen=4032K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=2688, clm=1 (1 core, 1 worker): 1.50 ms. Throughput: 667.34 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=128, Pass2=32768, clm=4 (1 core, 1 worker): 1.46 ms. Throughput: 686.80 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=128, Pass2=32768, clm=2 (1 core, 1 worker): 1.49 ms. Throughput: 670.92 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=128, Pass2=32768, clm=1 (1 core, 1 worker): 1.56 ms. Throughput: 640.98 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4096, clm=2 (1 core, 1 worker): 1.39 ms. Throughput: 718.48 iter/sec.
FFTlen=4096K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4096, clm=1 (1 core, 1 worker): 1.38 ms. Throughput: 722.90 iter/sec.
FFTlen=4116K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3136, clm=2 (1 core, 1 worker): 1.60 ms. Throughput: 623.26 iter/sec.
FFTlen=4116K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3136, clm=1 (1 core, 1 worker): 1.59 ms. Throughput: 628.46 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4480, clm=4 (1 core, 1 worker): 1.61 ms. Throughput: 622.67 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4480, clm=2 (1 core, 1 worker): 1.55 ms. Throughput: 646.72 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4480, clm=1 (1 core, 1 worker): 1.55 ms. Throughput: 644.29 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3200, clm=2 (1 core, 1 worker): 1.57 ms. Throughput: 636.15 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3200, clm=1 (1 core, 1 worker): 1.59 ms. Throughput: 630.80 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2240, clm=2 (1 core, 1 worker): 1.68 ms. Throughput: 595.58 iter/sec.
FFTlen=4200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2240, clm=1 (1 core, 1 worker): 1.61 ms. Throughput: 619.52 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4608, clm=4 (1 core, 1 worker): 1.59 ms. Throughput: 627.00 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4608, clm=2 (1 core, 1 worker): 1.53 ms. Throughput: 653.04 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=960, Pass2=4608, clm=1 (1 core, 1 worker): 1.53 ms. Throughput: 651.97 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3840, clm=2 (1 core, 1 worker): 1.53 ms. Throughput: 652.06 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=3840, clm=1 (1 core, 1 worker): 1.51 ms. Throughput: 662.05 iter/sec.
[Sun Mar 27 19:36:07 2022]
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2304, clm=2 (1 core, 1 worker): 1.69 ms. Throughput: 591.65 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2304, clm=1 (1 core, 1 worker): 1.61 ms. Throughput: 623.04 iter/sec.
FFTlen=4320K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=1920, clm=1 (1 core, 1 worker): 1.64 ms. Throughput: 608.34 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7168, clm=4 (1 core, 1 worker): 1.56 ms. Throughput: 641.42 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7168, clm=2 (1 core, 1 worker): 1.54 ms. Throughput: 650.74 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7168, clm=1 (1 core, 1 worker): 1.53 ms. Throughput: 651.92 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5120, clm=4 (1 core, 1 worker): 1.64 ms. Throughput: 611.59 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5120, clm=2 (1 core, 1 worker): 1.57 ms. Throughput: 635.23 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5120, clm=1 (1 core, 1 worker): 1.65 ms. Throughput: 607.70 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4480, clm=2 (1 core, 1 worker): 1.60 ms. Throughput: 623.07 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4480, clm=1 (1 core, 1 worker): 1.66 ms. Throughput: 604.17 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3584, clm=2 (1 core, 1 worker): 1.63 ms. Throughput: 613.34 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3584, clm=1 (1 core, 1 worker): 1.66 ms. Throughput: 602.35 iter/sec.
FFTlen=4480K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=2240, clm=1 (1 core, 1 worker): 1.64 ms. Throughput: 610.04 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=192, Pass2=24576, clm=4 (1 core, 1 worker): 1.68 ms. Throughput: 593.77 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=192, Pass2=24576, clm=2 (1 core, 1 worker): 1.68 ms. Throughput: 596.64 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=192, Pass2=24576, clm=1 (1 core, 1 worker): 1.75 ms. Throughput: 570.28 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6144, clm=4 (1 core, 1 worker): 1.64 ms. Throughput: 608.17 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6144, clm=2 (1 core, 1 worker): 1.59 ms. Throughput: 629.71 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6144, clm=1 (1 core, 1 worker): 1.61 ms. Throughput: 621.90 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4608, clm=2 (1 core, 1 worker): 1.59 ms. Throughput: 627.67 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=4608, clm=1 (1 core, 1 worker): 1.63 ms. Throughput: 614.96 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4096, clm=2 (1 core, 1 worker): 1.62 ms. Throughput: 615.75 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4096, clm=1 (1 core, 1 worker): 1.63 ms. Throughput: 614.93 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3072, clm=2 (1 core, 1 worker): 1.66 ms. Throughput: 603.38 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3072, clm=1 (1 core, 1 worker): 1.63 ms. Throughput: 612.19 iter/sec.
FFTlen=4608K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=2304, clm=1 (1 core, 1 worker): 1.69 ms. Throughput: 590.89 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=192, Pass2=25088, clm=4 (1 core, 1 worker): 1.76 ms. Throughput: 567.53 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=192, Pass2=25088, clm=2 (1 core, 1 worker): 1.78 ms. Throughput: 561.34 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=192, Pass2=25088, clm=1 (1 core, 1 worker): 1.76 ms. Throughput: 566.65 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5376, clm=4 (1 core, 1 worker): 1.74 ms. Throughput: 574.28 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5376, clm=2 (1 core, 1 worker): 1.74 ms. Throughput: 573.58 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=896, Pass2=5376, clm=1 (1 core, 1 worker): 1.74 ms. Throughput: 573.94 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3584, clm=2 (1 core, 1 worker): 1.78 ms. Throughput: 560.59 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3584, clm=1 (1 core, 1 worker): 1.73 ms. Throughput: 579.57 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3136, clm=2 (1 core, 1 worker): 1.81 ms. Throughput: 551.31 iter/sec.
FFTlen=4704K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3136, clm=1 (1 core, 1 worker): 1.83 ms. Throughput: 546.85 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7680, clm=4 (1 core, 1 worker): 1.69 ms. Throughput: 590.67 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7680, clm=2 (1 core, 1 worker): 1.66 ms. Throughput: 602.16 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=640, Pass2=7680, clm=1 (1 core, 1 worker): 1.71 ms. Throughput: 586.20 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6400, clm=4 (1 core, 1 worker): 1.77 ms. Throughput: 566.36 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6400, clm=2 (1 core, 1 worker): 1.72 ms. Throughput: 581.13 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=768, Pass2=6400, clm=1 (1 core, 1 worker): 1.69 ms. Throughput: 590.64 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5120, clm=4 (1 core, 1 worker): 1.80 ms. Throughput: 554.67 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5120, clm=2 (1 core, 1 worker): 1.75 ms. Throughput: 572.72 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5120, clm=1 (1 core, 1 worker): 1.73 ms. Throughput: 579.23 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3840, clm=2 (1 core, 1 worker): 1.74 ms. Throughput: 574.98 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=3840, clm=1 (1 core, 1 worker): 1.77 ms. Throughput: 565.66 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3200, clm=2 (1 core, 1 worker): 1.84 ms. Throughput: 544.18 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3200, clm=1 (1 core, 1 worker): 1.79 ms. Throughput: 559.18 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2560, clm=2 (1 core, 1 worker): 1.86 ms. Throughput: 537.00 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2560, clm=1 (1 core, 1 worker): 1.80 ms. Throughput: 556.08 iter/sec.
FFTlen=4800K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=1600, clm=1 (1 core, 1 worker): 1.97 ms. Throughput: 508.33 iter/sec.
[/CODE]

Magellan3s 2022-03-28 01:02

[QUOTE=Magellan3s;602744]Part 1[/QUOTE]
Part 2


[CODE]FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5376, clm=4 (1 core, 1 worker): 1.89 ms. Throughput: 529.79 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5376, clm=2 (1 core, 1 worker): 1.85 ms. Throughput: 541.56 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=960, Pass2=5376, clm=1 (1 core, 1 worker): 1.86 ms. Throughput: 536.58 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4480, clm=2 (1 core, 1 worker): 1.86 ms. Throughput: 538.13 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4480, clm=1 (1 core, 1 worker): 1.84 ms. Throughput: 542.45 iter/sec.
[Sun Mar 27 19:41:10 2022]
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3840, clm=2 (1 core, 1 worker): 1.87 ms. Throughput: 534.60 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=3840, clm=1 (1 core, 1 worker): 1.89 ms. Throughput: 527.81 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2688, clm=2 (1 core, 1 worker): 2.01 ms. Throughput: 496.35 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=2688, clm=1 (1 core, 1 worker): 1.90 ms. Throughput: 526.32 iter/sec.
FFTlen=5040K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=2240, clm=1 (1 core, 1 worker): 1.98 ms. Throughput: 504.91 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=640, Pass2=8192, clm=4 (1 core, 1 worker): 1.82 ms. Throughput: 550.40 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=640, Pass2=8192, clm=2 (1 core, 1 worker): 1.80 ms. Throughput: 556.89 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=640, Pass2=8192, clm=1 (1 core, 1 worker): 1.87 ms. Throughput: 534.73 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=5120, clm=2 (1 core, 1 worker): 1.84 ms. Throughput: 542.09 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=5120, clm=1 (1 core, 1 worker): 1.84 ms. Throughput: 544.20 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=4096, clm=2 (1 core, 1 worker): 1.89 ms. Throughput: 529.99 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=4096, clm=1 (1 core, 1 worker): 1.91 ms. Throughput: 523.88 iter/sec.
FFTlen=5120K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=2560, clm=1 (1 core, 1 worker): 1.91 ms. Throughput: 524.89 iter/sec.
FFTlen=5184K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4608, clm=2 (1 core, 1 worker): 1.89 ms. Throughput: 530.38 iter/sec.
FFTlen=5184K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=4608, clm=1 (1 core, 1 worker): 1.88 ms. Throughput: 532.95 iter/sec.
FFTlen=5184K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=2304, clm=1 (1 core, 1 worker): 1.93 ms. Throughput: 516.83 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=192, Pass2=28672, clm=4 (1 core, 1 worker): 2.07 ms. Throughput: 483.05 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=192, Pass2=28672, clm=2 (1 core, 1 worker): 2.06 ms. Throughput: 485.93 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=192, Pass2=28672, clm=1 (1 core, 1 worker): 2.06 ms. Throughput: 484.57 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=768, Pass2=7168, clm=4 (1 core, 1 worker): 2.04 ms. Throughput: 490.75 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=768, Pass2=7168, clm=2 (1 core, 1 worker): 1.98 ms. Throughput: 504.64 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=768, Pass2=7168, clm=1 (1 core, 1 worker): 1.95 ms. Throughput: 513.24 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=896, Pass2=6144, clm=4 (1 core, 1 worker): 2.04 ms. Throughput: 489.59 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=896, Pass2=6144, clm=2 (1 core, 1 worker): 1.98 ms. Throughput: 503.97 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=896, Pass2=6144, clm=1 (1 core, 1 worker): 1.97 ms. Throughput: 507.93 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=5376, clm=2 (1 core, 1 worker): 1.97 ms. Throughput: 507.32 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=5376, clm=1 (1 core, 1 worker): 1.97 ms. Throughput: 506.88 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=4096, clm=2 (1 core, 1 worker): 2.00 ms. Throughput: 499.65 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=4096, clm=1 (1 core, 1 worker): 2.02 ms. Throughput: 494.80 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3584, clm=2 (1 core, 1 worker): 2.06 ms. Throughput: 485.99 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3584, clm=1 (1 core, 1 worker): 1.99 ms. Throughput: 501.90 iter/sec.
FFTlen=5376K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=2688, clm=1 (1 core, 1 worker): 2.07 ms. Throughput: 483.51 iter/sec.
FFTlen=5600K all-complex, Type=3, Arch=8, Pass1=896, Pass2=6400, clm=4 (1 core, 1 worker): 2.20 ms. Throughput: 454.75 iter/sec.
FFTlen=5600K all-complex, Type=3, Arch=8, Pass1=896, Pass2=6400, clm=2 (1 core, 1 worker): 2.09 ms. Throughput: 478.24 iter/sec.
FFTlen=5600K all-complex, Type=3, Arch=8, Pass1=896, Pass2=6400, clm=1 (1 core, 1 worker): 2.17 ms. Throughput: 460.98 iter/sec.
FFTlen=5600K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=4480, clm=2 (1 core, 1 worker): 2.15 ms. Throughput: 465.58 iter/sec.
FFTlen=5600K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=4480, clm=1 (1 core, 1 worker): 2.18 ms. Throughput: 459.01 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=192, Pass2=30720, clm=4 (1 core, 1 worker): 2.30 ms. Throughput: 433.85 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=192, Pass2=30720, clm=2 (1 core, 1 worker): 2.31 ms. Throughput: 433.11 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=192, Pass2=30720, clm=1 (1 core, 1 worker): 2.30 ms. Throughput: 434.72 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=640, Pass2=9216, clm=4 (1 core, 1 worker): 2.17 ms. Throughput: 459.80 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=640, Pass2=9216, clm=2 (1 core, 1 worker): 2.11 ms. Throughput: 473.77 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=640, Pass2=9216, clm=1 (1 core, 1 worker): 2.11 ms. Throughput: 473.98 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=768, Pass2=7680, clm=4 (1 core, 1 worker): 2.22 ms. Throughput: 450.10 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=768, Pass2=7680, clm=2 (1 core, 1 worker): 2.15 ms. Throughput: 465.91 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=768, Pass2=7680, clm=1 (1 core, 1 worker): 2.18 ms. Throughput: 459.57 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=960, Pass2=6144, clm=4 (1 core, 1 worker): 2.29 ms. Throughput: 437.01 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=960, Pass2=6144, clm=2 (1 core, 1 worker): 2.16 ms. Throughput: 462.38 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=960, Pass2=6144, clm=1 (1 core, 1 worker): 2.13 ms. Throughput: 469.89 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=5120, clm=2 (1 core, 1 worker): 2.12 ms. Throughput: 470.76 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=5120, clm=1 (1 core, 1 worker): 2.19 ms. Throughput: 456.97 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=4608, clm=2 (1 core, 1 worker): 2.21 ms. Throughput: 451.52 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=4608, clm=1 (1 core, 1 worker): 2.21 ms. Throughput: 451.88 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3840, clm=2 (1 core, 1 worker): 2.24 ms. Throughput: 446.91 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=3840, clm=1 (1 core, 1 worker): 2.22 ms. Throughput: 450.78 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3072, clm=2 (1 core, 1 worker): 2.34 ms. Throughput: 426.45 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3072, clm=1 (1 core, 1 worker): 2.17 ms. Throughput: 460.55 iter/sec.
[Sun Mar 27 19:46:11 2022]
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=2560, clm=1 (1 core, 1 worker): 2.26 ms. Throughput: 442.27 iter/sec.
FFTlen=5760K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=1920, clm=1 (1 core, 1 worker): 2.41 ms. Throughput: 414.09 iter/sec.
FFTlen=5880K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=4480, clm=2 (1 core, 1 worker): 2.33 ms. Throughput: 429.59 iter/sec.
FFTlen=5880K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=4480, clm=1 (1 core, 1 worker): 2.29 ms. Throughput: 435.88 iter/sec.
FFTlen=5880K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3136, clm=2 (1 core, 1 worker): 2.44 ms. Throughput: 410.12 iter/sec.
FFTlen=5880K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3136, clm=1 (1 core, 1 worker): 2.34 ms. Throughput: 428.00 iter/sec.
FFTlen=6000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=6400, clm=4 (1 core, 1 worker): 2.42 ms. Throughput: 413.63 iter/sec.
FFTlen=6000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=6400, clm=2 (1 core, 1 worker): 2.31 ms. Throughput: 432.28 iter/sec.
FFTlen=6000K all-complex, Type=3, Arch=8, Pass1=960, Pass2=6400, clm=1 (1 core, 1 worker): 2.30 ms. Throughput: 434.38 iter/sec.
FFTlen=6000K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3200, clm=2 (1 core, 1 worker): 2.52 ms. Throughput: 397.37 iter/sec.
FFTlen=6000K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3200, clm=1 (1 core, 1 worker): 2.37 ms. Throughput: 422.08 iter/sec.
FFTlen=6048K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=5376, clm=2 (1 core, 1 worker): 2.33 ms. Throughput: 429.23 iter/sec.
FFTlen=6048K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=5376, clm=1 (1 core, 1 worker): 2.33 ms. Throughput: 429.75 iter/sec.
FFTlen=6048K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=4608, clm=2 (1 core, 1 worker): 2.35 ms. Throughput: 426.39 iter/sec.
FFTlen=6048K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=4608, clm=1 (1 core, 1 worker): 2.35 ms. Throughput: 425.48 iter/sec.
FFTlen=6048K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=2688, clm=1 (1 core, 1 worker): 2.42 ms. Throughput: 413.56 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=192, Pass2=32768, clm=4 (1 core, 1 worker): 2.62 ms. Throughput: 382.14 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=192, Pass2=32768, clm=2 (1 core, 1 worker): 2.57 ms. Throughput: 389.52 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=192, Pass2=32768, clm=1 (1 core, 1 worker): 2.71 ms. Throughput: 369.63 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=768, Pass2=8192, clm=4 (1 core, 1 worker): 2.43 ms. Throughput: 412.08 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=768, Pass2=8192, clm=2 (1 core, 1 worker): 2.36 ms. Throughput: 424.00 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=768, Pass2=8192, clm=1 (1 core, 1 worker): 2.44 ms. Throughput: 410.12 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=6144, clm=2 (1 core, 1 worker): 2.36 ms. Throughput: 424.44 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=6144, clm=1 (1 core, 1 worker): 2.38 ms. Throughput: 420.56 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=4096, clm=2 (1 core, 1 worker): 2.47 ms. Throughput: 404.23 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=4096, clm=1 (1 core, 1 worker): 2.41 ms. Throughput: 414.23 iter/sec.
FFTlen=6144K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=3072, clm=1 (1 core, 1 worker): 2.37 ms. Throughput: 422.26 iter/sec.
FFTlen=6272K all-complex, Type=3, Arch=8, Pass1=896, Pass2=7168, clm=4 (1 core, 1 worker): 2.59 ms. Throughput: 385.67 iter/sec.
FFTlen=6272K all-complex, Type=3, Arch=8, Pass1=896, Pass2=7168, clm=2 (1 core, 1 worker): 2.44 ms. Throughput: 409.36 iter/sec.
FFTlen=6272K all-complex, Type=3, Arch=8, Pass1=896, Pass2=7168, clm=1 (1 core, 1 worker): 2.48 ms. Throughput: 402.80 iter/sec.
FFTlen=6272K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=3136, clm=1 (1 core, 1 worker): 2.52 ms. Throughput: 397.11 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=10240, clm=4 (1 core, 1 worker): 2.52 ms. Throughput: 396.72 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=10240, clm=2 (1 core, 1 worker): 2.45 ms. Throughput: 408.94 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=640, Pass2=10240, clm=1 (1 core, 1 worker): 2.48 ms. Throughput: 403.00 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=6400, clm=2 (1 core, 1 worker): 2.50 ms. Throughput: 399.28 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=6400, clm=1 (1 core, 1 worker): 2.47 ms. Throughput: 404.10 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=5120, clm=2 (1 core, 1 worker): 2.54 ms. Throughput: 393.10 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=5120, clm=1 (1 core, 1 worker): 2.53 ms. Throughput: 395.37 iter/sec.
FFTlen=6400K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=3200, clm=1 (1 core, 1 worker): 2.59 ms. Throughput: 386.84 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=896, Pass2=7680, clm=4 (1 core, 1 worker): 2.83 ms. Throughput: 353.93 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=896, Pass2=7680, clm=2 (1 core, 1 worker): 2.70 ms. Throughput: 371.02 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=896, Pass2=7680, clm=1 (1 core, 1 worker): 2.84 ms. Throughput: 352.20 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=960, Pass2=7168, clm=4 (1 core, 1 worker): 2.85 ms. Throughput: 350.29 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=960, Pass2=7168, clm=2 (1 core, 1 worker): 2.68 ms. Throughput: 372.84 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=960, Pass2=7168, clm=1 (1 core, 1 worker): 2.72 ms. Throughput: 368.32 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=5376, clm=2 (1 core, 1 worker): 2.71 ms. Throughput: 368.66 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=5376, clm=1 (1 core, 1 worker): 2.73 ms. Throughput: 366.75 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=5120, clm=2 (1 core, 1 worker): 2.72 ms. Throughput: 368.01 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=5120, clm=1 (1 core, 1 worker): 2.73 ms. Throughput: 366.34 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=4480, clm=2 (1 core, 1 worker): 2.85 ms. Throughput: 351.08 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=4480, clm=1 (1 core, 1 worker): 2.75 ms. Throughput: 363.16 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3584, clm=2 (1 core, 1 worker): 2.89 ms. Throughput: 345.48 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3584, clm=1 (1 core, 1 worker): 2.71 ms. Throughput: 368.82 iter/sec.
FFTlen=6720K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=2240, clm=1 (1 core, 1 worker): 2.95 ms. Throughput: 338.73 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=768, Pass2=9216, clm=4 (1 core, 1 worker): 2.89 ms. Throughput: 345.82 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=768, Pass2=9216, clm=2 (1 core, 1 worker): 2.80 ms. Throughput: 357.64 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=768, Pass2=9216, clm=1 (1 core, 1 worker): 2.79 ms. Throughput: 358.96 iter/sec.
[Sun Mar 27 19:51:13 2022]
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=6144, clm=2 (1 core, 1 worker): 2.77 ms. Throughput: 361.06 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=6144, clm=1 (1 core, 1 worker): 2.74 ms. Throughput: 365.39 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=4608, clm=2 (1 core, 1 worker): 2.89 ms. Throughput: 345.69 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=4608, clm=1 (1 core, 1 worker): 2.80 ms. Throughput: 357.35 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=3072, clm=1 (1 core, 1 worker): 2.80 ms. Throughput: 357.21 iter/sec.
FFTlen=6912K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=2304, clm=1 (1 core, 1 worker): 3.04 ms. Throughput: 329.47 iter/sec.
FFTlen=7056K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=5376, clm=2 (1 core, 1 worker): 2.92 ms. Throughput: 342.87 iter/sec.
FFTlen=7056K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=5376, clm=1 (1 core, 1 worker): 2.88 ms. Throughput: 346.68 iter/sec.
FFTlen=7056K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=3136, clm=1 (1 core, 1 worker): 2.98 ms. Throughput: 335.84 iter/sec.
FFTlen=7168K all-complex, Type=3, Arch=8, Pass1=896, Pass2=8192, clm=4 (1 core, 1 worker): 3.07 ms. Throughput: 326.02 iter/sec.
FFTlen=7168K all-complex, Type=3, Arch=8, Pass1=896, Pass2=8192, clm=2 (1 core, 1 worker): 2.89 ms. Throughput: 346.18 iter/sec.
FFTlen=7168K all-complex, Type=3, Arch=8, Pass1=896, Pass2=8192, clm=1 (1 core, 1 worker): 3.05 ms. Throughput: 328.13 iter/sec.
FFTlen=7168K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=7168, clm=2 (1 core, 1 worker): 2.94 ms. Throughput: 340.41 iter/sec.
FFTlen=7168K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=7168, clm=1 (1 core, 1 worker): 2.90 ms. Throughput: 344.44 iter/sec.
FFTlen=7168K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=3584, clm=1 (1 core, 1 worker): 2.95 ms. Throughput: 339.18 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=7680, clm=4 (1 core, 1 worker): 3.14 ms. Throughput: 318.33 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=7680, clm=2 (1 core, 1 worker): 2.92 ms. Throughput: 341.92 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=960, Pass2=7680, clm=1 (1 core, 1 worker): 3.08 ms. Throughput: 324.55 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=6400, clm=2 (1 core, 1 worker): 2.91 ms. Throughput: 343.11 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=6400, clm=1 (1 core, 1 worker): 2.94 ms. Throughput: 340.22 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3840, clm=2 (1 core, 1 worker): 3.15 ms. Throughput: 317.09 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=3840, clm=1 (1 core, 1 worker): 2.98 ms. Throughput: 335.99 iter/sec.
FFTlen=7200K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=3200, clm=1 (1 core, 1 worker): 3.00 ms. Throughput: 332.99 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=12288, clm=4 (1 core, 1 worker): 3.20 ms. Throughput: 312.35 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=12288, clm=2 (1 core, 1 worker): 3.14 ms. Throughput: 318.30 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=640, Pass2=12288, clm=1 (1 core, 1 worker): 3.16 ms. Throughput: 316.58 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=10240, clm=4 (1 core, 1 worker): 3.29 ms. Throughput: 303.76 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=10240, clm=2 (1 core, 1 worker): 3.16 ms. Throughput: 316.10 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=768, Pass2=10240, clm=1 (1 core, 1 worker): 3.15 ms. Throughput: 317.23 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=960, Pass2=8192, clm=4 (1 core, 1 worker): 3.38 ms. Throughput: 295.56 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=960, Pass2=8192, clm=2 (1 core, 1 worker): 3.16 ms. Throughput: 315.99 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=960, Pass2=8192, clm=1 (1 core, 1 worker): 3.34 ms. Throughput: 299.50 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=7680, clm=2 (1 core, 1 worker): 3.17 ms. Throughput: 315.57 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=7680, clm=1 (1 core, 1 worker): 3.32 ms. Throughput: 300.85 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=6144, clm=2 (1 core, 1 worker): 3.24 ms. Throughput: 308.44 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=6144, clm=1 (1 core, 1 worker): 3.21 ms. Throughput: 311.30 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=5120, clm=2 (1 core, 1 worker): 3.30 ms. Throughput: 302.80 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=5120, clm=1 (1 core, 1 worker): 3.23 ms. Throughput: 309.22 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=4096, clm=2 (1 core, 1 worker): 3.41 ms. Throughput: 293.54 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=1920, Pass2=4096, clm=1 (1 core, 1 worker): 3.22 ms. Throughput: 310.25 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=3840, clm=1 (1 core, 1 worker): 3.24 ms. Throughput: 308.33 iter/sec.
FFTlen=7680K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=2560, clm=1 (1 core, 1 worker): 3.42 ms. Throughput: 292.15 iter/sec.
FFTlen=8000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=12800, clm=4 (1 core, 1 worker): 3.48 ms. Throughput: 286.99 iter/sec.
FFTlen=8000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=12800, clm=2 (1 core, 1 worker): 3.43 ms. Throughput: 291.71 iter/sec.
FFTlen=8000K all-complex, Type=3, Arch=8, Pass1=640, Pass2=12800, clm=1 (1 core, 1 worker): 3.44 ms. Throughput: 290.42 iter/sec.
FFTlen=8000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=6400, clm=2 (1 core, 1 worker): 3.38 ms. Throughput: 295.91 iter/sec.
FFTlen=8000K all-complex, Type=3, Arch=8, Pass1=1280, Pass2=6400, clm=1 (1 core, 1 worker): 3.36 ms. Throughput: 297.30 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=896, Pass2=9216, clm=4 (1 core, 1 worker): 3.63 ms. Throughput: 275.67 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=896, Pass2=9216, clm=2 (1 core, 1 worker): 3.43 ms. Throughput: 291.42 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=896, Pass2=9216, clm=1 (1 core, 1 worker): 3.44 ms. Throughput: 290.56 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=7168, clm=2 (1 core, 1 worker): 3.37 ms. Throughput: 296.50 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=1152, Pass2=7168, clm=1 (1 core, 1 worker): 3.36 ms. Throughput: 297.39 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=6144, clm=2 (1 core, 1 worker): 3.43 ms. Throughput: 291.18 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=1344, Pass2=6144, clm=1 (1 core, 1 worker): 3.37 ms. Throughput: 296.44 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=5376, clm=2 (1 core, 1 worker): 3.52 ms. Throughput: 283.94 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=1536, Pass2=5376, clm=1 (1 core, 1 worker): 3.45 ms. Throughput: 290.02 iter/sec.
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=2304, Pass2=3584, clm=1 (1 core, 1 worker): 3.41 ms. Throughput: 293.30 iter/sec.
[Sun Mar 27 19:56:18 2022]
FFTlen=8064K all-complex, Type=3, Arch=8, Pass1=3072, Pass2=2688, clm=1 (1 core, 1 worker): 3.62 ms. Throughput: 276.30 iter/sec.
FFTlen=8192K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=8192, clm=2 (1 core, 1 worker): 3.43 ms. Throughput: 291.90 iter/sec.
FFTlen=8192K all-complex, Type=3, Arch=8, Pass1=1024, Pass2=8192, clm=1 (1 core, 1 worker): 3.62 ms. Throughput: 276.22 iter/sec.
FFTlen=8192K all-complex, Type=3, Arch=8, Pass1=2048, Pass2=4096, clm=1 (1 core, 1 worker): 3.46 ms. Throughput: 288.83 iter/sec.[/code]


All times are UTC. The time now is 06:56.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2022, Jelsoft Enterprises Ltd.