View Single Post
Old 2017-02-09, 11:31   #15
Just call me Henry
henryzz's Avatar
Sep 2007
Cambridge (GMT/BST)

23·739 Posts

Originally Posted by paulunderwood View Post
Thanks for the tips, nordi

Incidentally, I added gwset_num_threads ( &gwdata, 4 ); to every line with gwinit2(&gwdata, sizeof(gwhandle), (char *) GWNUM_VERSION); and got a multi-threaded PFGW working at 300% on my Haswell 4770k. Good for speedy proofs, but not so good for overall number-crunching throughput
It is more efficient the larger the FFT. There would be people using it if this feature was added to the main source. Please do rouge.
A larger fft can fit in the L3 cache with 1x4threads than 4x1threads
henryzz is offline   Reply With Quote