mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > PrimeNet

Reply
 
Thread Tools
Old 2011-05-05, 16:51   #12
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

19×613 Posts
Default

I should add one caveat to my assertions above: If one spends a significant time of a transform-based squaring in the DWT-weighting and output-normalization steps, then there might indeed be a modest savings from doing a longer FFT with smaller input words which allow for 2 or more dyadic multiplies in a row.

I haven't donw precise timings in a while but I seem to recall the typical overhead for the DWT-weight/unweight and normalize-and-propagate-carries is 10-20% of the overall modmul time.

For my Mlucas code, there is a roughly 10% additional overhead in Mersenne-mod mode due to "wrapper step" needed around the dyadic-mul, to convert the outputs of the fundamentally complex-input FFT I use to ones reflective of a real-vector FFT. (I didn't want to code 2 separate FFTs for e.g. Fermat-mod and Mersenne-mod arithmetic...Fermat-mod uses a straight complex FFT if one uses the right-angle-transform weighting trick).
ewmayer is online now   Reply With Quote
Old 2011-05-05, 17:07   #13
cheesehead
 
cheesehead's Avatar
 
"Richard B. Woods"
Aug 2002
Wisconsin USA

22·3·641 Posts
Default

Quote:
Originally Posted by ewmayer View Post
Fermat-mod uses a straight complex FFT if one uses the right-angle-transform weighting trick).
Careful! GIMPS may lose a lot of Republicans if that comment gets out!
cheesehead is offline   Reply With Quote
Reply



All times are UTC. The time now is 22:27.


Fri Aug 6 22:27:10 UTC 2021 up 14 days, 16:56, 1 user, load averages: 2.89, 3.24, 3.19

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.