![]() |
Round off error problems
1 Attachment(s)
Hello guys! Hopefully everyone is enjoying or enduring the weather. It's bad weather here at Iowa.
I was running cudalucas 2.06beta on an RTX 2080, but it seems like I kept on getting this error [CODE]Round off error at iteration = 22746300, err = 0.375 > 0.35, fft = 4704K.[/CODE] majority of the time it goes back to normal, [CODE]Round off error at iteration = 22746300, err = 0.375 > 0.35, fft = 4704K. Restarting from last checkpoint to see if the error is repeatable. Using threads: square 256, splice 128. Continuing M87515717 @ iteration 22740001 with fft length 4704K, 25.98% done | Feb 26 21:05:49 | M87515717 22750000 0x600e2a5aac882d3a | 4704K 0.34375 4.2361 42.35s | 3:04:14:02 25.99% | Looks like the error went away, continuing.[/CODE] but sometimes the error repeats, [CODE]Round off error at iteration = 5632400, err = 0.375 > 0.35, fft = 4704K. Restarting from last checkpoint to see if the error is repeatable. Using threads: square 256, splice 128. Continuing M87515717 @ iteration 5630001 with fft length 4704K, 6.43% done Round off error at iteration = 5632400, err = 0.35938 > 0.35, fft = 4704K. The error persists. Trying a larger fft until the next checkpoint. Using threads: square 256, splice 128. Continuing M87515717 @ iteration 5630001 with fft length 5120K, 6.43% done | Feb 26 00:11:57 | M87515717 5640000 0x8692fae95ad89471 | 5120K 0.03906 4.0596 40.59s | 3:20:19:49 6.44% | Resettng fft.[/CODE] I understand the error is not a big issue. But the frequency that this is occurring is alarmingly high and it concerns me. This GPU just finished a DC right before this assignment for [URL="https://www.mersenne.org/report_exponent/?exp_lo=50153029&full=1"]M50153029[/URL]. But, is a LL residue from an LL test like this still trustworthy after it is completed? Any suggestions how to resolve this problem? Even if reliability is not an issue, I would say these errors are using excessive computation time since it has to rollback to the previous checkpoint. I have attached partial log for an LL test for [URL="https://www.mersenne.org/report_exponent/?exp_lo=87515717&full=1"]M87515717[/URL], from 6.43% to 26.09%. Much thanks! [ATTACH]19958[/ATTACH] |
A roundoff error < 0.4 simply means you are testing an exponent near the limits of what that FFT size can support. Your hardware and end result are just fine.
Your choice is to endure the rollbacks of force using a larger FFT size (I don't know how to do that). |
[QUOTE=Prime95;509558]A roundoff error < 0.4 simply means you are testing an exponent near the limits of what that FFT size can support. Your hardware and end result are just fine.
Your choice is to endure the rollbacks of force using a larger FFT size (I don't know how to do that).[/QUOTE] Okay, thanks for the clarification. Good to know that the roundoff is fine, now gonna figure out how to force a larger size FFT. Thanks again! |
[QUOTE=dcheuk;509559]Okay, thanks for the clarification. Good to know that the roundoff is fine, now gonna figure out how to force a larger size FFT.
Thanks again![/QUOTE] Oh duh, all I have to do to force FFT length increase is to enter F into the console and then hit enter. lol stupid me UPDATE: increasing the FFT length seems to have solved the problem. No more errors! yay. Thanks. |
[QUOTE=dcheuk;509554]Hello guys! Hopefully everyone is enjoying or enduring the weather. It's bad weather here at Iowa.
I was running cudalucas 2.06beta on an RTX 2080, but it seems like I kept on getting this error [CODE]Round off error at iteration = 22746300, err = 0.375 > 0.35, fft = 4704K.[/CODE]majority of the time it goes back to normal, [CODE]Round off error at iteration = 22746300, err = 0.375 > 0.35, fft = 4704K. Restarting from last checkpoint to see if the error is repeatable. Using threads: square 256, splice 128. Continuing M87515717 @ iteration 22740001 with fft length 4704K, 25.98% done | Feb 26 21:05:49 | M87515717 22750000 0x600e2a5aac882d3a | 4704K 0.34375 4.2361 42.35s | 3:04:14:02 25.99% | Looks like the error went away, continuing.[/CODE]but sometimes the error repeats, [CODE]Round off error at iteration = 5632400, err = 0.375 > 0.35, fft = 4704K. Restarting from last checkpoint to see if the error is repeatable. Using threads: square 256, splice 128. Continuing M87515717 @ iteration 5630001 with fft length 4704K, 6.43% done Round off error at iteration = 5632400, err = 0.35938 > 0.35, fft = 4704K. The error persists. Trying a larger fft until the next checkpoint. Using threads: square 256, splice 128. Continuing M87515717 @ iteration 5630001 with fft length 5120K, 6.43% done | Feb 26 00:11:57 | M87515717 5640000 0x8692fae95ad89471 | 5120K 0.03906 4.0596 40.59s | 3:20:19:49 6.44% | Resettng fft.[/CODE]I understand the error is not a big issue. But the frequency that this is occurring is alarmingly high and it concerns me. This GPU just finished a DC right before this assignment for [URL="https://www.mersenne.org/report_exponent/?exp_lo=50153029&full=1"]M50153029[/URL]. But, is a LL residue from an LL test like this still trustworthy after it is completed? Any suggestions how to resolve this problem? Even if reliability is not an issue, I would say these errors are using excessive computation time since it has to rollback to the previous checkpoint. I have attached partial log for an LL test for [URL="https://www.mersenne.org/report_exponent/?exp_lo=87515717&full=1"]M87515717[/URL], from 6.43% to 26.09%. Much thanks! [ATTACH]19958[/ATTACH][/QUOTE] I would genuinely advise you to conduct a FFT benchmark and thread benchmark as 5120K doesn't seem to be the fastest FFT in my case (I'm using Pascal/volta so I am not sure about Turing optimization and how it deals with FFT, with 5184K FFT being near the speed of 4608K and much faster than 5120K, it is also able to tolerate higher exponents than 5120K so this is highly advised as you can speed up your work and increase efficiency). Then you would just go to the CUDALucas.ini file to change the FFT at the very bottom as well as the thread as shown in the benchmark list. Refer to the instruction in the ini file for how to input the values. |
[QUOTE=xx005fs;509566]I would genuinely advise you to conduct a FFT benchmark and thread benchmark as 5120K doesn't seem to be the fastest FFT in my case (I'm using Pascal/volta so I am not sure about Turing optimization and how it deals with FFT, with 5184K FFT being near the speed of 4608K and much faster than 5120K, it is also able to tolerate higher exponents than 5120K so this is highly advised as you can speed up your work and increase efficiency). Then you would just go to the CUDALucas.ini file to change the FFT at the very bottom as well as the thread as shown in the benchmark list. Refer to the instruction in the ini file for how to input the values.[/QUOTE]
Alright, understood, gonna read readme and run the benchmark. You're right I noticed that surprisingly after increasing the FFT size the time to complete each iteration decreased lol |
[QUOTE=dcheuk;509593]Alright, understood, gonna read readme and run the benchmark.
You're right I noticed that surprisingly after increasing the FFT size the time to complete each iteration decreased lol[/QUOTE] I suggest you read kriesel's newer, revised readme: [url]https://www.mersenneforum.org/showpost.php?p=503784&postcount=6[/url] |
| All times are UTC. The time now is 14:58. |
Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.