![]() |
|
|
#100 | |
|
May 2011
Orange Park, FL
88510 Posts |
Quote:
|
|
|
|
|
|
|
#101 |
|
Einyen
Dec 2003
Denmark
C5616 Posts |
In build 5 I had a PRP test at 88.22M where it immediately chose 4704K FFT. It had SoftCrossoverAdjust=-0.004 in prime.txt
Then another instance had a PRP test at 88.26M where it started to test 4608K FFT and when that had the average roundoff error 0.30835 it chose 4800K FFT. Granted that instance had SoftCrossover=1.0 and SoftCrossoverAdjust=-0.008 in prime.txt, but why did it not test 4704K FFT? I now removed all SoftCrossover and SoftCrossoverAdjust from all instances, it was a leftover from when I added it to build 3 when I got so many roundoff > 0.4 errors. Last fiddled with by ATH on 2018-12-16 at 18:09 |
|
|
|
|
|
#102 |
|
Einyen
Dec 2003
Denmark
2×1,579 Posts |
There might be a problem with choosing the best FFT parameters after restarting mprime, see here:
https://www.mersenneforum.org/showpo...&postcount=133 |
|
|
|
|
|
#103 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
2·3,767 Posts |
29.5 build 6.
1) Fixed testing tiny numbers with AVX-512 FFTs, numbers less than about 5120 bits are forced to use FMA FFTs. 2) Torture test dialog box allows picking AVX-512, AVX2, AVX, SSE2 FFTs. 3) Changed FFT crossovers. ATH, let me know if this is better. 4) Eliminated soft crossovers. Instead, I hope to get the crossovers right in the gwnum code. Soft crossovers can be re-enabled (see undoc.txt). 5) Eliminated a memory leak when a benchmark is interrupted. 6) In linux a pid file is created (mprime.pid). 7) New swizzle code is used. FFTs should be about 0% faster. Well, maybe small FFTs that run in the L2 cache will see a tiny speed bump. Not fixed: 1) The hangs reported in benchmarking. Please try again with this release. I cannot get it to happen in Linux. Is this a Windows-only issue? Does this only happen benchmarking multi-threaded FFTs? Is CPU usage 0% at the time of the hang? Hangs are usually due to a deadlock. Benchmarks use locks to sync the start of all the worker threads. Multi-threaded FFTs also use locks to coordinate threads. Linux 64-bit: ftp://mersenne.org/gimps/p95v295b6.linux64.tar.gz Windows 64-bit: ftp://mersenne.org/gimps/p95v295b6.win64.zip |
|
|
|
|
|
#104 | ||
|
Sep 2003
5×11×47 Posts |
Wait, does this build include the Chuck fix? You didn't mention anything about that (saving registers).
Quote:
Quote:
|
||
|
|
|
|
|
#105 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
2×3,767 Posts |
|
|
|
|
|
|
#106 |
|
Romulan Interpreter
Jun 2011
Thailand
25B516 Posts |
|
|
|
|
|
|
#107 | |
|
"/X\(‘-‘)/X\"
Jan 2013
2×5×293 Posts |
Quote:
|
|
|
|
|
|
|
#108 | |
|
"TF79LL86GIMPS96gpu17"
Mar 2017
US midwest
2×32×7×43 Posts |
Quote:
set allowed memory to 8192M. Set throughput benchmark to do 2048 - 32768K, 1-3,6 workers, hyperthreaded too. First try, stalls early in 2048K. At this point there was an mfakto instance on the UHD630 IGP and mfaktc instance on the GTX1050Ti gpu. Halt mfakto on UHD630 for second try. Another benchmark stall early in 2048k. Halt mfaktc and retry. Application crash. Retry; benchmark stall again. Try #4: nearly finished 2560K before stalling, 8 minutes is longest of 4 tries before stall or crash. Try 5: no hyperthreading, made it to 23040K before stalling. Try 6 starting from 23040K reached 32768K. Last fiddled with by kriesel on 2019-01-04 at 10:12 |
|
|
|
|
|
|
#109 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
2×3×1,693 Posts |
Quote:
i7-6700K, 4300 MHz, 16 GiB RAM, 3200 MHz, Win 7 Pro I will try the latest version when I am able to check on it sooner than after work (~10 hours.) Last fiddled with by kladner on 2019-01-04 at 11:42 |
|
|
|
|
|
|
#110 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
753410 Posts |
Aaargh! Why can't I reproduce the hang! Makes debugging much harder.
For those that can reproduce it quickly, does a normal torture test or daily work ever hang? If you add "TortureTestThreads=<# of logical cores>" to prime.txt and then run a torture test with just one worker, does that hang? In the meantime, I'll re-examine the bench code. |
|
|
|