mersenneforum.org  

Old 2014-11-17, 20:44   #1277
Bdot
 
Nov 2010
Germany

3·199 Posts

Quote:
Originally Posted by AK76 View Post
One question: I noticed that kernel barrett32 is faster than barrett15. How do I change the mfakto settings to use "32"?
That is the new "GCN3" GPUType setting, coming with mfakto 0.15. In version 0.14, there is no way to make mfakto select your fastest kernel, cl_barrett32_76_gs, for 73- or 74-bit tests. Setting GPUType=VLIW5 gets you cl_barrett32_77_gs, which is already quite a bit faster than cl_barrett15_73_gs (for 73 bits) and a lot faster than cl_barrett15_82_gs (which would normally be used for 74-bit tests).
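For anyone who would rather script the GPUType change than edit the file by hand, here is a minimal sketch. It assumes mfakto.ini consists of plain key=value lines; the helper name set_ini_value and the sample text are made up for illustration and are not part of mfakto.

```python
import re


def set_ini_value(ini_text: str, key: str, value: str) -> str:
    """Return ini_text with key=value set, replacing the first existing line or appending one."""
    pattern = re.compile(rf"^{re.escape(key)}\s*=.*$", re.MULTILINE)
    new_line = f"{key}={value}"
    if pattern.search(ini_text):
        # A function replacement avoids any backslash interpretation in new_line.
        return pattern.sub(lambda _match: new_line, ini_text, count=1)
    separator = "" if ini_text.endswith("\n") or not ini_text else "\n"
    return ini_text + separator + new_line + "\n"


# Made-up excerpt for the demo, not a real mfakto.ini.
sample = "# illustrative excerpt\nGPUType=AUTO\nNumStreams=3\n"
updated = set_ini_value(sample, "GPUType", "VLIW5")
print(updated)
```

The same helper works for any other key mentioned in this thread, since it only touches the first line that starts with the given key.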
Old 2014-12-15, 18:47   #1278
Siegmund
 
Mar 2014

2×5² Posts

Bit of noob question here.

I have just gotten a new computer at work. I was using mfaktc on my previous computer without any difficulty. The new computer has an AMD 7570 video card (obviously it's an office machine, not a gaming machine; this is still the standard video card in new machines here) rather than an NVIDIA card, so I am now using mfakto for the first time.

I was able to start and configure it without any trouble, but GUI responsiveness is extremely bad. I changed NumStreams from 3 to 2 and then 1, and GridSize all the way down to 0 one step at a time, but it made very little difference: mfakto is still running almost as fast as before and the GUI is still very slow (~3 seconds to respond to a click, though the mouse position updates instantly). I experienced some modest lag before with mfaktc, but it was usually bearable. This is not.

Do I have any other configuration options to improve responsiveness, or do I just need to plan on only running mfakto at night?

mfakto is using cl_barrett32_77_gs_4 and reporting 52 GHz/day progress on an exponent.
Old 2014-12-15, 19:14   #1279
Xyzzy
 
"Mike"
Aug 2002

2²×29×71 Posts

Quote:
Originally Posted by Siegmund View Post
Do I have any other configuration options to improve responsiveness, or do I just need to plan on only running mfakto at night?
http://www.mersenneforum.org/showthread.php?t=18088
Old 2014-12-15, 19:43   #1280
Siegmund
 
Mar 2014

2×5² Posts

Reading that thread called my attention to changing GPUSieveSize, which was just what was needed. Thank you!

The throughput only changed by ~2% when I lowered it enough to regain control of the machine, so no need for me to actually use the switching program.

Last fiddled with by Siegmund on 2014-12-15 at 19:44
Old 2014-12-15, 19:44   #1281
Bdot
 
Nov 2010
Germany

1001010101₂ Posts

Quote:
Originally Posted by Siegmund View Post
Bit of noob question here.

I have just gotten a new computer at work. I was using mfaktc on my previous computer without any difficulty. The new computer has an AMD 7570 video card (obviously it's an office machine, not a gaming machine; this is still the standard video card in new machines here) rather than an NVIDIA card, so I am now using mfakto for the first time.

I was able to start and configure it without any trouble, but GUI responsiveness is extremely bad. I changed NumStreams from 3 to 2 and then 1, and GridSize all the way down to 0 one step at a time, but it made very little difference: mfakto is still running almost as fast as before and the GUI is still very slow (~3 seconds to respond to a click, though the mouse position updates instantly). I experienced some modest lag before with mfaktc, but it was usually bearable. This is not.

Do I have any other configuration options to improve responsiveness, or do I just need to plan on only running mfakto at night?

mfakto is using cl_barrett32_77_gs_4 and reporting 52 GHz/day progress on an exponent.
There are a few options to try.
First of all, the settings you mentioned are ignored when the GPU sieve is in use. With the GPU sieve, these parameters are the ones to tweak for better responsiveness: a low but non-zero FlushInterval (3, 2, 1), a lower GPUSieveSize, and a lower GPUSieveProcessSize should each help. I'd try them in this order, as it gives the best responsiveness gain per unit of performance lost.

On the other hand, depending on the CPU power you have available, you could also try switching to the CPU sieve (SieveOnGPU=0). If you have multiple CPU cores, you can run multiple mfakto instances, so that each of them can use a higher SievePrimes value, increasing the overall throughput. As most of the delays come from the GPU sieve kernel, this option may result in good responsiveness even at higher NumStreams and GridSize values. Plus: the GPU sieve kernel is not very efficient on your GPU. To run multiple instances, you can either create copies in different directories (necessary if you use MISFIT) or just use separate worktodo and ini files (see the -i option).
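A rough sketch of the "separate ini files" variant, in case it helps. It writes one ini copy per instance with SieveOnGPU=0 and builds a command line for each using the -i option mentioned above. The file naming, the executable name "mfakto", and the sample ini text are assumptions for illustration. Each copy would also need to point at its own worktodo file; that step is left out because the relevant ini key is not named in this thread.

```python
import tempfile
from pathlib import Path
from typing import List


def make_instance_inis(base_ini: Path, out_dir: Path, instances: int) -> List[Path]:
    """Write one ini copy per instance, each forcing SieveOnGPU=0."""
    kept = [
        line for line in base_ini.read_text().splitlines()
        if not line.strip().startswith("SieveOnGPU")
    ]
    kept.append("SieveOnGPU=0")  # CPU sieve, as suggested in the post
    paths = []
    for n in range(1, instances + 1):
        path = out_dir / f"mfakto-{n}.ini"  # hypothetical naming scheme
        path.write_text("\n".join(kept) + "\n")
        paths.append(path)
    return paths


def command_for(ini_path: Path, exe: str = "mfakto") -> List[str]:
    """Build an argument list; -i comes from the post, the exe name is assumed."""
    return [exe, "-i", str(ini_path)]


with tempfile.TemporaryDirectory() as tmp:
    tmp_dir = Path(tmp)
    base = tmp_dir / "mfakto.ini"
    base.write_text("SieveOnGPU=1\nNumStreams=3\n")  # made-up excerpt
    inis = make_instance_inis(base, tmp_dir, instances=2)
    contents = [p.read_text() for p in inis]
    commands = [command_for(p) for p in inis]

for cmd in commands:
    print(" ".join(cmd))
```

The argument lists could be handed to subprocess.Popen to start the instances. SievePrimes is left untouched here, since a good value depends on the CPU.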

Last fiddled with by Bdot on 2014-12-15 at 19:47 Reason: Plus: the GPU sieve kernel is not very efficient on your GPU.
Old 2014-12-18, 04:20   #1282
kracker
 
"Mr. Meeseeks"
Jan 2012
California, USA

2³×271 Posts

Hmm... with the latest 14.12 (Omega) drivers I'm getting a few errors on -st2.
Code:
Selftest statistics                                    
  number of tests           335478
  successful tests          335469
  no factor found           9

selftest FAILED!

ERROR: selftest failed, exiting.
Code:
no factor for M67094119 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M45448679 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M30568231 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M71065531 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M72067427 from 2^82 to 2^83 [mfakto 0.15pre5-MGW cl_barrett15_83_gs_2]
no factor for M52031087 from 2^82 to 2^83 [mfakto 0.15pre5-MGW cl_barrett15_83_gs_2]
no factor for M49346867 from 2^82 to 2^83 [mfakto 0.15pre5-MGW cl_barrett15_83_gs_2]
no factor for M45588523 from 2^87 to 2^88 [mfakto 0.15pre5-MGW cl_barrett15_88_gs_2]
no factor for M71115521 from 2^87 to 2^88 [mfakto 0.15pre5-MGW cl_barrett15_88_gs_2]
Radeon R9 285
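To see at a glance which kernels are affected, the "no factor" lines above can be tallied by kernel name. This is only a small sketch that assumes the line format shown in the listing; the regular expression and the function name are my own, not anything shipped with mfakto.

```python
import re
from collections import Counter

# Matches lines of the form shown above:
# no factor for M<exponent> from 2^<lo> to 2^<hi> [<program> <version> <kernel>]
LINE_RE = re.compile(
    r"no factor for M(\d+) from 2\^(\d+) to 2\^(\d+) \[(\S+) (\S+) (\S+)\]"
)


def failures_by_kernel(log_text: str) -> Counter:
    """Count failed self-test lines per kernel name."""
    counts = Counter()
    for match in LINE_RE.finditer(log_text):
        counts[match.group(6)] += 1
    return counts


log = """\
no factor for M67094119 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M45448679 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M30568231 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M71065531 from 2^81 to 2^82 [mfakto 0.15pre5-MGW cl_barrett15_82_gs_2]
no factor for M72067427 from 2^82 to 2^83 [mfakto 0.15pre5-MGW cl_barrett15_83_gs_2]
no factor for M52031087 from 2^82 to 2^83 [mfakto 0.15pre5-MGW cl_barrett15_83_gs_2]
no factor for M49346867 from 2^82 to 2^83 [mfakto 0.15pre5-MGW cl_barrett15_83_gs_2]
no factor for M45588523 from 2^87 to 2^88 [mfakto 0.15pre5-MGW cl_barrett15_88_gs_2]
no factor for M71115521 from 2^87 to 2^88 [mfakto 0.15pre5-MGW cl_barrett15_88_gs_2]
"""

counts = failures_by_kernel(log)
for kernel, count in sorted(counts.items()):
    print(kernel, count)
```

All nine failures (335478 tests minus 335469 successful) land in cl_barrett15 kernels: four in the 82-bit one, three in the 83-bit one, and two in the 88-bit one.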
Old 2014-12-18, 18:32   #1283
Bdot
 
Nov 2010
Germany

3×199 Posts

Quote:
Originally Posted by kracker View Post
Hmm... with the latest 14.12 (Omega) drivers I'm getting a few errors on -st2.
Oh no ... I know why I avoided this version so far.

But I guess I will need to make it work sooner or later. Hopefully they did not reduce the precision. The failing kernels are the ones that are closest to the limits of what "float" gives you. Can you check whether forcing it to GCN2 makes it succeed?
Old 2014-12-21, 16:20   #1284
kracker
 
"Mr. Meeseeks"
Jan 2012
California, USA

2³·271 Posts

Quote:
Originally Posted by Bdot View Post
Oh no ... I know why I avoided this version so far.

But I guess I will need to make it work sooner or later. Hopefully they did not reduce the precision. The failing kernels are the ones that are closest to the limits of what "float" gives you. Can you check whether forcing it to GCN2 makes it succeed?
Switching to GCN2 passes all tests with -st2.
Old 2014-12-22, 18:04   #1285
Bdot
 
Nov 2010
Germany

1125₈ Posts

Quote:
Originally Posted by kracker View Post
Switching to GCN2 passes all tests with -st2.
Thank you for the test; it more or less confirms a change in AMD's single-precision calculations. I will still debug it to see exactly where the error gets too big, but the solution seems to point to double precision for these kernels.
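As a generic illustration of how little headroom single precision leaves (this is not mfakto kernel code, just standard IEEE 754 behaviour reproduced on the CPU): binary32 carries 24 significant bits, so 2^24 + 1 is the first integer it cannot hold exactly, while binary64 still can. The helper names are made up for the demo.

```python
import struct


def to_float32(x: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def relative_error_f32(x: float) -> float:
    """Relative error introduced by storing x in single precision."""
    return abs(to_float32(x) - x) / abs(x)


n = 2**24 + 1
as_double = float(n)               # exact in double precision
as_single = to_float32(as_double)  # rounded to the nearest binary32 value

print(as_double == n)
print(as_single == n)
print(relative_error_f32(1 / 3))
```

The first comparison holds and the second does not, because the single-precision value rounds back to 2^24. A kernel whose intermediate values sit right at that 24-bit boundary has no spare bits to absorb a slightly different rounding in the driver, which is consistent with double precision being the suggested way out.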
Old 2015-01-02, 12:49   #1286
Bdot
 
Nov 2010
Germany

3·199 Posts

Quote:
Originally Posted by kracker View Post
Switching to GCN2 passes all tests with -st2.
Did you check the performance of the 14.12 driver with mfakto 0.14?

My HD7950 dropped from 430 GHz with the older driver to 240 GHz after installing 14.12! Still 100% load and similar power consumption. The GCN binary shows slightly higher register usage (up from 52 to 59). The new binary code is even a bit smaller and does not have bad memory access patterns. I'm completely puzzled about what's going on.

But most importantly, I cannot reproduce the -st2 failure! Both 0.14 and 0.15pre5 find all factors!

With that, I will make a change that I have wanted to make for a long time (some MODBASECASE check error that only occurs in these 6x15-bit kernels). When done, you will need to test for me ...

But I'll try to get rid of this driver as soon as possible! Maybe a little more checking with AMD's CodeXL, but if that cannot tell me why the performance is so poor, I can only recommend staying below 14.12.
Old 2015-01-02, 22:20   #1287
Bdot
 
Nov 2010
Germany

1125₈ Posts

Hmm. Weird. Now it is at 424 GHz ... I'll monitor it. Looks like something ran in the background ...

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.