mersenneforum.org

mersenneforum.org (https://www.mersenneforum.org/index.php)
-   GPU Computing (https://www.mersenneforum.org/forumdisplay.php?f=92)
-   -   mfaktc: a CUDA program for Mersenne prefactoring (https://www.mersenneforum.org/showthread.php?t=12827)

Xyzzy 2020-09-28 11:46

[QUOTE=kriesel;558068]Maybe post [URL="https://www.mersenneforum.org/showpost.php?p=525178&postcount=3189"]3189[/URL]'s attachment will help (also linked to at 3208).[/QUOTE]Thanks!

:tu:

storm5510 2020-09-28 14:03

[QUOTE=DrobinsonPE;558069]The version of mfaktc I am using is the linux compiled version I found at this post [URL]https://mersenneforum.org/showpost.php?p=526922&postcount=3208[/URL]

The mfaktc.ini file was already configured with the settings GPUSieveProcessSize=32, GPUSieveSize=2047, GPUSievePrimes=82486.[/QUOTE]

I checked the archives I have downloaded to date. None have [I]mfaktc.ini[/I]. Only the binaries. I will do some more digging.

[U]Edit[/U]: I found it in the Linux archive in the link above. The settings were mostly like the quoted above. The best 0.1% time was 1.876 seconds. 3349 GHz-d/day.

Neutron3529 2020-10-01 05:59

[QUOTE=James Heinrich;557962]If you have a completed TF run could you please submit the data so I can calculate the values for RTX 3080?
[URL]https://www.mersenne.ca/mfaktc.php#benchmark[/URL][/QUOTE]


borrowed a machine with 4 RTX 3090 and found a quite strange thing:
the GPU util cannot reach 100% even I am running a single mfaktc in a single GPU


uploaded 2 result, and may try gpuowl later

Neutron3529 2020-10-01 07:16

1 Attachment(s)
[QUOTE=Neutron3529;558427]borrowed a machine with 4 RTX 3090 and found a quite strange thing:
the GPU util cannot reach 100% even I am running a single mfaktc in a single GPU

uploaded 2 result, and may try gpuowl later[/QUOTE]
results are uploaded, but do not tried gpuowl since `gmpxx.h` not found.


all the results generate by mfaktc is here:
[ATTACH]23449[/ATTACH]

moebius 2020-10-01 07:52

[QUOTE=Neutron3529;558427]uploaded 2 result, and may try gpuowl later[/QUOTE]
Please make a short gpuowl benchmark with the exponent 77936867, so that we can directly compare the values ​​of the graphics cards, thank you.
[URL="https://mersenneforum.org/showthread.php?p=558317#post558317"]https://mersenneforum.org/showthread.php?p=558317#post558317
[/URL]

kriesel 2020-10-01 13:28

[QUOTE=Neutron3529;558427]borrowed a machine with 4 RTX 3090 and found a quite strange thing:
the GPU util cannot reach 100% even I am running a single mfaktc in a single GPU[/QUOTE]I would expect that. Fast gpus need multiple instances as well as large gpusieveprimes and other tuning typically. Tune with a single instance first, then test performance versus number of tuned instances is the approach I use. The effect seems to be stronger, the faster the gpu is. Solid state disk or ramdisk might help also.

James Heinrich 2020-10-01 13:51

[QUOTE=Neutron3529;558427]borrowed a machine with 4 RTX 3090 and found a quite strange thing: the GPU util cannot reach 100% even I am running a single mfaktc in a single GPU[/QUOTE]This is normal on high-performance GPUs. 1080 will get to about 95%, 2080 will get to about 80% (apparently 30x0 same). The GPU is just too fast, the little bit that the CPU does can't keep up. In production use running two instances of mfaktc should allow optimal throughput (splitting the CPU load across two cores).

storm5510 2020-10-01 14:12

1 Attachment(s)
[QUOTE=James Heinrich;558473]This is normal on high-performance GPUs. 1080 will get to about 95%, 2080 will get to about 80% (apparently 30x0 same). The GPU is just too fast, the little bit that the CPU does can't keep up. In production use running two instances of mfaktc should allow optimal throughput (splitting the CPU load across two cores).[/QUOTE]


There is a solution for this. I do not know about Linux, but with Windows it is possible to set the CPU speed based on a percentage of its capability. Default minimum is something like 5%. It will not respond to a quick pulse very much. Set it to 85%, for example, with no load and it will respond much faster. I noticed that when I have [I]Prime95[/I] running, my GPU performance, with [I]mfaktc[/I], increased considerably.

kriesel 2020-10-01 17:02

[QUOTE=kriesel;558466]Fast gpus need multiple instances as well as large [STRIKE]gpusieveprimes[/STRIKE] and other tuning typically. Tune with a single instance first, then test performance versus number of tuned instances is the approach I use. The effect seems to be stronger, the faster the gpu is. Solid state disk or ramdisk might help also.[/QUOTE]Oops, meant large GpuSieveSize there.
Needing multiple instances for full performance is typical for gpus faster than ~GTX1050Ti, even with prime95 fairly fully utilizing the cpu, keeping cpu clock rates at highest sustainable levels, and on -gpu sieving enabled in mfaktc. The faster the gpu the more it matters. Two instances does a pretty good job on some gpu models; I use 3 instances to get the most throughput from GTX2080x. So it's no surprise the GTX3080 is underutilized with a single instance. Also, was the GTX3080 mfaktc test thoroughly tuned?

I think the lower than 100% utilization in mfaktc has to do with time for saving checkpoint files and generating console output, and activities that may be limited by pcie bandwidth. Running multiple instances lets gpu resources work on something in one instance while another instance is waiting for the cpu side of mfaktc and the OS to get things done occasionally and communication across pcie to occur.
For comparison, GTX1080Ti shows 98% utilization in gpuowl with one instance.
Mfakto shows much less effect of tuning than mfaktc for equivalent gpu speed. So maybe it has to do with CUDA call overhead.

For more, see detailed mfaktc tune analyses on GTX1080Ti and RTX2080 Super [URL="https://www.mersenneforum.org/showpost.php?p=526899&postcount=8"]here[/URL]. I saw 90% utilization with 256 gpusievesize on RTX2080Super, but 2047 gpusievesize and a good tune otherwise boosted it a lot.

storm5510 2020-10-04 14:47

I just began using [I]Ubuntu 20.04 LTS[/I]. The archive [COLOR=DarkRed]mfaktc-0.21-linux64.cuda10.1-gpusievesize2047.tar.gz[/COLOR] does not contain the libraries needed to run. Where can I find them?

Viliam Furik 2020-10-04 15:28

[QUOTE=storm5510;558854]I just began using [I]Ubuntu 20.04 LTS[/I]. The archive [COLOR=DarkRed]mfaktc-0.21-linux64.cuda10.1-gpusievesize2047.tar.gz[/COLOR] does not contain the libraries needed to run. Where can I find them?[/QUOTE]

You can often find CUDA DLLs by googling them. [URL="https://www.dll-files.com/cudart64_101.dll.html"]This[/URL] should work, and shouldn't have viruses (99% sure it doesn't, but do as you wish).


All times are UTC. The time now is 22:00.

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.