![]() |
[QUOTE=TheJudger;221247]Hi Aillas,
no, not really. :sad: Did you try the examples from the CUDA SDK? Oliver[/QUOTE] No, just take mfakt sources and compile it. Maybe QUATTRO cards are a bit different. I will test the next version :smile: I will try sample later... Thanks. |
1 Attachment(s)
Hello everybody,
find attached mfaktc 0.10. :smile: Highlights of this version: - two new runtime options: Stages and StopAfterFactor (see mfaktc.ini for details) - modified the stream scheduling (suggested by Ethan). Older versions assumed the the streams are processed in the way there were issued. The new way improves the performance a little bit in some cases (e.g. multiple instances of mfaktc) and narrows the gap between Windows and Linux (but doesn't solve the Windows / CUDA 3.1 / 25x.xx driver bug?! :sad:) - threads per grid is determined during runtime based on the number of multiprocessors of the GPU. This was necessary since Nvidia releases more and more GPUs with a non-power-of-two number of multiprocessors... Oliver |
Looking forward for a x64 binary for sm_11 arch :smile:
|
[QUOTE=TheJudger;223027]Hello everybody,
find attached mfaktc 0.10. :smile: Highlights of this version: - two new runtime options: Stages and StopAfterFactor (see mfaktc.ini for details) - modified the stream scheduling (suggested by Ethan). Older versions assumed the the streams are processed in the way there were issued. The new way improves the performance a little bit in some cases (e.g. multiple instances of mfaktc) and narrows the gap between Windows and Linux (but doesn't solve the Windows / CUDA 3.1 / 25x.xx driver bug?! :sad:) - threads per grid is determined during runtime based on the number of multiprocessors of the GPU. This was necessary since Nvidia releases more and more GPUs with a non-power-of-two number of multiprocessors... Oliver[/QUOTE] I assume that now I can finish my exponents after having switched to 0.10... and have a little boost? Luigi |
Hello Luigi,
[QUOTE=ET_;223035]I assume that now I can finish my exponents after having switched to 0.10... and have a little boost?[/QUOTE] mfaktc only accepts checkpoint files which were written by the same version. So when you've started your test with 0.09 you have to finish it with 0.09 or restart from scratch with 0.10. About the little performance boost... you're talking about your GTX 275 and running Linux? I would that this is [B]not[/B] a configuration where you'll see the little improvement. [B]Spoiler alert:[/B] But don't be too sad, you'll see an improvement with the next version! 0.11 has a faster sieve. I've tested SievePrimes 20.000, 30.000 and 40.000, in all cases the new sieve does ~25% more throughput (good for users with GTX 4xx). On the other hand this could be used to increase SievePrimes which will remove more candidates during sieving. On my system the sieve of 0.10 with SievePrimes=20.000 is capable to generate ~89M/s candidates and the new 0.11 does the same speed with SievePrimes=40.000 (which yields 3-4% more candidates removed ==> 3-4% overall speed increase). :smile: Now the bad news: - 0.11 need more testing - you have to wait a little bit :razz: Oliver |
Hi,
the new version is working on my config: Config: Ubuntu 10.04 nvidia driver 256.35 CUDA 3.1 GPU: NVIDIA QUATTRO 140M So for now I'm trying it on 3321931967,76,77 (I need to reserve it). I have a question. When I run mfaktc 0.10, my computer is unusable. It seems mfacktc is using all GPU power and so, I can't use my computer (to login or to open a window). I also notice that it use one of the core to 100%. Is it normal? Thanks |
[QUOTE=Aillas;223160]Hi,
the new version is working on my config: Config: Ubuntu 10.04 nvidia driver 256.35 CUDA 3.1 GPU: NVIDIA QUATTRO 140M So for now I'm trying it on 3321931967,76,77 (I need to reserve it). I have a question. When I run mfaktc 0.10, my computer is unusable. It seems mfacktc is using all GPU power and so, I can't use my computer (to login or to open a window). I also notice that it use one of the core to 100%. Is it normal? Thanks[/QUOTE] Hi Aillas, I presume that you are running Ubuntu 64 bits. I experimented the same behavior when using mfaktc with Ubuntu_64 9.10 and nVidia GTX 275: the graphical interface is nearly unusable. It doesn't affect my life, thanks to the resume file: when I need to access my desktop, I turn mfaktc off... :smile: As for the CPU usage, the program uses two sections: one runs on the GPU, the other (the siever IIRC) tries to keep up with the GPU, preparing presieved intervals of testing factors. So in short, yes, it is quite normal that one core is kept busy during elaboration. Now check out your exponent [URL="http://www.mersenneforum.org/showthread.php?p=223042#post223042"]here[/URL], and remember to post your results [URL="http://www.mersenneforum.org/showthread.php?p=223052#post223052"]here[/URL]. Good luck! Luigi |
[QUOTE=ET_;223162]Hi Aillas, I presume that you are running Ubuntu 64 bits.
I experimented the same behavior when using mfaktc with Ubuntu_64 9.10 and nVidia GTX 275: the graphical interface is nearly unusable. It doesn't affect my life, thanks to the resume file: when I need to access my desktop, I turn mfaktc off... :smile: As for the CPU usage, the program uses two sections: one runs on the GPU, the other (the siever IIRC) tries to keep up with the GPU, preparing presieved intervals of testing factors. So in short, yes, it is quite normal that one core is kept busy during elaboration. Now check out your exponent [URL="http://www.mersenneforum.org/showthread.php?p=223042#post223042"]here[/URL], and remember to post your results [URL="http://www.mersenneforum.org/showthread.php?p=223052#post223052"]here[/URL]. Good luck! Luigi[/QUOTE] Thanks for the information. Ok, I will let it run so. PS: It's the 32 bit version of Ubuntu. Is it a problem ? |
Hello Aillas,
[QUOTE=Aillas;223160]I have a question. When I run mfaktc 0.10, my computer is unusable. It seems mfacktc is using all GPU power and so, I can't use my computer (to login or to open a window). I also notice that it use one of the core to 100%. Is it normal? [/QUOTE] 100% CPU usage is normal. This is a bit ugly on "slow GPUs" because it wastes CPU cycles but on "fast GPUs" this is OK because you can't have enough CPU power for the sieve on "fast GPUs". About the unusability... this is normal, too. You can try to - lower the THREADS_PER_GRID_MAX to e.g. 1<<16 (params.h, requiers recompile) - run only one stream (NumStreams=1 in mfaktc.ini) This seems to depend on the GPU, too. "slow GPUs" have a higher runtime on a single kernel launch and the GPU can only process one thing at once... so there are no GUI updates while the kernel runs. Faster GPU = lower runtime per kernel => more GUI updates per second. It seems that those Geforce 4xx series are [B]much[/B] better than their predecessors in this situation. :smile: I know, this doesn't help you, sorry. :sad: Oliver |
[QUOTE=Aillas;223163]PS: It's the 32 bit version of Ubuntu. Is it a problem ?[/QUOTE]
It should work on 32bit, too. I tried some 32bit builds on my 64bit Linux and they seem to work as expected. Just the sieve is ~33% slower on 32bit but this won't hurt you. I think you're hitting SievePrimes=100000 easily, right? Oliver |
This is the standard behavior. So it's ok for me. I didn't want to run the program many days for nothing.
Now, I'm curious how many days it will take to sieve 3321931967 from 76 to 77 bit on a Quatro 140 M. Maybe next time I should try exponants in the new lower range... Thanks for your support. Ludovic |
| All times are UTC. The time now is 22:50. |
Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.