mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2011-12-06, 15:36   #1398
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3×29×83 Posts
Default

Do you know why 2.1 is worse than 2.0?

Edit: So a 460's slower than a 465/470/480 by nature of its compute compatibility.... http://developer.nvidia.com/cuda-gpus

Last fiddled with by Dubslow on 2011-12-06 at 15:39
Dubslow is offline   Reply With Quote
Old 2011-12-06, 15:59   #1399
TheJudger
 
TheJudger's Avatar
 
"Oliver"
Mar 2005
Germany

45716 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Do you know why 2.1 is worse than 2.0?

Edit: So a 460's slower than a 465/470/480 by nature of its compute compatibility.... http://developer.nvidia.com/cuda-gpus
ILP (instruction-level parallelism) for CC 2.1. One could say that they saved one instruction scheduler.
mfaktc has a lot of dependent instructions (carry flag) so ILP doesn't work here. Of course I could write a kernel without the use of the carry flag but my guess is that it is much slower on all archtictures.

CC 2.x is much better than CC 1.x for mfaktc because 2.x can do int32 multiplication native while CC 1.x can't.

Oliver
TheJudger is offline   Reply With Quote
Old 2011-12-06, 16:06   #1400
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Hmm. I understand the third line, but the first two are beyond me...

:)
And that's why I'm not the developer.
Dubslow is offline   Reply With Quote
Old 2011-12-06, 20:33   #1401
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

2·3·1,693 Posts
Default Timing run results #2

To James Heinrich:

This is from my partner's machine. It's an i7 920, 2811MHz, 3GB RAM, XP 32bit. I did turn off the Turbo for this run.

Code:
Asus 9600 GT, fanless
GPU @ 650MHz
Usage 99%
Factor M54097591, No factor found, 70-71
Time/class 18.08s
Total Time 4h 51m 42s
Affinity not set
3,3,3 on the Streams and GridSize
AllowSleep=0
Obviously, the CPU is twiddling its non-existent thumbs waiting on this card. SP stuck at 200,000. avg wait 10,300. I just threw this in to give a low end marker.

This box is going to be Win7-64 with 9GB RAM before too much longer.
kladner is offline   Reply With Quote
Old 2011-12-07, 11:28   #1402
nucleon
 
nucleon's Avatar
 
Mar 2003
Melbourne

20316 Posts
Default

Quote:
Originally Posted by TheJudger View Post
Well, this might be not so easy...
  • single instance of mfaktc will measure CPU performance, not GPU performance for the highend GPUs
What Oliver said.

I have 4x GTX580s in my setup:
  • 2 of them are installed in a i7-2600k@4.5GHz - last 10day average - 564.6GHz-days/day combined, sieve primes=5000, GPU usage 96-98% need more cpu :( (all 4 cores used)
  • 1 installed in i7-920@2.8GHz - last 10day average 315.9GHz-days/day, sieve primes=12000, GPU load=99% (all 4 cores used)
  • 1 installed in AMD FX8120@3.8GHz, - last 4 days average 201.6GHz-days/day, sieve primes=5000, GPU load=72%, (Sorry, I can't recommend this CPU at all for any reason)

-- Craig
nucleon is offline   Reply With Quote
Old 2011-12-07, 17:24   #1403
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

342110 Posts
Default

Quote:
Originally Posted by Dubslow View Post
Thanks, that was helpful (... although contained more than one conflicting datum; I'm not sure it's 100% accurate).

Quote:
Originally Posted by TheJudger View Post
Well, this might be not so easy...
  • compute capability 1.0 (G80 chip): wont work
  • compute capability 1.1-1.3: same speed
  • compute capability 2.0: currently best GFLOPS/mfaktc performance
  • compute capability 2.1: ~20-35% slower than 2.0 for same GFLOPS
You're right. It wasn't easy, and I concur about your performance conclusions.
There are many factors that affect it (from overclocked GPU speed, SievePrimes setting, CPU powering it, etc etc) so my numbers are naturally quite rough, but compiling all the results give a general pattern. I'm using these approximated multipliers for GFLOPS to GHz-days/day:
v1.1-1.3 = 14.0
v2.0 = 5.0
v2.1 = 7.5

I'm pretty confident about the v1.1 results (3 very close results), less so about v2.0 and 2.1, but it's at least in the ballpark.

My chart is now scaled according to the compute version:
http://mersenne-aries.sili.net/mfaktc.php
James Heinrich is offline   Reply With Quote
Old 2011-12-08, 14:36   #1404
RichD
 
RichD's Avatar
 
Sep 2008
Kansas

D3B16 Posts
Default

Anybody have experience with the Linux 290.10 driver from nVidia?
RichD is offline   Reply With Quote
Old 2011-12-08, 14:42   #1405
James Heinrich
 
James Heinrich's Avatar
 
"James Heinrich"
May 2004
ex-Northern Ontario

11·311 Posts
Default

Quote:
Originally Posted by James Heinrich View Post
I'm using these approximated multipliers for GFLOPS to GHz-days/day
And by that, of course, I mean the complete opposite.
e.g. 8800GT = v1.1 @ 504 GFLOPS. 504 / 14 = 36 GHz-days/day expected.
James Heinrich is offline   Reply With Quote
Old 2011-12-09, 01:04   #1406
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Quote:
Originally Posted by RichD View Post
Anybody have experience with the Linux 290.10 driver from nVidia?
I would love to tell you, but for whatever reason, using the nVidia install file crashes my GUI (Ubuntu 11.04). (This also applies for previous drivers as well.)
Dubslow is offline   Reply With Quote
Old 2011-12-09, 04:53   #1407
Ralf Recker
 
Ralf Recker's Avatar
 
Oct 2010

191 Posts
Default

Quote:
Originally Posted by Dubslow View Post
I would love to tell you, but for whatever reason, using the nVidia install file crashes my GUI (Ubuntu 11.04). (This also applies for previous drivers as well.)
Probably a conflict between the nouveau drivers and the drivers from NVIDIA. It's not only necessary to completely disable the nouveau drivers before trying to install any NVIDIA driver, you also have to prevent the load of the kernel module for the nouveau driver by blacklisting the nouveau kernel module.

This PPA might be useful for you:

https://launchpad.net/~ubuntu-x-swat/+archive/x-updates

Last fiddled with by Ralf Recker on 2011-12-09 at 05:29 Reason: nouveau drivers of course, not noveau :)
Ralf Recker is offline   Reply With Quote
Old 2011-12-09, 05:18   #1408
Dubslow
Basketry That Evening!
 
Dubslow's Avatar
 
"Bunslow the Bold"
Jun 2011
40<A<43 -89<O<-88

3·29·83 Posts
Default

Installing the nvidia-current package fixes the GUI. I remember at install time I elected to install proprietary drivers. I'll take a look though and see what I can do.
Dubslow is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1676 2021-06-30 21:23
The P-1 factoring CUDA program firejuggler GPU Computing 753 2020-12-12 18:07
gr-mfaktc: a CUDA program for generalized repunits prefactoring MrRepunit GPU Computing 32 2020-11-11 19:56
mfaktc 0.21 - CUDA runtime wrong keisentraut Software 2 2020-08-18 07:03
World's second-dumbest CUDA program fivemack Programming 112 2015-02-12 22:51

All times are UTC. The time now is 10:27.


Mon Aug 2 10:27:13 UTC 2021 up 10 days, 4:56, 0 users, load averages: 1.71, 1.43, 1.25

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.