|
|
#540 | |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
Unless someone invests in a dual Xeon system, I don't see why anyone would want to TF on more than one video card in the same system anyway (you could run it, but you wouldn't be able to max out both GPUs). An i7 3960x with 12 hyperthreads could maybe do it, but I wouldn't want to spend the money to see. A better use for two high-end GPUs in a system is TF on one and CUDALucas on the other (if it's nVidia).
Quote:
|
|
|
|
|
|
|
#541 | |
|
Sep 2006
The Netherlands
729₁₀ Posts |
Quote:
So both those should be faster than the GTX580 |
|
|
|
|
|
|
#542 | |
|
"Mr. Meeseeks"
Jan 2012
California, USA
2³·271 Posts |
Quote:
Oh, and the reason I got a 7770 instead of a GTX 550 was that it was slightly faster, and the 7770 used less power too. (This is before Kepler, which pushed compute down.) EDIT: well, not slightly. 7770 vs 550 *Ti*: http://www.anandtech.com/bench/Product/536?vs=541 Last fiddled with by kracker on 2012-11-26 at 19:44 |
|
|
|
|
|
|
#543 | |
|
Sep 2006
The Netherlands
3⁶ Posts |
Quote:
So a small kernel makes sense. I want to write 2 kernels though, including one to get above 70 bits. If the 23 bits trick works I will definitely report that, of course. I wrote my own C code for producing candidates. I use a bigger primebase here than TheJudger, for example, for Wagstaff, and have different sieving tricks (unknown whether that's faster). Wagstaff is slightly different from Mersenne there, and I run on other CPUs with other sorts of caches here. So far I have only written the TF code in C; in OpenCL I have only toyed and experimented. Yet my plan from some months ago was to have something going by the end of November, so I've only got a few days now to keep me warm in this office, as the gas price is too high, you know. Better to burn electricity :) Will keep you updated. This isn't rocket science with equipment out of the 1950s, this is the real thing :) Last fiddled with by diep on 2012-11-26 at 19:49 |
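As a rough illustration of the candidate generation mentioned in this post (not diep's C code, which is not shown in the thread), here is a minimal Python sketch for the Mersenne case. It leans on two well-known facts: for an odd prime p, any prime factor q of 2^p − 1 has the form q = 2kp + 1, and q ≡ ±1 (mod 8). The function name `candidates`, the tiny prime base and the k range are hypothetical choices for illustration only; Wagstaff numbers need a different residue filter.

```python
# Toy prime base, chosen arbitrarily for this sketch (an assumption, not from the thread).
SMALL_PRIMES = (3, 5, 7, 11, 13, 17, 19, 23)

def candidates(p, k_max, primebase=SMALL_PRIMES):
    """Yield factor candidates q = 2*k*p + 1 of 2**p - 1 for 1 <= k <= k_max.

    A candidate is kept only if q is 1 or 7 mod 8 and q has no proper
    divisor in the prime base.
    """
    for k in range(1, k_max + 1):
        q = 2 * k * p + 1
        # Factors of a Mersenne number must be +1 or -1 mod 8.
        if q % 8 not in (1, 7):
            continue
        # Drop q when a prime from the base divides it (unless q is that prime).
        if any(q % r == 0 and q != r for r in primebase):
            continue
        yield q

if __name__ == "__main__":
    print(list(candidates(11, 4)))
```

A production siever would mark off residue classes of k in a bit array rather than dividing each q by every base prime; the per-candidate division here only keeps the sketch short.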
|
|
|
|
|
|
#544 | |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
Quote:
|
|
|
|
|
|
|
#545 | |
|
"Mr. Meeseeks"
Jan 2012
California, USA
2³×271 Posts |
Quote:
|
|
|
|
|
|
|
#546 | |
|
Sep 2006
The Netherlands
3⁶ Posts |
Quote:
Blaming the GTX590 for software limitations is not very fair, I'd say. What bandwidth did you get when you benchmarked from CPU to GPU on the GTX590? I've got 8-core Xeon L5420 machines here (2 CPUs per machine). They're $150 on eBay. Note that I have motherboards that are PCIe 2.0 (Seaburg chipset). Under full load (without the Teslas or the 6970) the machines draw 170 watts. Last fiddled with by diep on 2012-11-26 at 20:04 |
|
|
|
|
|
|
#547 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
100001111000₂ Posts |
What? The 590 IS clocked down, I believe because of heat issues, so it WILL be slower than two 580s; plus, SLI and mfaktx don't work optimally together.
|
|
|
|
|
|
#548 | |
|
Sep 2006
The Netherlands
3⁶ Posts |
Quote:
So it is logical, then, that 2x 580 is faster than a single GTX590 :) Also, the power usage of 2x 580 is kind of stupidly high when running GPGPU. What is it, 1000+ watts with the machine's CPU cores included? |
|
|
|
|
|
|
#549 | ||
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
Quote:
These are the systems the 590 was tested in:
i7-3770k @ 4.5, MSI Z77A-G41, 16 GB, GTX 580
i7 3770k stock, ASUS P8H67-M PRO/CSM, 16 GB, GTX 580
i7 2700k stock, Biostar TP67xe, 16 GB, GTX 580
i7 2700k stock, ECS H61H2-M2, 8 GB, GTX 580
Quote:
Last fiddled with by flashjh on 2012-11-26 at 20:18 |
||
|
|
|
|
|
#550 | |
|
Sep 2006
The Netherlands
3⁶ Posts |
Quote:
I'm busy with trial factoring here. CUDALucas is something *totally* different, man. Trial factoring uses integers. The 590 is fast for integers. CUDALucas is an FFT. Of course that is not a good plan on the 590, as the FFTs require shared RAM and the card is lobotomized for double precision. For double precision number crunching, get Teslas from Nvidia! I've got a bunch of Tesla C2075s here, and they are fast for floating point! Are you mixing up trial factoring with DWT? With TF you need fast CPUs, plenty of CPU cores and good software to resupply the GPU. That GTX590 can do, what is it, 800 million/s or so? How the hell do you generate enough FCs with a quad-core 2700 chip? Over here I've got a cluster of 8 nodes of dual-socket L5420 Xeons, 64 cores in total, and another box with 16 cores of AMD 8356s. Enough to supply the video card for TF? :) Last fiddled with by diep on 2012-11-26 at 20:24 |
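To make the integers-versus-FFT point concrete: testing a single factor candidate q against 2^p − 1 needs nothing but integer multiplies and reductions, since q divides 2^p − 1 exactly when 2^p ≡ 1 (mod q). Below is a minimal Python sketch of that check, assuming the hypothetical name `divides_mersenne`. Real GPU kernels such as those in mfaktc work on fixed-width multiword integers rather than Python big integers, so this shows only the arithmetic, not anyone's actual kernel.

```python
def divides_mersenne(p, q):
    """Return True if q divides 2**p - 1.

    Computes 2**p mod q by left-to-right square-and-multiply over the
    bits of p, using integer arithmetic only.
    """
    acc = 1
    for bit in bin(p)[2:]:          # bits of p, most significant first
        acc = (acc * acc) % q       # square for every bit
        if bit == "1":
            acc = (acc * 2) % q     # multiply by the base 2 when the bit is set
    return acc == 1 % q             # 1 % q also handles the degenerate q == 1

if __name__ == "__main__":
    # 2**11 - 1 = 2047 = 23 * 89
    print(divides_mersenne(11, 23), divides_mersenne(11, 89), divides_mersenne(11, 67))
```

The loop runs once per bit of the exponent p, so each candidate costs only a few dozen small modular squarings for exponents in the usual GIMPS range. That is why the hard part is supplying candidates fast enough, not testing them.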
|
|
|
|