![]() |
|
|
#309 |
|
"Oliver"
Mar 2005
Germany
11×101 Posts |
Hi,
I just re-read the CUDA documentation about streams... it is a bad idea to hope that the independent streams are executed in a specific order. I'll try to fix this in 0.10. Oliver |
|
|
|
|
|
#310 |
|
Mar 2010
3×137 Posts |
|
|
|
|
|
|
#311 |
|
"Ethan O'Connor"
Oct 2002
GIMPS since Jan 1996
22·23 Posts |
Ah, good call! In fact I was always writing the next h_ktab directly on top of the one most recently queued for transfer, so this would be a very likely occurrence. The failures are non-deterministic so this argues for the same problem.
|
|
|
|
|
|
#312 |
|
Jul 2009
Tokyo
26216 Posts |
Hi TheJudger,
My GTX460 is cheaper version. Inno3D N460-1DDN-G5GW GDDR5 768MB (192bit-bus) core clock:675MHz mem clock:3600MHz (real clock:900MHz) Thank you, |
|
|
|
|
|
#313 |
|
"Oliver"
Mar 2005
Germany
11·101 Posts |
Hi msft,
thank you for the benchmark. ![]() I can see that the GTX 460 ist at least as fast as my GTX 275 for the 75bit kernel. Seems to be still CPU limited (SievePrimes too high). Can you rerun it with the mfaktc.ini in the current directory (it seems to be missing in your run, mfaktc.ini contains the runtime parameters)? Thank you! Oliver |
|
|
|
|
|
#314 | |
|
Jul 2009
Tokyo
10011000102 Posts |
Quote:
|
|
|
|
|
|
|
#315 |
|
"Oliver"
Mar 2005
Germany
21278 Posts |
Thank you, msft!
so for the 75 bit kernel we have ~175% performance compared to my (mild factory overclocked) GTX 275 ![]() ~55% performance of an GTX 480 The 1GiB variant of the GTX 460 shouldn't change those timings, memory bandwidth isn't very important for mfaktc. There are allready some GTX 460 with 1600MHz shader clock rate (1350MHz default) ![]() You made my decission easy... I'll upgrade to a GTX 460! ![]() According to the reviews in the web the GTX 460 consumes less power than my GTX 275 and generate less noise. ![]() Oliver |
|
|
|
|
|
#316 |
|
Mar 2010
41110 Posts |
msft, I wonder why your GTX 460 has only 224 unified shaders.
Should be 336. Could mfaktc be wrong? |
|
|
|
|
|
#317 |
|
Oct 2002
France
2×3×23 Posts |
Hi,
I compile mfakt 0.09 (with CUDA 3.1) and the program doesn't run. Config: Ubuntu 10.04 nvidia driver 256.35 CUDA 3.1 GPU: NVIDIA QUATTRO 140M when I run mfakt -st here is the result: Code:
Compiletime Options THREADS_PER_GRID 983040 THREADS_PER_BLOCK 256 SIEVE_SIZE_LIMIT 32kiB SIEVE_SIZE 230945bits VERBOSE_TIMING disabled MORE_CLASSES disabled Runtime Options SievePrimes 25000 SievePrimesAdjust 1 NumStreams 3 WorkFile worktodo.txt Checkpoints enabled CUDA device info name: Quadro NVS 140M compute capabilities: 1.1 maximum threads per block: 512 number of multiprocessors: 2 (16 shader cores) clock rate: 800MHz cudaStreamCreate() failed PS:(I also tried the compilation option --gpu-architecture=sm_10) Thanks |
|
|
|
|
|
#318 | |
|
Jul 2009
Tokyo
2×5×61 Posts |
Maybe.
Quote:
|
|
|
|
|
|
|
#319 | |
|
"Oliver"
Mar 2005
Germany
100010101112 Posts |
Hi Aillas,
Quote:
![]() Did you try the examples from the CUDA SDK? --- hi msft, Karl Yes, I need to adjust this. But it is only a cosmetic error. I calculate the number of shader cores only for display, nothing depends on this calculation. I thought that some users might feel uncomfortable with the number of multiprocessors: "Hey, my GPU has e.g. 256 cores, why does mfaktc only show 32 (multiprocessors)?") The calculation of shader cores was easy before Fermi: just multiply the number of multiprocessors by 8. Now Nvidia has other configurations, too... 32 cores per multiprocessor (compute capabiltiy 2.0 / GTX 465/470/480) 48 cores per multiprocessor (compute capabiltiy 2.1 / GTX 460) But Nvidia doesn't tell the number of cores per multiprocessor before they launch the products... But again, it is just a cosmetic issue! Oliver |
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| The P-1 factoring CUDA program | firejuggler | GPU Computing | 753 | 2020-12-12 18:07 |
| gr-mfaktc: a CUDA program for generalized repunits prefactoring | MrRepunit | GPU Computing | 32 | 2020-11-11 19:56 |
| mfaktc 0.21 - CUDA runtime wrong | keisentraut | Software | 2 | 2020-08-18 07:03 |
| World's second-dumbest CUDA program | fivemack | Programming | 112 | 2015-02-12 22:51 |