![]() |
|
|
#1200 | |
|
Nov 2010
Germany
3×199 Posts |
Quote:
![]() Thanks again, Jayder, for pointing out this issue to me. I'm not yet sure how to address this, though ... It goes in line with another observation regarding the use of double precision: On my HD7950, it improves performance by 5%. On HD7850, performance drops by 7%. It looks like a lot of device-dependent #ifdefs need to go into the kernel files, which I tried to avoid so far (the IntelHD bugs were the first to require that). I may also need to create a separate device class for the high-end GCN's because of their faster DP performance. Thank you all for your offers to test ... with these additional changes coming, I think it makes no sense to send out a test version right now. I'll come back to you ... |
|
|
|
|
|
|
#1201 | |
|
"Victor de Hollander"
Aug 2011
the Netherlands
23×3×72 Posts |
Quote:
|
|
|
|
|
|
|
#1202 |
|
Sep 2014
1910 Posts |
|
|
|
|
|
|
#1203 |
|
"Graham uses ISO 8601"
Mar 2014
AU, Sydney
35 Posts |
The production rate figures quoted for that HD7950 1100MHz are not too far removed from rates for my HD7950 1000MHz working on exponents in the 118 million space, (i.e. 2 power p minus 1).
I've been watching this space anticipating a new release for 74 bit exploration, but can probably only use that out of hours, since I've never been able to adequately duck lack of responsiveness issues. |
|
|
|
|
|
#1204 | |
|
Nov 2010
Germany
3·199 Posts |
I used a 100M exponent for that test.
Quote:
The new release will have to wait a bit longer as I currently have very little time to work on it (and there are still a few things to do). Last fiddled with by Bdot on 2014-10-08 at 07:46 Reason: FlushInterval: start with 3 downwards |
|
|
|
|
|
|
#1205 |
|
Sep 2014
19 Posts |
On my ATI R9 290 i use FlushInterval=0. 3,2,1 works much worse than "0".
GPUSieveSize=5 or 6 GPUSieveProcessSize=16 GPUSievePrimes= between 30000 and 80000 For example: today i run exp 70M bit 71-72 Rate 2500M/s, 430GHz-d/day. Soon i will test 100M candidates on different bits, to comapre my GPU performance with Bdot's 7950. |
|
|
|
|
|
#1206 |
|
"Victor de Hollander"
Aug 2011
the Netherlands
23×3×72 Posts |
|
|
|
|
|
|
#1207 |
|
Sep 2014
19 Posts |
Less throughput.
|
|
|
|
|
|
#1208 |
|
Nov 2010
Germany
3×199 Posts |
Ah, thanks for that clarification!
Yes, of course, the best throughput is achieved when the GPU is not shared with anything, especially not 3D-Games or screen-updates in general .My suggestion was meant towards a more responsive system at the cost of as little as possible throughput. Regarding your performance measurements: throughput should scale linearly with the GPU clock speed (or shader clock). Memory clock has very little influence. |
|
|
|
|
|
#1209 | |
|
"Graham uses ISO 8601"
Mar 2014
AU, Sydney
F316 Posts |
Quote:
I recognise that the writer is not necessarily english first. I suspect that various minor inflections might have conveyed a more intended meaning. One such moderation would be to suggest an intention of a more responsive system at the cost of as little as possible throughput reduction. |
|
|
|
|
|
|
#1210 |
|
Nov 2010
Germany
11258 Posts |
Oh .. I see
. Thank you for trying to extract what I really meant (I wish all forum members did that consistently). I probably wanted to say something like "the responsiveness improvement should cost you as little performance as possible" ... And you're totally right about English not being my first language. It was actually the third I started to learn. Thinking about this all again, it would probably be much easier for me to aim for as little throughput as possible than what I was trying over the past few years
|
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| gpuOwL: an OpenCL program for Mersenne primality testing | preda | GpuOwl | 2719 | 2021-08-05 22:43 |
| mfaktc: a CUDA program for Mersenne prefactoring | TheJudger | GPU Computing | 3497 | 2021-06-05 12:27 |
| LL with OpenCL | msft | GPU Computing | 433 | 2019-06-23 21:11 |
| OpenCL for FPGAs | TObject | GPU Computing | 2 | 2013-10-12 21:09 |
| Program to TF Mersenne numbers with more than 1 sextillion digits? | Stargate38 | Factoring | 24 | 2011-11-03 00:34 |