mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2016-01-23, 22:39   #1343
airsquirrels
 
airsquirrels's Avatar
 
"David"
Jul 2015
Ohio

11·47 Posts
Default

Quote:
Originally Posted by kracker View Post
.


Has anyone else been able to verify before/after performance changes from changing the memory allocation type?
airsquirrels is offline   Reply With Quote
Old 2016-01-23, 22:48   #1344
Rodrigo
 
Rodrigo's Avatar
 
Jun 2010
Pennsylvania

16468 Posts
Default

Thanks @wombatman and @lalera.

Rodrigo

Last fiddled with by Rodrigo on 2016-01-23 at 22:50 Reason: typo
Rodrigo is offline   Reply With Quote
Old 2016-01-23, 23:02   #1345
lalera
 
lalera's Avatar
 
Jul 2003

13×47 Posts
Default

my first results for v 0.14 and v 0.14p
Attached Files
File Type: txt v014v014p.txt (7.3 KB, 214 views)
lalera is offline   Reply With Quote
Old 2016-01-24, 01:02   #1346
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

23×271 Posts
Default

Quote:
Originally Posted by lalera View Post
my first results for v 0.14 and v 0.14p
Try benchmarking with an actual workload instead of the selftests, the reason for the selftests are more of a "sanity check" rather than a benchmark or hardware tester.
kracker is offline   Reply With Quote
Old 2016-01-24, 16:38   #1347
lalera
 
lalera's Avatar
 
Jul 2003

13·47 Posts
Default

hi,
the file i posted above (#1345) represents my work in the range i am doing
here is an example of a actual workload on the tf-wave
Code:
Factor=N/A,132961627,71,72

v 0.14
got assignment: exp=132961627 bit_min=71 bit_max=72 (3.60 GHz-days)
Starting trial factoring M132961627 from 2^71 to 2^72 (3.60GHz-days)
Using GPU kernel "cl_barrett15_73_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Jan 24 17:08 | 4617 100.0% |  0.589   0m00s |    549.62    82485    0.00%
no factor for M132961627 from 2^71 to 2^72 [mfakto 0.14-Win cl_barrett15_73_gs_2]
tf(): total time spent:  9m 27.220s (547.89 GHz-days / day)

v 0.14p
got assignment: exp=132961627 bit_min=71 bit_max=72 (3.60 GHz-days)
Starting trial factoring M132961627 from 2^71 to 2^72 (3.60GHz-days)
Using GPU kernel "cl_barrett15_73_gs_2"
Date    Time | class   Pct |   time     ETA | GHz-d/day    Sieve     Wait
Jan 24 17:21 | 4617 100.0% |  0.578   0m00s |    560.08    82485    0.00%
no factor for M132961627 from 2^71 to 2^72 [mfakto 0.14-MGW cl_barrett15_73_gs_2]
tf(): total time spent:  9m 10.661s (564.37 GHz-days / day)
lalera is offline   Reply With Quote
Old 2016-01-24, 16:48   #1348
lalera
 
lalera's Avatar
 
Jul 2003

13·47 Posts
Default

hi,
thanks to kracker for the program !
i am using it now
lalera is offline   Reply With Quote
Old 2016-01-28, 03:11   #1349
PopGoesTheWease
 
Jan 2016

102 Posts
Thumbs up

Quote:
Originally Posted by airsquirrels View Post
Has anyone else been able to verify before/after performance changes from changing the memory allocation type?
Just changed over to 0.14p and wow! On my workstation (dual socket Win7x64) there is a ~14% improvement. On my server (quad socket Win2k8R2) there is a ~20% improvement. Same model cards both in x16 slots.

PopGoesTheWease is offline   Reply With Quote
Old 2016-01-28, 03:12   #1350
PopGoesTheWease
 
Jan 2016

210 Posts
Default

Quote:
Originally Posted by airsquirrels View Post
Has anyone else been able to verify before/after performance changes from changing the memory allocation type?
Just changed over to 0.14p and wow! On my workstation (dual socket Win7x64) there is a ~14% improvement. On my server (quad socket Win2k8R2) there is a ~20% improvement. Same model cards both in x16 slots.
PopGoesTheWease is offline   Reply With Quote
Old 2016-02-02, 01:06   #1351
airsquirrels
 
airsquirrels's Avatar
 
"David"
Jul 2015
Ohio

11·47 Posts
Default

Quote:
Originally Posted by PopGoesTheWease View Post
Just changed over to 0.14p and wow! On my workstation (dual socket Win7x64) there is a ~14% improvement. On my server (quad socket Win2k8R2) there is a ~20% improvement. Same model cards both in x16 slots.
Are your CPU cores running prime95 or otherwise loaded? Switching to on-GPU memory would significantly reduce the need for CPU use for memory transfers, which may have been stalling you. Also, if you are using older / slower main RAM that would also explain the high gains despite being in x16 slots.
airsquirrels is offline   Reply With Quote
Old 2016-02-02, 21:39   #1352
derekg
 
derekg's Avatar
 
Jul 2003

2×23 Posts
Default

Using an R9 390X on a 2.0 x8 slot throughput went from 624.15 GHZ-days per day to 696.91.

Derek
derekg is offline   Reply With Quote
Old 2016-03-13, 06:13   #1353
fregames
 
Jan 2015
USA

310 Posts
Unhappy Win8 upgrade

I just upgraded to windows 8 and performance has dropped to 7Ghz-Days/Day. I'm usually running at ~275 each. Is there any known issue that would cause this? I'm using 3 R9 270x cards.

--
Turns out I needed to install drivers. my bad

Last fiddled with by fregames on 2016-03-13 at 06:49 Reason: Fail, I figured it out.
fregames is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
gpuOwL: an OpenCL program for Mersenne primality testing preda GpuOwl 2718 2021-07-06 18:30
mfaktc: a CUDA program for Mersenne prefactoring TheJudger GPU Computing 3497 2021-06-05 12:27
LL with OpenCL msft GPU Computing 433 2019-06-23 21:11
OpenCL for FPGAs TObject GPU Computing 2 2013-10-12 21:09
Program to TF Mersenne numbers with more than 1 sextillion digits? Stargate38 Factoring 24 2011-11-03 00:34

All times are UTC. The time now is 17:23.


Mon Aug 2 17:23:52 UTC 2021 up 10 days, 11:52, 0 users, load averages: 2.10, 2.22, 2.23

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.