mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware > GPU Computing

Reply
 
Thread Tools
Old 2013-09-07, 12:10   #89
Robish
 
"Rob Gahan"
Aug 2013
Ireland

448 Posts
Smile Great work

Fantastic works guys! well done! Ill try it now with a 7870 if I can.

cheers

Rob
Robish is offline   Reply With Quote
Old 2013-09-08, 08:59   #90
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2·5·61 Posts
Default

Quote:
Originally Posted by Robish View Post
Fantastic works guys! well done! Ill try it now with a 7870 if I can.
Please wait until the windows version available.
msft is offline   Reply With Quote
Old 2013-09-08, 13:36   #91
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

37×59 Posts
Default

Quote:
Originally Posted by msft View Post
Please wait until the windows version available.
Still some loose ends on my side though.

Quote:
Originally Posted by msft View Post
Normal.
It is slow except power of two.
So isn't it possible to not select fft lengths the power of two unless explicitly asked with -f?
kracker is offline   Reply With Quote
Old 2013-09-08, 14:38   #92
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2·5·61 Posts
Default

Quote:
Originally Posted by kracker View Post
So isn't it possible to not select fft lengths the power of two unless explicitly asked with -f?
This version Normalize can not support only pow2.
Too large fft size make error.
Code:
$ ./clLucas 1398269

start M1398269 fft length = 73728
Iteration 10000 M( 1398269 )C, 0xa4a6d2f0e34629db, n = 73728, clLucas v1.00 err = 0.09375 (1:01 real, 6.0343 ms/iter, ETA 2:18:47)

$ ./clLucas -f 147456 1398269

start M1398269 fft length = 147456
err = 16,fft length = 147456 exiting.
Warning:  Program terminating, but clFFT resources not freed.
Please consider explicitly calling clfftTeardown( ).
msft is offline   Reply With Quote
Old 2013-09-10, 01:14   #93
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

37×59 Posts
Default



M( 1398269 )P, n = 73728, CUDALucas v1.66
Attached Thumbnails
Click image for larger version

Name:	cl.png
Views:	196
Size:	372.0 KB
ID:	10240  

Last fiddled with by kracker on 2013-09-10 at 01:16
kracker is offline   Reply With Quote
Old 2013-09-10, 02:59   #94
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Quote:
Originally Posted by kracker View Post


M( 1398269 )P, n = 73728, CUDALucas v1.66
Congratulations.
Thank you lots of work.
msft is offline   Reply With Quote
Old 2013-09-10, 05:18   #95
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Quote:
Originally Posted by kracker View Post
M( 1398269 )P, n = 73728, CUDALucas v1.66
I think your 7770 25% faster than my 7750.
Attached Thumbnails
Click image for larger version

Name:	Screenshot from 2013-09-10 14:15:06.png
Views:	163
Size:	200.1 KB
ID:	10242  
msft is offline   Reply With Quote
Old 2013-09-10, 07:55   #96
LaurV
Romulan Interpreter
 
LaurV's Avatar
 
"name field"
Jun 2011
Thailand

1029110 Posts
Default

Quote:
Originally Posted by msft View Post
I think your 7770 25% faster than my 7750.
Someone post a win64 exe and I tell you how fast a 7970 and a 7990 are...

Huh? What do you think?
LaurV is offline   Reply With Quote
Old 2013-09-10, 08:31   #97
msft
 
msft's Avatar
 
Jul 2009
Tokyo

2×5×61 Posts
Default

Quote:
Originally Posted by LaurV View Post
Someone post a win64 exe and I tell you how fast a 7970 and a 7990 are...

Huh? What do you think?
It's can't be tell,Obesity.
400~800%,I guess.
msft is offline   Reply With Quote
Old 2013-09-10, 14:24   #98
kracker
 
kracker's Avatar
 
"Mr. Meeseeks"
Jan 2012
California, USA

42078 Posts
Default

7750 vs two 7970.
http://anandtech.com/bench/product/535?vs=588

Also, DP ratio is diffrent, 1/16(?) for 7750, 1/4 for 79xx.

Right now doing a DC check with my 7770
kracker is offline   Reply With Quote
Old 2013-09-10, 17:52   #99
TeknoHog
 
TeknoHog's Avatar
 
Mar 2010
Jyvaskyla, Finland

448 Posts
Default

I have followed this project with great interest, and I look forward to a usable release. So far, it has been a little hard to piece together all the information necessary for compilation. For example, when trying to compile 0.59, I was left wondering what openclsdkdefs.mk is supposed to be, and where I should find SDKApplication.hpp. I use Linux and I'm not exactly a beginner with these things, but for some reason I feel completely lost here, as if I was missing some essential package (besides AMD APP SDK and clAmdFft, of course).

One reason why I mention this now is that I recently released an automatic work assignment/submission tool for mfakto, and I'd like to extend it for LL testing. Of course, I would try CudaLucas if I had any Nvidia hardware.
TeknoHog is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
mfakto: an OpenCL program for Mersenne prefactoring Bdot GPU Computing 1724 2023-06-04 23:31
Can't get OpenCL to work on HD7950 Ubuntu 14.04.5 LTS VictordeHolland Linux 4 2018-04-11 13:44
OpenCL accellerated lattice siever pstach Factoring 1 2014-05-23 01:03
OpenCL for FPGAs TObject GPU Computing 2 2013-10-12 21:09
AMD's Graphics Core Next- a reason to accelerate towards OpenCL? Belteshazzar GPU Computing 19 2012-03-07 18:58

All times are UTC. The time now is 15:26.


Fri Jul 7 15:26:40 UTC 2023 up 323 days, 12:55, 0 users, load averages: 1.03, 1.11, 1.09

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔