mersenneforum.org  

Old 2013-06-15, 04:18   #1
msft
 
 
Jul 2009
Tokyo

2·5·61 Posts
LL with OpenCL

I have tried porting LL (the Lucas-Lehmer test) to OpenCL.
Old 2013-06-15, 04:24   #2
msft
 
 
Jul 2009
Tokyo

2×5×61 Posts

clAmdFft-1.10.321 sometimes has a precision problem.
7750:
Quote:
$ sh -x ./run.sh
+ rm *.o a.out
+ g++ -c main.cpp -I /opt/AMDAPP/include/ -I /opt/clAmdFft-1.10.321/include/
+ g++ -c clFFTPlans.cpp -I /opt/AMDAPP/include/ -I /opt/clAmdFft-1.10.321/include/
+ g++ main.o clFFTPlans.o /opt/clAmdFft-1.10.321/lib64/libclAmdFft.Runtime.so -lOpenCL -lfftw3
+ export LD_LIBRARY_PATH=:/opt/clAmdFft-1.10.321/lib64/
+ time ./a.out
Using device: Capeverde
AmdFFT_Z2Z size= 2048 time= 0.080000 msec
Everything went fine!
3.52user 7.27system 0:09.48elapsed 113%CPU (0avgtext+0avgdata 381360maxresident)k
0inputs+1680outputs (0major+38531minor)pagefaults 0swaps
+ diff+ fft_fftw.dat fft_cl.dat
head -n 40
2,16c2,16
< 1 -1.673975300084231e+05 1.663735300084231e+05
< 2 -8.395397960358397e+04 8.292997960358398e+04
< 3 -5.613911373064458e+04 5.511511373064457e+04
< 4 -4.223141898575165e+04 4.120741898575165e+04
< 5 -3.388659268655724e+04 3.286259268655724e+04
< 6 -2.832320060429759e+04 2.729920060429759e+04
< 7 -2.434919949694704e+04 2.332519949694705e+04
< 8 -2.136856774250665e+04 2.034456774250665e+04
< 9 -1.905018221676692e+04 1.802618221676692e+04
< 10 -1.719536904441296e+04 1.617136904441295e+04
< 11 -1.567769939498128e+04 1.465369939498128e+04
< 12 -1.441288738083825e+04 1.338888738083825e+04
< 13 -1.334258123301454e+04 1.231858123301454e+04
< 14 -1.242510111596777e+04 1.140110111596777e+04
< 15 -1.162988181643994e+04 1.060588181643995e+04
---
> 1 -1.673975291656877e+05 1.663735279914048e+05
> 2 -8.395397960358400e+04 8.292997960358398e+04
> 3 -5.613911438886409e+04 5.511511209925302e+04
> 4 -4.223141898575164e+04 4.120741898575164e+04
> 5 -3.388659171768492e+04 3.286259310540232e+04
> 6 -2.832320060429760e+04 2.729920060429759e+04
> 7 -2.434919920591193e+04 2.332519938365545e+04
> 8 -2.136856763176386e+04 2.034456749267493e+04
> 9 -1.905018202758640e+04 1.802618176363859e+04
> 10 -1.719536897123582e+04 1.617136882738273e+04
> 11 -1.567769950455412e+04 1.465369873990205e+04
> 12 -1.441288733405879e+04 1.338888718385923e+04
> 13 -1.334258082835818e+04 1.231858121315468e+04
> 14 -1.242510108971055e+04 1.140110093114974e+04
> 15 -1.162988166206707e+04 1.060588158595162e+04
18c18
< 17 -1.031992388900484e+04 9.295923889004827e+03
---
> 17 -1.031992383364508e+04 9.295923772879578e+03
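Compare line 3: the FFTW and clAmdFft values differ by a relative error of about 1e-8, far more than double-precision rounding would explain.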
Attached Files
File Type: bz2 0.78.tar.bz2 (3.8 KB, 502 views)
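To put a number on the mismatch, here is a minimal Python sketch that computes the relative difference for line 3 of the diff above. The two value pairs are copied from the FFTW and clAmdFft output in this post; the helper name `rel_diff` is made up for illustration and is not part of the attached code.

```python
import sys

# Line 3 of the diff: (real, imaginary) parts from FFTW and from clAmdFft.
fftw = (-5.613911373064458e+04, 5.511511373064457e+04)
cl = (-5.613911438886409e+04, 5.511511209925302e+04)


def rel_diff(a, b):
    """Relative difference |a - b| / |a| (hypothetical helper)."""
    return abs(a - b) / abs(a)


errors = [rel_diff(a, b) for a, b in zip(fftw, cl)]
for name, err in zip(("real", "imag"), errors):
    print(f"{name}: relative difference {err:.2e}")

# Machine epsilon for double precision, for comparison.
print(f"double epsilon: {sys.float_info.epsilon:.2e}")
```

Both differences come out several orders of magnitude above the double-precision epsilon, which is why the result looks like more than ordinary rounding noise.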
Old 2013-06-15, 20:26   #3
kracker
 
 
"Mr. Meeseeks"
Jan 2012
California, USA

100001111001₂ Posts

I hear the AMD dev forums are a good place to ask if you get stuck.
Old 2013-06-16, 02:11   #4
msft
 
 
Jul 2009
Tokyo

2·5·61 Posts

Quote:
Originally Posted by kracker
I hear the AMD dev forums are a good place to ask if you get stuck.
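Thank you for the information.
The addition-order problem is subtle: floating-point addition is not associative, so the order in which terms are summed changes the result.
I'll try to investigate a little more.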
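As a small illustration of why addition order matters, here is a Python sketch using ordinary IEEE double precision. The numbers are chosen only for demonstration and have nothing to do with the FFT data in this thread; `math.fsum` is used as an exactly rounded reference.

```python
import math
from functools import reduce
from operator import add

# Same three terms, two groupings: the results differ in the last bit.
a, b, c = 0.1, 0.2, 0.3
left = (a + b) + c
right = a + (b + c)
print(left == right, left, right)

# A more dramatic case: small terms are lost when added to a huge one first.
terms = [1e16, 1.0, 1.0, -1e16]
forward = reduce(add, terms)                       # ((1e16 + 1) + 1) - 1e16
small_first = reduce(add, sorted(terms, key=abs))  # ((1 + 1) + 1e16) - 1e16
exact = math.fsum(terms)                           # exactly rounded sum
print(forward, small_first, exact)
```

A GPU FFT and a CPU FFT generally add their terms in different orders, so some difference in the last digits is expected even when both are correct.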
Old 2013-06-16, 19:11   #5
kracker
 
 
"Mr. Meeseeks"
Jan 2012
California, USA

3²·241 Posts

I just noticed something... The GTX 780 has a double-precision to single-precision throughput ratio of 1/24, while high-end GCN parts have a ratio of around 1/4. Not sure if it will mean anything, just something I read.
Old 2013-06-17, 10:39   #6
msft
 
 
Jul 2009
Tokyo

2×5×61 Posts

The multiplication loop now runs on the GPU.
Quote:
$ pwd
/opt/AMDAPP/samples/opencl/cl/0.19/0.19
msft@msft-desktop:/opt/AMDAPP/samples/opencl/cl/0.19/0.19$ sh -x ./run.sh
+ export LD_LIBRARY_PATH=:/opt/clAmdFft-1.10.321/lib64/
$ time ./a.out 216091
Platform :Advanced Micro Devices, Inc.
Device 0 : Capeverde

Build Options are : -D KHR_DP_EXTENSION
---
216001 32768
M( 216091 )P, n = 32768, MacLucasFFTW v8.1 Ballester

real 12m43.947s
user 6m19.212s
sys 3m8.580s
Attached Files
File Type: bz2 0.19.tar.bz2 (42.4 KB, 473 views)
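For readers who have not seen it, the Lucas-Lehmer test that these runs perform can be sketched in a few lines of Python. This is the plain big-integer form of the test, not the FFT-based squaring that the attached code uses; the function name `lucas_lehmer` is just an illustrative choice.

```python
def lucas_lehmer(p):
    """Return True if the Mersenne number 2**p - 1 is prime.

    Assumes p is an odd prime. Starts from s = 4 and applies
    s -> s*s - 2 (mod 2**p - 1) exactly p - 2 times.
    """
    m = (1 << p) - 1
    s = 4
    for _ in range(p - 2):
        s = (s * s - 2) % m
    return s == 0


# Small exponents only: each iteration squares a p-bit number.
for p in (3, 5, 7, 11, 13, 607):
    print(p, lucas_lehmer(p))
```

The squaring step dominates the cost, which is why large exponents such as 216091 are done with FFT multiplication, and why moving that loop to the GPU is the interesting part.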
Old 2013-06-18, 03:08   #7
msft
 
 
Jul 2009
Tokyo

610₁₀ Posts

The whole loop now runs on the GPU.
7750:
Quote:
$ pwd
/opt/AMDAPP/samples/opencl/cl/0.27/0.27
$ sh -x ./run.sh
$ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/clAmdFft-1.10.321/lib64/
$ time ./a.out 216091
Platform :Advanced Micro Devices, Inc.
Device 0 : Capeverde
----
216001 32768
M( 216091 )P, n = 32768, MacLucasFFTW v8.1 Ballester

real 16m51.839s
user 1m26.733s
sys 3m35.473s
Caution: it is very slow, and the system may crash suddenly.
Attached Files
File Type: bz2 0.27.tar.bz2 (37.8 KB, 376 views)
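To show why FFT precision matters for the squaring step of such a loop, here is a rough Python sketch of squaring an integer with a floating-point FFT and rounding back to integers. It is an assumption-laden illustration, not the attached code: it uses a plain recursive power-of-two FFT with zero padding, whereas production LL programs typically use a weighted transform, and the names `fft` and `square_via_fft` and the 8-bit digit size are made up here.

```python
import cmath


def fft(a, invert=False):
    """Recursive radix-2 FFT; len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    even = fft(a[0::2], invert)
    odd = fft(a[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        w = cmath.exp(sign * 2j * cmath.pi * k / n)
        out[k] = even[k] + w * odd[k]
        out[k + n // 2] = even[k] - w * odd[k]
    return out


def square_via_fft(x, bits_per_digit=8):
    """Square a non-negative int using a floating-point FFT.

    Returns (square, max_roundoff), where max_roundoff is the largest
    distance of any convolution output from the nearest integer.
    """
    mask = (1 << bits_per_digit) - 1
    digits = []
    while x:
        digits.append(x & mask)
        x >>= bits_per_digit
    if not digits:
        digits = [0]
    n = 1
    while n < 2 * len(digits):  # zero padding avoids wrap-around
        n *= 2
    padded = [complex(d) for d in digits] + [0j] * (n - len(digits))
    spectrum = fft(padded)
    conv = fft([v * v for v in spectrum], invert=True)
    result, max_err = 0, 0.0
    for i, v in enumerate(conv):
        value = v.real / n  # the inverse transform needs the 1/n scaling
        nearest = round(value)
        max_err = max(max_err, abs(value - nearest))
        result += nearest << (i * bits_per_digit)  # Python ints absorb carries
    return result, max_err


m = (1 << 607) - 1
square, err = square_via_fft(m)
print(square == m * m, err)
```

The round-off value should stay well below 0.5; if the FFT loses precision, rounding picks the wrong integer and every later iteration is wrong, which is why an imprecise FFT library is a serious problem for an LL test.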
Old 2013-06-18, 14:17   #8
kracker
 
 
"Mr. Meeseeks"
Jan 2012
California, USA

4171₈ Posts

Is there any way you can give me binaries for Linux or even Windows to tinker around with? I tried compiling, but it just doesn't work for me here...
Old 2013-06-18, 14:57   #9
msft
 
 
Jul 2009
Tokyo

262₁₆ Posts

Quote:
Originally Posted by kracker
Is there any way you can give me binaries for Linux or even Windows to tinker around with? I tried compiling, but it just doesn't work for me here...
Can you compile MatrixMulImage on Linux?
Quote:
desktop:/opt/AMDAPP/samples/opencl/cl/app/MatrixMulImage$ make
mkdir -p depends/x86_64
perl ../../../../../make/fastdep.pl -I. -I../../../../../include -I../../../../../samples/opencl/SDKUtil/include -I../../../../../samples/bolt/BoltUtil/include -I../../../../../samples/C++Amp/AmpUtil/include --obj-suffix='.o' --obj-prefix='build/debug/x86_64//' MatrixMulImage.cpp > depends/x86_64/MatrixMulImage.depend
mkdir -p build/debug/x86_64/
Building build/debug/x86_64//MatrixMulImage.o
g++ -Wpointer-arith -Wfloat-equal -g3 -ffor-scope -I ../../../../../samples/opencl/SDKUtil/include -I ../../../../../samples/bolt/BoltUtil/include -I ../../../../../samples/C++Amp/AmpUtil/include -I "/opt/AMDAPP/include" -I ../../../../../include -o build/debug/x86_64//MatrixMulImage.o -c MatrixMulImage.cpp
Building build/debug/x86_64/MatrixMulImage
g++ -o build/debug/x86_64/MatrixMulImage build/debug/x86_64//MatrixMulImage.o -lpthread -ldl -L/usr/X11R6/lib -lSDKUtil -lOpenCL -L../../../../../lib/x86_64 -L../../../../../TempSDKUtil/lib/x86_64 -L"/opt/AMDAPP/lib/x86_64"
install -D build/debug/x86_64/MatrixMulImage ../../../../../samples/opencl/bin/x86_64/MatrixMulImage
for f in MatrixMulImage_Kernels.cl; do \
install -D $f ../../../../../samples/opencl/bin/x86_64/$f; \
done
Old 2013-06-18, 15:06   #10
msft
 
 
Jul 2009
Tokyo

2×5×61 Posts

Here is the 0.27 a.out binary, built on Ubuntu 12.04 LTS.
Quote:
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/clAmdFft-1.10.321/lib64/
time ./a.out 607
Attached Files
File Type: bz2 a.out.bz2 (509.7 KB, 373 views)
Old 2013-06-18, 18:27   #11
kracker
 
 
"Mr. Meeseeks"
Jan 2012
California, USA

3²×241 Posts

I tried booting Linux after not using it for around a month, I think. For some reason it wouldn't boot, so I tried reinstalling both Ubuntu and Fedora. No luck: it boots to a black screen, and neither the fallback graphics mode nor nomodeset helped... damn Linux :P

Last fiddled with by kracker on 2013-06-18 at 18:28
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.