![]() |
|
|
#276 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
1000011110002 Posts |
12 DC's completed, all matched, and one overclocked gpu memory from 1125 to 1250 MHz.
This leads me to conclude that: AMD's memory is stronger than nVidia. CUDALucas is more intense on the card than clLucas. Or both. |
|
|
|
|
|
#277 |
|
Apr 2010
Over the rainbow
2×1,303 Posts |
From what I have read, non-stock 290X cooler are to be released for Christmas, not before.
However I've seen separate ( said compatible ) watercooler for 290X at around 100 $. Edit : It seems that, around the same time, a new low level API will be released (Mantel). It is essentially for gaming, but might be useful for computation. Last fiddled with by firejuggler on 2013-10-31 at 19:57 |
|
|
|
|
|
#278 |
|
Jul 2009
Tokyo
10011000102 Posts |
For AMD APP SDK v2.9
ubuntu: Code:
$ pwd /opt/AMDAPP/samples/opencl/cpp_cl $ tar -xvf clLucas.1.02.tar.bz2 $ cd clLucas.1.02/ $ sh -x ./run.sh + cmake . + make + export LD_LIBRARY_PATH=:/opt/clFFT-2.0/library/ + time ./clLucas 1398269 Platform :Advanced Micro Devices, Inc. Device 0 : Capeverde Build Options are : -D KHR_DP_EXTENSION start M1398269 fft length = 73728 Iteration 10000 0xa4a6d2f0e34629db, n = 73728 err = 0.07031 (0:16 real, 1.5492 ms/iter, ETA 35:37) ... Iteration 1390000 0x554ae339bfea8fae, n = 73728 err = 0.07812 (0:08 real, 0.8408 ms/iter, ETA 0:00) M( 1398269 )P, n = 73728, clLucas v1.02 |
|
|
|
|
|
#279 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
23×271 Posts |
One thing I've noticed, is that increasing -c (checkpoints) make it slightly faster for me(probably more on faster cards). I noticed the gpu slows or stops during a checkpoint, so I really push it up when running.
|
|
|
|
|
|
#280 |
|
"Jerry"
Nov 2011
Vancouver, WA
1,123 Posts |
-st2 passed on:
Code:
Select device - Get device info - Device 1/1: BeaverCreek (Advanced Micro Device s, Inc.), device version: OpenCL 1.2 AMD-APP (1268.1), driver version: 1268.1 (VM) Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomic s cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_ image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_count ers_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_dx9_media_sharing cl _amd_image2d_from_buffer_read_only Global memory:799014912, Global memory cache: 0, local memory: 32768, workgroup size: 256, Work dimensions: 3[256, 256, 256, 0, 0] , Max clock speed:444, comput e units:4 Compiling kernels (build options: "-I. -DVECTOR_SIZE=4 -O3 -DMORE_CLASSES -DCL_G PU_SIEVE")................. OpenCL device info name BeaverCreek (Advanced Micro Devices, Inc.) device (driver) version OpenCL 1.2 AMD-APP (1268.1) (1268.1 (VM)) maximum threads per block 256 maximum threads per grid 16777216 number of multiprocessors 4 (320 compute elements) clock rate 444MHz Automatic parameters threads per grid 256 optimizing kernels for APU |
|
|
|
|
|
#281 | |
|
Nov 2010
Germany
3·199 Posts |
Quote:
It must have been running for half a day for this extended test ;-) |
|
|
|
|
|
|
#282 |
|
"Jerry"
Nov 2011
Vancouver, WA
46316 Posts |
Woops, yes it was. And it actually took over 24 hours
|
|
|
|
|
|
#283 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
87816 Posts |
|
|
|
|
|
|
#284 |
|
"Yury Vorobyov"
Jul 2013
Chelyabinsk
238 Posts |
Guys. Empty condition - really?
Code:
if (fpi != NULL)
{
if (fgets (str, 132, fpi) == NULL);//line 1340
currentLine = atoi (str);
fclose (fpi);
printf ("Continue test of file '%s' at line %d\n",
input_filename, currentLine);
}
|
|
|
|
|
|
#285 |
|
"Yury Vorobyov"
Jul 2013
Chelyabinsk
19 Posts |
I know someone used Emacs on this. Please set tab size in Emacs to be equal standart of 4 and "linux" or other conventional style. Because "gnu" style does "zero size" or "virtual" tabs instead of "real tabs". In gnu style code looks like "one indent for all" on many machines.
As for me, I suppose Code:
(setq c-default-style "linux" c-basic-offset 4) Last fiddled with by sanaris on 2014-01-23 at 21:05 |
|
|
|
|
|
#286 |
|
Feb 2014
810 Posts |
I'm trying this out, and keep getting the following error:
Code:
X:\cllucas>clLucas_x64.exe -f 41943040 332233123 Platform :Advanced Micro Devices, Inc. Device 0 : Tahiti Build Options are : -D KHR_DP_EXTENSION start M332233123 fft length = 41943040 OPENCL_V_THROWERROR< CLFFT_NOTIMPLEMENTED > (772): Failed to clfftBakePlan. terminate called after throwing an instance of 'std::runtime_error' what(): OPENCL_V_THROWERROR< CLFFT_NOTIMPLEMENTED > (772): Failed to clfftBakePlan. This application has requested the Runtime to terminate it in an unusual way. Please contact the application's support team for more information. The problem goes away if I don't use the -f argument, but based on this thread I was under the impression that I have to use it, or it will either be slow or wrong. (If I can get away without the -f argument, then I have nothing to worry about.) I'm running a double-check on 20000003 now without the -f argument just to see what the result is, but I was wondering if anyone knows about this. |
|
|
|
![]() |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| mfakto: an OpenCL program for Mersenne prefactoring | Bdot | GPU Computing | 1676 | 2021-06-30 21:23 |
| Can't get OpenCL to work on HD7950 Ubuntu 14.04.5 LTS | VictordeHolland | Linux | 4 | 2018-04-11 13:44 |
| OpenCL accellerated lattice siever | pstach | Factoring | 1 | 2014-05-23 01:03 |
| OpenCL for FPGAs | TObject | GPU Computing | 2 | 2013-10-12 21:09 |
| AMD's Graphics Core Next- a reason to accelerate towards OpenCL? | Belteshazzar | GPU Computing | 19 | 2012-03-07 18:58 |