Forum: mersenne.ca
2021-03-02, 19:04
|
Replies: 540
Views: 58,378
|
Forum: Software
2021-02-20, 08:48
|
Replies: 39
Views: 5,434
3xSP sum()
Unfortunately, the sum() I have so far is a beast: 54 ADDs.
That seems very expensive for a sum().
To see some corner-cases that sum() must handle, here is one example: given "x", we'd...
|
Forum: GPU Computing
2021-02-20, 08:42
|
Replies: 7
Views: 508
|
Forum: Software
2021-02-12, 21:02
|
Replies: 39
Views: 5,434
Figure 10 seems to indicate:
c0,e0 = twoSum(a0, b0)
d1,e11 = twoSum(a1, b1)
c1,e12 = twoSum(d1, e0)
c2 = a2 + b2 + e11 + e12
which looks pretty good (i.e. simpler than I was expecting)
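A minimal sketch of the scheme above, assuming index 0 is the most significant component and that twoSum is Knuth's branch-free error-free addition (the post does not say which variant is meant). The names two_sum, add3 and exact are hypothetical, and Python floats are double precision rather than SP; the error-free property of twoSum holds in either format.

```python
from fractions import Fraction

def two_sum(a, b):
    """Return (s, e) where s is the rounded sum a + b and e is the rounding
    error, so that s + e equals a + b exactly (Knuth's branch-free variant)."""
    s = a + b
    b_virtual = s - a          # the part of b that made it into s
    a_virtual = s - b_virtual  # the part of a that made it into s
    e = (a - a_virtual) + (b - b_virtual)
    return s, e

def add3(a, b):
    """Add two 3-component numbers following the four steps quoted above.
    Assumption: component 0 is the most significant one."""
    a0, a1, a2 = a
    b0, b1, b2 = b
    c0, e0 = two_sum(a0, b0)
    d1, e11 = two_sum(a1, b1)
    c1, e12 = two_sum(d1, e0)
    c2 = a2 + b2 + e11 + e12   # plain rounded adds; errors here are dropped
    return c0, c1, c2

def exact(*xs):
    """Exact rational sum of floats, used only for checking."""
    return sum(Fraction(x) for x in xs)

a = (1.0, 2.0**-60, 0.0)
b = (2.0**-53, 2.0**-70, 0.0)
c = add3(a, b)
print(c, exact(*c) == exact(*a, *b))
```

Note that this sketch does not renormalize the result, so the components of the output may overlap; whether that matters depends on how the result is used next.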
|
Forum: Software
2021-02-12, 19:45
|
Replies: 39
Views: 5,434
|
Forum: Software
2021-02-12, 08:42
|
Replies: 39
Views: 5,434
SP plan
I've been thinking some more about a practical SP FFT implementation on GPUs, and here are some problems/ideas:
1. FFT twiddles, i.e. the trigonometric constants (sin+cos) used in the FFT.
...
|
Forum: GpuOwl
2021-02-06, 20:04
|
Replies: 48
Views: 5,568
|
Forum: GpuOwl
2021-02-04, 06:16
|
Replies: 48
Views: 5,568
|
Forum: GpuOwl
2021-02-03, 21:40
|
Replies: 48
Views: 5,568
GpuOwl updated P-1 calculator
Hi, recently I revisited the P-1 calculator that's included with GpuOwl's source code https://github.com/preda/gpuowl/blob/master/pm1/pm1.cpp
The calculator is a small standalone C++ program; to...
|
Forum: GPU Computing
2021-01-17, 16:42
|
Replies: 21
Views: 1,255
|
Forum: Hardware
2021-01-05, 10:12
|
Replies: 128
Views: 10,338
The cache (L1/L2/L3) is used transparently for the *global* memory operations. It is managed automatically by the cache controller (probably a variant of LRU), not explicitly by the software. So yes,...
|
Forum: GpuOwl
2020-12-13, 04:31
|
Replies: 197
Views: 15,328
|
Forum: GpuOwl
2020-12-06, 22:58
|
Replies: 2,691
Views: 222,420
|
Forum: Hardware
2020-12-06, 22:54
|
Replies: 12
Views: 1,225
Hardware donation and possible meet-ups
I may have some extra hardware located in Sydney (Australia), and I'm considering donating it based on GIMPS participation. Is anybody [else] from the forum living in Sydney?
Now that I think of it,...
|
Forum: GpuOwl
2020-12-01, 23:42
|
Replies: 2,691
Views: 222,420
|
Forum: GpuOwl
2020-11-24, 21:38
|
Replies: 197
Views: 15,328
"has slowed down" -- relative to what? did you...
"has slowed down" -- relative to what? did you compare two versions, to see what is the difference between them? Let's call the two versions you compare "before" and "after", or "good" and "bad"....
|
Forum: GpuOwl
2020-11-23, 22:23
|
Replies: 197
Views: 15,328
No, that is not possible with the current savefiles; they are most probably different between GpuOwl and mprime.
OTOH mprime may be offering the merged PRP+P1 at some point in the future.
|
Forum: GpuOwl
2020-11-23, 09:53
|
Replies: 197
Views: 15,328
Interesting. I haven't tried FFT 6M myself yet (I'm still on 5.5M), I probably should.
-time helps a bit with timing the kernels. Sometimes running with -time old/new and comparing may provide a...
|
Forum: GpuOwl
2020-11-23, 02:58
|
Replies: 197
Views: 15,328
|
Forum: GpuOwl
2020-11-13, 22:57
|
Replies: 2,691
Views: 222,420
|
Forum: GpuOwl
2020-11-13, 20:10
|
Replies: 2,691
Views: 222,420
It is simply a misreport from GpuOwl: when it finds a P-1 factor, the part that writes the result no longer knows whether it was found in stage1 or in stage2. It could write the result with...
|
Forum: GpuOwl
2020-11-12, 21:54
|
Replies: 197
Views: 15,328
I think ROCm 3.3 is good. Also, any version after 3.5 should work fine, but may be slower than 3.3. You can try; it's now easier to install multiple versions of ROCm OpenCL in parallel (at the same time) and...
|
Forum: Math
2020-11-11, 00:06
|
Replies: 14
Views: 933
|
Forum: Math
2020-11-10, 23:33
|
Replies: 14
Views: 933
|
Forum: GpuOwl
2020-11-10, 22:48
|
Replies: 2,691
Views: 222,420
|