Forum: GpuOwl
2021-04-03, 17:12
|
Replies: 16
Views: 743
|
Forum: Information & Answers
2021-03-31, 11:55
|
Replies: 7
Views: 326
|
Forum: Hardware
2021-03-29, 07:43
|
Replies: 16
Views: 732
The need for the general-MUL vs. MUL-3 only...
The need for the general-MUL vs. MUL-3 only appears when changing the "L" step dinamically during a test. This is something GpuOwl does not support (and thus gets away with using MUL-3), but prime95...
|
Forum: GpuOwl
2021-03-28, 18:58
|
Replies: 82
Views: 10,185
|
Forum: Hardware
2021-03-21, 20:02
|
Replies: 16
Views: 732
|
Forum: GpuOwl
2021-03-12, 20:31
|
Replies: 16
Views: 743
|
Forum: GpuOwl
2021-03-12, 08:56
|
Replies: 82
Views: 10,185
I recently got a Radeon VII with Samsung memory...
I recently got a Radeon VII with Samsung memory (as as RMA replacement). Even without any RAM overclock, and without any undervolt, that memory consistently generates errors. This is in contrast with...
|
Forum: GpuOwl
2021-03-12, 08:52
|
Replies: 16
Views: 743
|
Forum: GpuOwl
2021-03-10, 19:43
|
Replies: 82
Views: 10,185
|
Forum: GpuOwl
2021-03-10, 17:28
|
Replies: 82
Views: 10,185
|
Forum: mersenne.ca
2021-03-02, 19:04
|
Replies: 597
Views: 67,031
|
Forum: Software
2021-02-20, 08:48
|
Replies: 39
Views: 6,315
3xSP sum()
Unfortunately the sum() I have up to now is a beast: 54 ADDs.
This seems a rather very expensive sum()..
To see some corner-cases that sum() must handle, here is one example: given "x", we'd...
|
Forum: GPU Computing
2021-02-20, 08:42
|
Replies: 10
Views: 1,127
|
Forum: Software
2021-02-12, 21:02
|
Replies: 39
Views: 6,315
Figure 10 seems to indicate:
c0,e0 =...
Figure 10 seems to indicate:
c0,e0 = twoSum(a0, b0)
d1,e11 = twoSum(a1, b1)
c1,e12 = twoSum(d1, e0)
c2 = a2 + b2 + e11 + e12
which looks pretty good (i.e. simpler than I was expecting)
|
Forum: Software
2021-02-12, 19:45
|
Replies: 39
Views: 6,315
|
Forum: Software
2021-02-12, 08:42
|
Replies: 39
Views: 6,315
SP plan
I've been thinking some more about a practical SP FFT implementation on GPUs, and here are some problems/ideas:
1. FFT twiddles, i.e. the trigonometric constants (sin+cos) used in the FFT.
...
|
Forum: GpuOwl
2021-02-06, 20:04
|
Replies: 48
Views: 6,867
|
Forum: GpuOwl
2021-02-04, 06:16
|
Replies: 48
Views: 6,867
|
Forum: GpuOwl
2021-02-03, 21:40
|
Replies: 48
Views: 6,867
GpuOwl updated P-1 calculator
Hi, recently I revisited the P-1 calculator that's included with GpuOwl's source code https://github.com/preda/gpuowl/blob/master/pm1/pm1.cpp
The calculator is a small stanalone C++ program; to...
|
Forum: GPU Computing
2021-01-17, 16:42
|
Replies: 21
Views: 1,672
|
Forum: Hardware
2021-01-05, 10:12
|
Replies: 128
Views: 11,993
The cache (L1/L2/L3) is used transparently for...
The cache (L1/L2/L3) is used transparently for the *global* memory operations. It is managed automatically by the cache control (probably a variant of LRU), not explicitly by the software. So yes,...
|
Forum: GpuOwl
2020-12-13, 04:31
|
Replies: 199
Views: 17,706
|
Forum: GpuOwl
2020-12-06, 22:58
|
Replies: 2,695
Views: 240,754
|
Forum: Hardware
2020-12-06, 22:54
|
Replies: 12
Views: 1,478
Hardware donation and possible meet-ups
I may have some extra hardware located in Sydney (Australia), I'm considering donating it based on GIMPS participation. Is somebody [else] from the forum living in Sydney?
Now that I think of it,...
|
Forum: GpuOwl
2020-12-01, 23:42
|
Replies: 2,695
Views: 240,754
|