mersenneforum.org  

Go Back   mersenneforum.org > Prime Search Projects > Twin Prime Search

Reply
 
Thread Tools
Old 2010-09-17, 16:57   #34
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA

2·47·67 Posts
Default

Quote:
Originally Posted by Ken_g6 View Post
That's probably the cause. Some output goes to stderr (so that BOINC will save it, and/or because I'm lazy and haven't sorted out what shouldn't go to stderr ). Try a 2>&1 before your tee.
Ah, I hadn't thought of that. Okay, I'll try the 2>&1 next time.
mdettweiler is offline   Reply With Quote
Old 2010-09-19, 05:28   #35
Oddball
 
Oddball's Avatar
 
May 2010

1F316 Posts
Default

Would Ken or anyone else be interested in developing a CUDA sieve for a fixed n? If so, it could be used on the n=1000000 sieve for the Operation Megabit Twin subproject.

Last fiddled with by Oddball on 2010-09-19 at 05:28
Oddball is offline   Reply With Quote
Old 2010-09-19, 17:53   #36
Ken_g6
 
Ken_g6's Avatar
 
Jan 2005
Caught in a sieve

5×79 Posts
Default

Forget what I said here before; that was a fixed-K sieve.

The main obstacle to a fixed-N sieve is producing a list of primes to sieve against on the GPU. It might be doable.

Last fiddled with by Ken_g6 on 2010-09-19 at 17:59
Ken_g6 is offline   Reply With Quote
Old 2010-09-19, 19:08   #37
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA

2×47×67 Posts
Default

Quote:
Originally Posted by Ken_g6 View Post
Forget what I said here before; that was a fixed-K sieve.

The main obstacle to a fixed-N sieve is producing a list of primes to sieve against on the GPU. It might be doable.
I don't suppose tpsieve itself works for fixed-n sieves? Or is it just that it's way slower than NewPGen for such?
mdettweiler is offline   Reply With Quote
Old 2010-09-19, 19:23   #38
Ken_g6
 
Ken_g6's Avatar
 
Jan 2005
Caught in a sieve

5·79 Posts
Default

Yeah, TPSieve-CUDA should work for a single N. It might even be faster than NewPGen, per CPU core run. And you might be able to run two or more processes at a time. But there would be a lot of bandwidth back and forth.
Ken_g6 is offline   Reply With Quote
Old 2010-09-19, 20:43   #39
Oddball
 
Oddball's Avatar
 
May 2010

499 Posts
Default

Quote:
Originally Posted by Ken_g6 View Post
Yeah, TPSieve-CUDA should work for a single N.
It doesn't work for large k ranges, though. From:

http://www.mersenneforum.org/showpos...6&postcount=86

Quote:
tpsieve (64-bit) refuses to sieve it:

Code:
tpsieve version 0.3.4 (testing)Compiled Sep 7 2009 with GCC 4.3.3kmax-kmin < 3*2^36 is required
edit: Is there any way you can get around the kmax-kmin < 3*2^36 limitation for the CPU version of TPSieve?

Last fiddled with by Oddball on 2010-09-19 at 20:47
Oddball is offline   Reply With Quote
Old 2010-09-19, 20:53   #40
Ken_g6
 
Ken_g6's Avatar
 
Jan 2005
Caught in a sieve

5·79 Posts
Default

That is an immense K range! The bitmap wouldn't fit in memory if you're getting that big. And besides, TPSieve wouldn't work until pmin > kmax.

How many candidates do you have in that range? Are you just sieving for twins, or for more specialized forms?
Ken_g6 is offline   Reply With Quote
Old 2010-09-19, 21:26   #41
Oddball
 
Oddball's Avatar
 
May 2010

49910 Posts
Default

Quote:
Originally Posted by Ken_g6 View Post
That is an immense K range! The bitmap wouldn't fit in memory if you're getting that big. And besides, TPSieve wouldn't work until pmin > kmax.

How many candidates do you have in that range? Are you just sieving for twins, or for more specialized forms?
We're sieving k=1-120T for sophies and twins at first (until p=100G), but it's now a twin only sieve:

http://www.mersenneforum.org/showthread.php?t=13439

The file has about 85M candidates.
Oddball is offline   Reply With Quote
Old 2010-09-19, 22:51   #42
kar_bon
 
kar_bon's Avatar
 
Mar 2006
Germany

2·1,531 Posts
Default

Quote:
Originally Posted by Oddball View Post
We're sieving k=1-120T for sophies and twins at first (until p=100G), but it's now a twin only sieve:

http://www.mersenneforum.org/showthread.php?t=13439

The file has about 85M candidates.
The file for k=1-120T at p=45T posted by Amphoria contains 78451842 candidates.
kar_bon is offline   Reply With Quote
Old 2010-09-27, 18:14   #43
Ken_g6
 
Ken_g6's Avatar
 
Jan 2005
Caught in a sieve

5×79 Posts
Default

Hi, all,

I have a new version of TPSieve-CUDA, V0.2.0, out for testing. It's a lot faster than the current version, but I've added one algorithm I'm not completely sure about. I need people to test this new version against previous sieve files, where 22 <= nstep <= 31 and p > 9T. (Just make sure it says that nstep is changed to 22.) And let me know if it misses any factors.

If this works, I have big plans for this app. One of those involves improving its speed a little more still. But if this app misses any factors, I need to know ASAP so I can re-arrange stuff.

Thanks!
Ken_g6 is offline   Reply With Quote
Old 2010-09-27, 21:16   #44
amphoria
 
amphoria's Avatar
 
"Dave"
Sep 2005
UK

23×347 Posts
Default

I have re-run 510T-511T of 480000-485000 and the factors match. I used the default value of -m as the meaning has either changed or the new code puts a greater load on the GPU. My original best value of 24576 caused the code to grind to a halt. With the default value I got an improvement from 138M p/sec (CPU 0.41) to 195M p/sec (CPU 0.57).
amphoria is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
Fast Mersenne Testing on the GPU using CUDA Andrew Thall GPU Computing 109 2014-07-28 22:14
Inconsistent factors with TPSieve Caldera Twin Prime Search 7 2013-01-05 18:32
tpsieve-cuda slows down with increasing p amphoria Twin Prime Search 0 2011-07-23 10:52
Is TPSieve-0.2.1 faster than Newpgen? cipher Twin Prime Search 4 2009-05-18 18:36
Thread for non-PrimeNet LL testing ThomRuley Lone Mersenne Hunters 6 2005-10-16 20:11

All times are UTC. The time now is 13:35.


Fri Jul 7 13:35:08 UTC 2023 up 323 days, 11:03, 0 users, load averages: 1.44, 1.27, 1.22

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔