mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2009-01-13, 06:08   #34
CRGreathouse
 
CRGreathouse's Avatar
 
Aug 2006

135338 Posts
Default

I still don't understand your TF vs LL comment.

Quote:
Originally Posted by flouran View Post
Although it may not be REALLY fast, it would be interesting to do it nonetheless.
I think it would be very interesting. I'm expect it to run at about 4% the speed of a similarly clocked-and-cored Core 2: lose a factor of 12-14 on the double emulation and another factor of 2 because there's no machine code or other optimization specifically for the Cell. Better than 8% would surprise me; better than 20% would floor me. On the other end, performance less than 1% would probably indicate that something is set up wrong.

Edit: I mean 4% the performance of Prime95 on a Core 2. It could perhaps achieve 6-8% the performance of Glucas on a Core 2 (per core-clock).

Last fiddled with by CRGreathouse on 2009-01-13 at 06:10
CRGreathouse is offline   Reply With Quote
Old 2009-01-13, 06:11   #35
retina
Undefined
 
retina's Avatar
 
"The unspeakable one"
Jun 2006
My evil lair

22·1,549 Posts
Default

Quote:
Originally Posted by flouran View Post
Plus it is a gaming console so it should be fast (as compared to say the current market of the x86 family of processors).
Apples to oranges again. Faster at what? You still haven't answered that.
retina is online now   Reply With Quote
Old 2009-01-13, 06:13   #36
flouran
 
flouran's Avatar
 
Dec 2008

72×17 Posts
Talking

Quote:
Originally Posted by CRGreathouse View Post
I still don't understand your TF vs LL comment.



I think it would be very interesting. I'm expect it to run at about 4% the speed of a similarly clocked-and-cored Core 2: lose a factor of 12-14 on the double emulation and another factor of 2 because there's no machine code or other optimization specifically for the Cell. Better than 8% would surprise me; better than 20% would floor me. On the other end, performance less than 1% would probably indicate that something is set up wrong.

Edit: I mean 4% the performance of Prime95 on a Core 2. It could perhaps achieve 6-8% the performance of Glucas on a Core 2 (per core-clock).
Maybe I can skip double emulation and make it single emulation if I can compile Mlucas NATIVELY on Yellow Dog. We'll have to see.
flouran is offline   Reply With Quote
Old 2009-01-13, 06:14   #37
flouran
 
flouran's Avatar
 
Dec 2008

34116 Posts
Talking

Quote:
Originally Posted by retina View Post
Apples to oranges again. Faster at what? You still haven't answered that.
Would anyone like to prove either me or retina correct?
flouran is offline   Reply With Quote
Old 2009-01-13, 06:17   #38
retina
Undefined
 
retina's Avatar
 
"The unspeakable one"
Jun 2006
My evil lair

22·1,549 Posts
Default

Quote:
Originally Posted by flouran View Post
Would anyone like to prove either me or retina correct?
Hehe, you fail to realise that I have not made any claim to be proved/disproved. Only you have made the vague claim about something being fast. I am asking you fast at what. And your comparisons are all apples to oranges anyway so I'm not sure why I bother.
retina is online now   Reply With Quote
Old 2009-01-13, 06:23   #39
mdettweiler
A Sunny Moo
 
mdettweiler's Avatar
 
Aug 2007
USA (GMT-5)

141518 Posts
Default

Quote:
Originally Posted by flouran View Post
Would anyone like to prove either me or retina correct?
Well, sorry to bust your chops, but I think I can prove retina correct in that you are indeed comparing apples to oranges. TF is indeed much slower than LL *IF* you use TF *in place of* LL--which nobody in their right mind would do for GIMPS-size numbers since it would take a million years to prove the primality or nonprimality of a number that doesn't have any relatively small factors. However, if you use TF as a *supplement* to LL, which is what GIMPS is doing, then you can use TF to greatly improve overall throughput by eliminating candidates with relatively small factors (which can be found with relatively little TF work) so that you don't have the spend all that time doing an LL test for every single number.

Long story short, as retina said, a case of apples and oranges--TF and LL are both important parts of the overall GIMPS testing process, though they are for different stages of the process on each exponent. Thus, if a PS3 could do TF, it would help out with *that stage* of the process, but not with the LL stage of the process. However, since there is plenty of both LL and TF work that needs to be done yet for GIMPS, a PS3 doing TF would still be quite beneficial to the overall throughput of GIMPS.
mdettweiler is offline   Reply With Quote
Old 2009-01-13, 06:25   #40
flouran
 
flouran's Avatar
 
Dec 2008

72·17 Posts
Default

Quote:
Originally Posted by mdettweiler View Post
Well, sorry to bust your chops, but I think I can prove retina correct in that you are indeed comparing apples to oranges. TF is indeed much slower than LL *IF* you use TF *in place of* LL--which nobody in their right mind would do for GIMPS-size numbers since it would take a million years to prove the primality or nonprimality of a number that doesn't have any relatively small factors. However, if you use TF as a *supplement* to LL, which is what GIMPS is doing, then you can use TF to greatly improve overall throughput by eliminating candidates with relatively small factors (which can be found with relatively little TF work) so that you don't have the spend all that time doing an LL test for every single number.

Long story short, as retina said, a case of apples and oranges--TF and LL are both important parts of the overall GIMPS testing process, though they are for different stages of the process on each exponent. Thus, if a PS3 could do TF, it would help out with *that stage* of the process, but not with the LL stage of the process. However, since there is plenty of both LL and TF work that needs to be done yet for GIMPS, a PS3 doing TF would still be quite beneficial to the overall throughput of GIMPS.
TF it is on the PS3. Thanks for clearing it up.
flouran is offline   Reply With Quote
Old 2009-01-13, 06:31   #41
CRGreathouse
 
CRGreathouse's Avatar
 
Aug 2006

3·1,993 Posts
Default

Quote:
Originally Posted by flouran View Post
Maybe I can skip double emulation and make it single emulation if I can compile Mlucas NATIVELY on Yellow Dog. We'll have to see.
Single precision will require something like 3 times as many pieces and ~20 times as much time overall, I hear. I haven't done even rough calculations on that yet; perhaps I will tomorrow for kicks. But surely you're not serious in considering *rewriting* code to use single precision, are you? That might take me over a year to do myself (familiarizing myself with the code and the math + the actual coding).
CRGreathouse is offline   Reply With Quote
Old 2009-01-13, 06:32   #42
CRGreathouse
 
CRGreathouse's Avatar
 
Aug 2006

3×1,993 Posts
Default

Quote:
Originally Posted by flouran View Post
TF it is on the PS3. Thanks for clearing it up.
I don't know how fast TF would be on the processor; I don't know of anyone who has tried it. But it does seem more likely to be fast than LL on a Cell.
CRGreathouse is offline   Reply With Quote
Old 2009-01-13, 06:49   #43
flouran
 
flouran's Avatar
 
Dec 2008

83310 Posts
Talking

Quote:
Originally Posted by retina View Post
Hehe, you fail to realise that I have not made any claim to be proved/disproved. Only you have made the vague claim about something being fast. I am asking you fast at what. And your comparisons are all apples to oranges anyway so I'm not sure why I bother.
Do some research then if you are unsure... Besides, your comment, "And your comparisons are all apples to oranges anyway so I'm not sure why I bother." was unnecessary and if you have more things to say like that please don't post (on this thread at least) unless you wanna contribute to running MPrime on a PS3. By the way, I wasn't really trying to prove anything; I simply stated my own opinion, it was YOU who wanted my statement to be either proved or disproved.

Last fiddled with by flouran on 2009-01-13 at 06:58
flouran is offline   Reply With Quote
Old 2009-01-13, 06:50   #44
flouran
 
flouran's Avatar
 
Dec 2008

34116 Posts
Talking

Quote:
Originally Posted by CRGreathouse View Post
Single precision will require something like 3 times as many pieces and ~20 times as much time overall, I hear. I haven't done even rough calculations on that yet; perhaps I will tomorrow for kicks. But surely you're not serious in considering *rewriting* code to use single precision, are you? That might take me over a year to do myself (familiarizing myself with the code and the math + the actual coding).
Of course not, I won't write the code. Besides, if it does work, that's great; and if it doesn't, well then it doesn't.

Last fiddled with by flouran on 2009-01-13 at 06:52
flouran is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
128-bit OS'es and GIMPS? ixfd64 Software 22 2011-10-31 22:23
GIMPS Nub SayMoi Information & Answers 5 2009-04-06 15:29
GIMPS uses only 1 cpu Unregistered Information & Answers 7 2009-01-10 20:01
GIMPS should pay Vijay Lounge 40 2005-07-01 18:10
Why do you run GIMPS ? Prime Monster Lounge 12 2003-11-25 19:04

All times are UTC. The time now is 21:49.


Fri Jul 16 21:49:23 UTC 2021 up 49 days, 19:36, 2 users, load averages: 1.93, 1.91, 1.89

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.