![]() |
|
|
#1 |
|
Feb 2012
34×5 Posts |
|
|
|
|
|
|
#2 |
|
Apr 2010
Over the rainbow
19·137 Posts |
I don't know if P95 can handle more than 32 core. I recall a thread that stated that even with 2 instance of P95 running, it wouldn't use the 48 core of the machine.
|
|
|
|
|
|
#3 |
|
Bemusing Prompter
"Danny"
Dec 2002
California
2×32×7×19 Posts |
I may be wrong, but I believe the 64-bit version of Prime95 could handle 64 threads.
|
|
|
|
|
|
#4 |
|
Romulan Interpreter
Jun 2011
Thailand
23×419 Posts |
that would not be a problem, virtual box (or equivalent), ten copies of the guest OS running with 32 cores allocated to each, P95 running there... just give me the wheelbarrow with the 320 cores... For the phi, the real problem will be its common caches, in fact.. because no matter how fast your dog can eat, you still must be able to feed him so fast...
So, in spite of having 7-8 times more cores then a "top" 8-cores Xeon E5-2670, it can only get 3-times more DP performance. That would be about 5- to 6-times faster compared with a i7-2600k (as everybody know what a 2600k is, but not everybody knows what a e5-2xxx is ).edit: and this is for clock per clock, but don't forget the 2600k has a faster clock, and runs much overclocked too, in water-cooled rigs. The pessimist in me says we won't see a more than 5 times performance increase, and that is already reachable with cudaLucas. Next big hit would be a FFT library for Radeons... Last fiddled with by LaurV on 2013-05-23 at 05:40 |
|
|
|
|
|
#5 |
|
∂2ω=0
Sep 2002
República de California
5×17×137 Posts |
TObject has it right as far as maximizing overall throughput - 1 LL test per core, or perhaps one LL test per 2-4 physically 'clustered' cores. Would be interesting to experiment and see what the optimal test/core matching strategy is here.
|
|
|
|
|
|
#6 | ||
|
"Oliver"
Mar 2005
Germany
11×101 Posts |
Quote:
Quote:
Xeon Phi
I've tried ewmeyers mlucas. I did compile, It runs singlethreaded (only SSE code is multithread which I can't use), and it is horrible slow: mlucas singlethreaded on a E5-2670 was ~50 times faster than a single core of Xeon Phi in my test. I failed to run glucas on Xeon Phi. Oliver P.S. My feeling tells me that you'll need to code an LL test explicit for Xeon Phi to achive good performance... but how many users would run it? There are no consumer cards for Xeon Phi. |
||
|
|
|
|
|
#7 |
|
Feb 2012
34·5 Posts |
The Xeon + Xeon Phi powered supercomputer takes the top spot. Titan moves to the second.
http://www.anandtech.com/show/7075/j...takes-top-spot |
|
|
|
|
|
#8 | |
|
∂2ω=0
Sep 2002
República de California
5·17·137 Posts |
Quote:
Oliver, I have just finished work on a beta version of Mlucas with pthreading-also-for-non-SIMD-builds ... PM me if you're interested in giving it a try. It can make use of up to 32 threads in LL-test mode [and up to 64 in Fermat-mod mode]. |
|
|
|
|
|
|
#9 |
|
Jul 2003
So Cal
211110 Posts |
I've played with the Xeon Phi. As previously mentioned, you need to recompile the code for the Phi without ASM, and to sufficiently hide memory latency you need to run 240 threads.
|
|
|
|
|
|
#10 |
|
Bemusing Prompter
"Danny"
Dec 2002
California
2·32·7·19 Posts |
Does anyone know if Xeon Phi would be useful for trial factoring?
|
|
|
|
|
|
#11 |
|
Aug 2006
3·1,993 Posts |
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Xeon vs. Quad CPU (775) | EdH | Hardware | 19 | 2017-06-08 22:06 |
| Motherboard for Xeon | ATH | Hardware | 7 | 2015-10-10 02:13 |
| Intel® Xeon Phi | pinhodecarlos | Hardware | 2 | 2015-02-10 18:42 |
| New Xeon | firejuggler | Hardware | 8 | 2014-09-10 06:37 |
| Dual Xeon Help | euphrus | Software | 12 | 2005-07-21 14:47 |