mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2016-03-29, 23:52   #1
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

7,351 Posts
Default Prime95 version 28.9 / 28.10

Prime95 version 28.9 build 2 is available. From whatsnew.txt:

Code:
1)  Since GPUs are so much better at trial factoring than CPUs, benchmarking no longer times
    mprime's trial factoring by default.  Two new benchmarking options are available:
    OnlyBenchThroughput and OnlyBenchMaxCPUs.  See undoc.txt for details.
2)  Slightly reduced the memory bandwidth requirements for several large FFTs.  May lead to
    about a 1% speed increase for users testing 100 million digit numbers.
3)  If running more than one worker, prime95 looks for any sin/cos data that it can share among
    the workers.  Depending on the FFT sizes you are running, this could lead to a very slight
    reduction in needed memory bandwidth.
4)  Method for choosing the best FFT implementation changed.  In previous versions, the FFT
    implementation that resulted in the fastest single worker timing was used.  In this version
    the FFT implementation that had the best throughput was selected.  For FMA3 FFTs I used a
    4-core Skylake to measure best throughput.  For AVX FFTs I used a 4-core Sandy Bridge
    to measure best throughput.  Not many FFTs were affected, but you may see a few percent
    variation in throughput with this version.
5)  Improved AVX2 trial factoring in 64-bit executable.  Trial factoring should still be done
    on a GPU.  A GPU is on the order of 100 times more efficient at trial factoring than a CPU!!!
6)  Trial factoring now defines one "iteration" as processing 128KB of sieve, or 1M possible
    factors.  In previous versions an iteration was defined as 16KB of sieve in 32-bit executables
    and 48KB in 64-bit executables.  The trial factoring benchmark still times processing 16KB of sieve.
7)  Trial factoring in 64-bit executables is now multi-threaded.
8)  On initial install, the default settings for number of workers will be set to
    the number of cores / 4 with multithreading turned on.
9)  The worker windows menu choice now enforces a minimum number of multi-threaded cores for some
    work types to ensure timely completion of assignments.  Also, the worker windows menu choice
    no longer allows assigning work to hyperthreads (they are rarely beneficial in mprime).
    This behavior can be overridden with the ConfigureHyperthreads undoc.txt feature.

Download links:
Windows 64-bit: ftp://mersenne.org/gimps/p95v2810.win64.zip
Linux 64-bit: ftp://mersenne.org/gimps/p95v2810.linux64.tar.gz
Mac OS X: ftp://mersenne.org/gimps/p95v289.MacOSX.zip
FreeBSD 10 64-bit: ftp://mersenne.org/gimps/p95v289.FreeBSD10-64.tar.gz (not ready yet)
Windows 32-bit: ftp://mersenne.org/gimps/p95v2810.win32.zip
Linux 32-bit: ftp://mersenne.org/gimps/p95v2810.linux32.tar.gz
Source: ftp://mersenne.org/gimps/p95v2810.source.zip (not ready yet)

Last fiddled with by Prime95 on 2016-11-01 at 17:52
Prime95 is offline   Reply With Quote
Old 2016-03-29, 23:53   #2
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

162678 Posts
Default

1. Crash is possible when multithreading TF. Fixed in 28.10.
2. Results are not saved when a SIGHUP signal is received. Fixed in 28.10.
3. If a termination signal is received while mprime is awaiting user input from the menus or dialogs, then mprime does not terminate. Fixed in 28.10.
4. There is an intermittent bug in calculating how long an LL test takes to complete. This can trigger unwanted unreserving of exponents. This bug IS NOT fixed. There is a new undoc.txt option, UnreserveExponents, which will protect 99.9% of users from this bug. This new option is implemented in 28.10.
5. SSE2 torture test generated random errors and crashes. Fixed in 28.10.
6. Multithreaded PRP tests can produce incorrect results. The number being tested must generate carries into more than 4 FFT words - this happens for larger k,b,c combinations testing k*b^n+c. It also only happens for FFTs with a quantity that I call "cache line multiplier" (clm) is one. There is no way to tell if clm is one other than by looking at the mult.asm source code. Fixed in gwnum 28.12 and prime95 29.1 build 10.

Last fiddled with by Prime95 on 2017-02-18 at 05:29
Prime95 is offline   Reply With Quote
Old 2016-03-29, 23:59   #3
Prime95
P90 years forever!
 
Prime95's Avatar
 
Aug 2002
Yeehaw, FL

7,351 Posts
Default

You'll note in the whatsnew that I used a different method for choosing the best FFT implementation. Interestingly, on my Skylake system the 4M FFT now get more throughput running with four threads than running four single-threaded workers.

My Sandy Bridge system still runs better single-threaded.

I'm curious what changes y'all see in throughput. I use these options in prime.txt (feel free to try more FFT sizes than 4M):

Code:
MinBenchFFT=4096
MaxBenchFFT=4096
BenchTime=30
BenchMultithreads=1
OnlyBenchThroughput=1
OnlyBenchMaxCPUs=1
OnlyBench5678=0
BenchAllComplex=0
Prime95 is offline   Reply With Quote
Old 2016-03-30, 01:51   #4
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

1015610 Posts
Default

Quote:
Originally Posted by Prime95 View Post
I'm curious what changes y'all see in throughput. I use these options in prime.txt (feel free to try more FFT sizes than 4M):

Code:
MinBenchFFT=4096
MaxBenchFFT=4096
BenchTime=30
BenchMultithreads=1
OnlyBenchThroughput=1
OnlyBenchMaxCPUs=1
OnlyBench5678=0
BenchAllComplex=0
Run with the settings given above:
Compare your results to other computers at http://www.mersenne.org/report_benchmarks
AMD FX(tm)-8350 Eight-Core Processor
CPU speed: 4000.00 MHz, 8 cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, FMA
L1 cache size: 16 KB
L2 cache size: 2 MB, L3 cache size: 8 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
L1 TLBS: 64
L2 TLBS: 1024
Prime95 64-bit version 28.9, RdtscTiming=1

Timings for 4096K FFT length (8 cpus, 1 worker): 13.32 ms. Throughput: 75.07 iter/sec.
Timings for 4096K FFT length (8 cpus, 2 workers): 25.03, 24.89 ms. Throughput: 80.13 iter/sec.
Timings for 4096K FFT length (8 cpus, 4 workers): 46.41, 48.23, 49.91, 45.72 ms. Throughput: 84.19 iter/sec.
Timings for 4096K FFT length (8 cpus, 8 workers): 90.94, 88.40, 90.99, 91.99, 88.38, 94.63, 90.25, 90.02 ms. Throughput: 88.24 iter/sec.

FWIW: The FX-8350 actually has L2 caches of 2048KB x 4. That is, 2MB L2 for each of the 4 [2 integer units, 1 FPU] groups. I don't know if this detection makes any difference in the way the program runs.
The L1 I-cache is also x4, which makes another good argument for treating this chip as a quad when running P95.

Last fiddled with by kladner on 2016-03-30 at 02:51
kladner is offline   Reply With Quote
Old 2016-03-30, 09:55   #5
ATH
Einyen
 
ATH's Avatar
 
Dec 2003
Denmark

5×607 Posts
Default

Haswell-E 5960X:

Code:
							Prime95 28.7 build 1		Prime95 28.9 build 2

Timings for 2048K FFT length (8 cpus, 1 worker):	Throughput: 830.76 iter/sec.	Throughput: 810.55 iter/sec.
Timings for 2048K FFT length (8 cpus, 2 workers):	Throughput: 747.78 iter/sec.	Throughput: 756.74 iter/sec.
Timings for 2048K FFT length (8 cpus, 4 workers):	Throughput: 726.44 iter/sec.	Throughput: 730.66 iter/sec.
Timings for 2048K FFT length (8 cpus, 8 workers):	Throughput: 726.48 iter/sec.	Throughput: 740.79 iter/sec.

Timings for 3072K FFT length (8 cpus, 1 worker):	Throughput: 523.80 iter/sec.	Throughput: 526.75 iter/sec.
Timings for 3072K FFT length (8 cpus, 2 workers):	Throughput: 478.93 iter/sec.	Throughput: 487.17 iter/sec.
Timings for 3072K FFT length (8 cpus, 4 workers):	Throughput: 479.19 iter/sec.	Throughput: 487.56 iter/sec.
Timings for 3072K FFT length (8 cpus, 8 workers):	Throughput: 483.86 iter/sec.	Throughput: 488.03 iter/sec.

Timings for 4096K FFT length (8 cpus, 1 worker):	Throughput: 360.31 iter/sec.	Throughput: 369.94 iter/sec.
Timings for 4096K FFT length (8 cpus, 2 workers):	Throughput: 335.23 iter/sec.	Throughput: 358.31 iter/sec.
Timings for 4096K FFT length (8 cpus, 4 workers):	Throughput: 352.46 iter/sec.	Throughput: 359.27 iter/sec.
Timings for 4096K FFT length (8 cpus, 8 workers):	Throughput: 355.22 iter/sec.	Throughput: 361.74 iter/sec.

Timings for 4608K FFT length (8 cpus, 1 worker):	Throughput: 323.82 iter/sec.	Throughput: 322.00 iter/sec.
Timings for 4608K FFT length (8 cpus, 2 workers):	Throughput: 303.35 iter/sec.	Throughput: 314.77 iter/sec.
Timings for 4608K FFT length (8 cpus, 4 workers):	Throughput: 318.59 iter/sec.	Throughput: 319.50 iter/sec.
Timings for 4608K FFT length (8 cpus, 8 workers):	Throughput: 320.56 iter/sec.	Throughput: 320.31 iter/sec.
"Real" exponent test on my current 67.4 M assignment (3584K FFT). Screen output set to every 10,000 iterations. Watched 10-15 screen outputs for each version:

28.7 build 1:
Between 2.373 ms/iter and 2.388 ms/iter:
Throughput: 418 iter/sec to 421 iter/sec

28.9 build 2:
Between 2.346 ms/iter and 2.368 ms/iter:
Throughput: 422 iter/sec to 426 iter/sec
ATH is online now   Reply With Quote
Old 2016-03-30, 11:57   #6
xtreme2k
 
xtreme2k's Avatar
 
Aug 2002

2×3×29 Posts
Default

Intel(R) Xeon(R) CPU E5-2697 v3 @ 2.60GHz
CPU speed: 2594.02 MHz, 14 hyperthreaded cores
CPU features: Prefetch, SSE, SSE2, SSE4, AVX, AVX2, FMA
L1 cache size: 32 KB
L2 cache size: 256 KB, L3 cache size: 35 MB
L1 cache line size: 64 bytes
L2 cache line size: 64 bytes
TLBS: 64

V28.7
Code:
Timings for 4096K FFT length (14 cpus, 1 worker):  1.99 ms.  Throughput: 503.07 iter/sec.
Timings for 4096K FFT length (14 cpus, 2 workers):  4.82,  4.88 ms.  Throughput: 412.40 iter/sec.
Timings for 4096K FFT length (14 cpus, 7 workers): 18.85, 18.80, 18.70, 18.70, 18.69, 18.82, 18.82 ms.  Throughput: 372.94 iter/sec.
Timings for 4096K FFT length (14 cpus, 14 workers): 37.27, 37.12, 37.28, 36.84, 36.66, 37.22, 36.70, 36.84, 36.63, 37.32, 37.47, 53.01, 37.23, 37.44 ms.  Throughput: 369.48 iter/sec.
V28.9 b2
Code:
Timings for 4096K FFT length (14 cpus, 1 worker):  1.89 ms.  Throughput: 530.30 iter/sec.
Timings for 4096K FFT length (14 cpus, 2 workers):  4.72,  4.73 ms.  Throughput: 423.22 iter/sec.
Timings for 4096K FFT length (14 cpus, 7 workers): 18.98, 18.52, 18.42, 18.36, 18.55, 18.69, 18.53 ms.  Throughput: 376.80 iter/sec.
Timings for 4096K FFT length (14 cpus, 14 workers): 38.80, 37.37, 37.29, 37.22, 37.06, 37.33, 37.14, 37.02, 37.29, 37.16, 37.82, 38.33, 37.59, 37.65 ms.  Throughput: 373.34 iter/sec.
xtreme2k is offline   Reply With Quote
Old 2016-03-30, 14:42   #7
ramshanker
 
ramshanker's Avatar
 
"Ram Shanker"
May 2015
Delhi

1001012 Posts
Default Feature Request

I sometimes test my office PC using Prime95. Internet connection is blocked in our office, at least for junior employees like me. So the Coms window just goes crazy and posts all the garbage returned by corporate firewall.

So I would like to request that whenever the response is "HTTP 403" i.e. access denied, as is the case with me, it should simply output "Internet Blocked" or something like that. Here is the comms window output.

[Main thread Mar 30 16:26] Mersenne number primality test program version 28.7
[Main thread Mar 30 16:26] Optimizing for CPU architecture: Core i3/i5/i7, L2 cache size: 256 KB, L3 cache size: 8 MB
[Main thread Mar 30 16:26] Logical CPUs 1,2 form one physical CPU.
[Main thread Mar 30 16:26] Logical CPUs 3,4 form one physical CPU.
[Main thread Mar 30 16:26] Logical CPUs 5,6 form one physical CPU.
[Main thread Mar 30 16:26] Logical CPUs 7,8 form one physical CPU.
[Comm thread Mar 30 16:26] Updating computer information on the server
[Main thread Mar 30 16:26] Starting worker.
[Comm thread Mar 30 16:26] PnErrorResult value missing. Full response was:
[Comm thread Mar 30 16:26] <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
[Comm thread Mar 30 16:26] <HTML><HEAD><TITLE>Error Message</TITLE>
[Comm thread Mar 30 16:26] <META http-equiv=Content-Type content="text/html; charset=UTF-8">
[Comm thread Mar 30 16:26] <STYLE id=L_default_1>A {
[Comm thread Mar 30 16:26] FONT-WEIGHT: bold; FONT-SIZE: 10pt; COLOR: #005a80; FONT-FAMILY: tahoma
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] A:hover {
[Comm thread Mar 30 16:26] FONT-WEIGHT: bold; FONT-SIZE: 10pt; COLOR: #0d3372; FONT-FAMILY: tahoma
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] TD {
[Comm thread Mar 30 16:26] FONT-SIZE: 8pt; FONT-FAMILY: tahoma
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] TD.titleBorder {
[Comm thread Mar 30 16:26] BORDER-RIGHT: #955319 1px solid; BORDER-TOP: #955319 1px solid; PADDING-LEFT: 8px; FONT-WEIGHT: bold; FONT-SIZE: 12pt; VERTICAL-ALI
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] TD.titleBorder_x {
[Comm thread Mar 30 16:26] BORDER-RIGHT: #955319 0px solid; BORDER-TOP: #955319 1px solid; PADDING-LEFT: 8px; FONT-WEIGHT: bold; FONT-SIZE: 12pt; VERTICAL-ALI
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] .TitleDescription {
[Comm thread Mar 30 16:26] FONT-WEIGHT: bold; FONT-SIZE: 12pt; COLOR: black; FONT-FAMILY: tahoma
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] SPAN.explain {
[Comm thread Mar 30 16:26] FONT-WEIGHT: normal; FONT-SIZE: 10pt; COLOR: #934225
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] SPAN.TryThings {
[Comm thread Mar 30 16:26] FONT-WEIGHT: normal; FONT-SIZE: 10pt; COLOR: #934225
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] .TryList {
[Comm thread Mar 30 16:26] MARGIN-TOP: 5px; FONT-WEIGHT: normal; FONT-SIZE: 8pt; COLOR: black; FONT-FAMILY: tahoma
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] .X {
[Comm thread Mar 30 16:26] BORDER-RIGHT: #955319 1px solid; BORDER-TOP: #955319 1px solid; FONT-WEIGHT: normal; FONT-SIZE: 12pt; BORDER-LEFT: #955319 1px soli
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] .adminList {
[Comm thread Mar 30 16:26] MARGIN-TOP: 2px
[Comm thread Mar 30 16:26] }
[Comm thread Mar 30 16:26] </STYLE>
[Comm thread Mar 30 16:26] <META content="MSHTML 6.00.2800.1170" name=GENERATOR></HEAD>
[Comm thread Mar 30 16:26] <BODY bgColor=#f3f3ed>
[Comm thread Mar 30 16:26] <TABLE cellSpacing=0 cellPadding=0 width="100%">
[Comm thread Mar 30 16:26] <TBODY>
[Comm thread Mar 30 16:26] <TR>
[Comm thread Mar 30 16:26] <TD class=titleborder_x width=30>
[Comm thread Mar 30 16:26] <TABLE height=25 cellSpacing=2 cellPadding=0 width=25 bgColor=black>
[Comm thread Mar 30 16:26] <TBODY>
[Comm thread Mar 30 16:26] <TR>
[Comm thread Mar 30 16:26] <TD class=x vAlign=center align=middle>X</TD>
[Comm thread Mar 30 16:26] </TR>
[Comm thread Mar 30 16:26] </TBODY>
[Comm thread Mar 30 16:26] </TABLE>
[Comm thread Mar 30 16:26] </TD>
[Comm thread Mar 30 16:26] <TD class=titleBorder id=L_default_2>Network Access Message:<SPAN class=TitleDescription> The page cannot be displayed</SPAN> </
[Comm thread Mar 30 16:26] </TR>
[Comm thread Mar 30 16:26] </TBODY>
[Comm thread Mar 30 16:26] </TABLE>
[Comm thread Mar 30 16:26]
[Comm thread Mar 30 16:26] <TABLE id=spacer>
[Comm thread Mar 30 16:26] <TBODY>
[Comm thread Mar 30 16:26] <TR>
[Comm thread Mar 30 16:26] <TD height=10></TD></TR></TBODY></TABLE>
[Comm thread Mar 30 16:26] <TABLE width=400>
[Comm thread Mar 30 16:26] <TBODY>
[Comm thread Mar 30 16:26] <TR>
[Comm thread Mar 30 16:26] <TD noWrap width=25></TD>
[Comm thread Mar 30 16:26] <TD width=400><SPAN class=explain><ID id=L_default_3><B>Explanation:</B></ID></SPAN><ID id=L_default_4> There is a problem with
[Comm thread Mar 30 16:26] <B><SPAN class=tryThings><ID id=L_default_5><B>Try the following:</B></ID></SPAN></B>
[Comm thread Mar 30 16:26] <UL class=TryList>
[Comm thread Mar 30 16:26] <LI id=L_default_6><B>Refresh page:</B> Search for the page again by clicking the Refresh button. The timeout may have occur
[Comm thread Mar 30 16:26] <LI id=L_default_7><B>Check spelling:</B> Check that you typed the Web page address correctly. The address may have been mistyped.
[Comm thread Mar 30 16:26] <LI id=L_default_8><B>Access from a link:</B> If there is a link to the page you are looking for, try accessing the page from that l
[Comm thread Mar 30 16:26]
[Comm thread Mar 30 16:26] </UL>
[Comm thread Mar 30 16:26] <ID id=L_default_9>If you are still not able to view the requested page, try contacting your administrator or Helpdesk.</ID> <BR><BR
[Comm thread Mar 30 16:26] </TD>
[Comm thread Mar 30 16:26] </TR>
[Comm thread Mar 30 16:26] </TBODY>
[Comm thread Mar 30 16:26] </TABLE>
[Comm thread Mar 30 16:26]
[Comm thread Mar 30 16:26] <TABLE id=spacer><TBODY><TR><TD height=15></TD></TR></TBODY></TABLE>
[Comm thread Mar 30 16:26]
[Comm thread Mar 30 16:26] <TABLE width=400>
[Comm thread Mar 30 16:26] <TBODY>
[Comm thread Mar 30 16:26] <TR>
[Comm thread Mar 30 16:26] <TD noWrap width=25></TD>
[Comm thread Mar 30 16:26] <TD width=400 id=L_default_10><B>Technical Information (for support personnel)</B>
[Comm thread Mar 30 16:26] <UL class=adminList>
[Comm thread Mar 30 16:26] <LI id=L_default_11>Error Code: 403 Forbidden. The ISA Server denied the specified Uniform Resource Locator (URL). (12202)
[Comm thread Mar 30 16:26] <LI id=L_default_12>IP Address: 107.155.126.203
[Comm thread Mar 30 16:26] <LI id=L_default_13>Date: 3/30/2016 10:58:38 AM [GMT]
[Comm thread Mar 30 16:26] <LI id=L_default_14>Server: XXXXXXXXXXXXXXXXXX
[Comm thread Mar 30 16:26] <LI id=L_default_15>Source: proxy
[Comm thread Mar 30 16:26]
[Comm thread Mar 30 16:26] </UL>
[Comm thread Mar 30 16:26] </TD>
[Comm thread Mar 30 16:26] </TR>
[Comm thread Mar 30 16:26] </TBODY>
[Comm thread Mar 30 16:26] </TABLE>
[Comm thread Mar 30 16:26]
[Comm thread Mar 30 16:26] </BODY>
[Comm thread Mar 30 16:26] </HTML>
[Comm thread Mar 30 16:26]
[Comm thread Mar 30 16:26] Visit http://mersenneforum.org for help.
[Comm thread Mar 30 16:26] Will try contacting server again in 70 minutes.

Last fiddled with by ramshanker on 2016-03-30 at 14:44
ramshanker is offline   Reply With Quote
Old 2016-03-30, 18:44   #8
science_man_88
 
science_man_88's Avatar
 
"Forget I exist"
Jul 2009
Dumbassville

26×131 Posts
Default

I decided to bench my system the other day results attached ( i3 2130 cpu so semi outdated because it's no longer made at last check 8 GB RAM):
Attached Files
File Type: txt results.txt (6.3 KB, 228 views)

Last fiddled with by science_man_88 on 2016-03-30 at 18:44
science_man_88 is offline   Reply With Quote
Old 2016-03-30, 19:44   #9
Fred
 
Fred's Avatar
 
"Ron"
Jan 2016
Fitchburg, MA

97 Posts
Default

For this older laptop, looks like a small decrease in performance.

i3-3120M laptop 4GB ram

Ver 28.7
Quote:
Timings for 4096K FFT length (2 cpus, 1 worker): 19.80 ms. Throughput: 50.51 iter/sec.
Timings for 4096K FFT length (2 cpus, 2 workers): 39.15, 38.44 ms. Throughput: 51.56 iter/sec.
Timings for 4096K FFT length (2 cpus hyperthreaded, 1 worker): 22.38 ms. Throughput: 44.67 iter/sec.
Timings for 4096K FFT length (2 cpus hyperthreaded, 2 workers): 44.77, 43.48 ms. Throughput: 45.34 iter/sec.
Ver. 28.9
Quote:
Timings for 4096K FFT length (2 cpus, 1 worker): 21.33 ms. Throughput: 46.88 iter/sec.
Timings for 4096K FFT length (2 cpus, 2 workers): 41.99, 41.78 ms. Throughput: 47.75 iter/sec.
Timings for 4096K FFT length (2 cpus hyperthreaded, 1 worker): 23.15 ms. Throughput: 43.20 iter/sec.
Timings for 4096K FFT length (2 cpus hyperthreaded, 2 workers): 45.86, 45.71 ms. Throughput: 43.69 iter/sec.

Last fiddled with by Fred on 2016-03-30 at 19:44
Fred is offline   Reply With Quote
Old 2016-03-30, 19:58   #10
Fred
 
Fred's Avatar
 
"Ron"
Jan 2016
Fitchburg, MA

97 Posts
Default

When upgrading (win64), is it sufficient to just replace the .exe?
Fred is offline   Reply With Quote
Old 2016-03-30, 21:53   #11
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
Rep├║blica de California

2×7×829 Posts
Default

Quote:
Originally Posted by kladner View Post
AMD FX(tm)-8350 Eight-Core Processor
CPU speed: 4000.00 MHz, 8 cores
CPU features: 3DNow! Prefetch, SSE, SSE2, SSE4, AVX, FMA
A note and a couple of questions re. your results summary:

o AMD's 8-int-core/4-fpu hybrid arch will likely act like an 8-core for integer-dominated work which is not too memory-intensive, for FPU-and-memory-heavy stuff like Prime95 it's effectively a 4-core.

o Does "FMA" here mean Intel-AVX2-style FMA3, or the AMD-only FMA4 which AMD introduced in late 2011 with their Bulldozer core?

o Has AMD fixed the issue noted by George in which their early AVX offerings performed worse running in AVX mode than they did in SSE2 mode? George, is there any way to force the program to run in SSE2 mode on a platform supporting both?
ewmayer is online now   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Prime95 version 27.3 Prime95 Software 148 2012-03-18 19:24
Prime95 version 26.3 Prime95 Software 76 2010-12-11 00:11
Prime95 version 25.5 Prime95 PrimeNet 369 2008-02-26 05:21
Prime95 version 25.4 Prime95 PrimeNet 143 2007-09-24 21:01
When the next prime95 version ? pacionet Software 74 2006-12-07 20:30

All times are UTC. The time now is 22:05.

Sat Feb 27 22:05:02 UTC 2021 up 86 days, 18:16, 0 users, load averages: 2.60, 2.13, 1.97

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.