![]() |
|
|
#177 |
|
Aug 2002
North San Diego County
5×137 Posts |
Indeed, Intel moved from solder to TIM with Ivy Bridge.
More details Intel even has a patent for fluxless solder to heat-spreaders Last fiddled with by sdbardwick on 2013-07-07 at 19:36 |
|
|
|
|
|
#178 | |
|
∂2ω=0
Sep 2002
República de California
19×613 Posts |
Quote:
|
|
|
|
|
|
|
#180 | |
|
"Kieren"
Jul 2011
In My Own Galaxy!
1015810 Posts |
Quote:
|
|
|
|
|
|
|
#181 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
165678 Posts |
OK, here is the first comparisons of prime95 with FMA3 coding optimizations. Haswell at 4.2GHz, DDR3-2400 memory.
First, timings with version 27.9. Code:
Intel(R) Core(TM) i5-4670K CPU @ 3.40GHz CPU speed: 4200.00 MHz, 4 cores Prime95 64-bit version 27.9, RdtscTiming=1 Best time for 768K FFT length: 3.08 ms., avg: 3.15 ms. Best time for 896K FFT length: 3.80 ms., avg: 3.85 ms. Best time for 1024K FFT length: 4.22 ms., avg: 4.64 ms. Best time for 1280K FFT length: 5.42 ms., avg: 5.44 ms. Best time for 1536K FFT length: 6.62 ms., avg: 6.66 ms. Best time for 1792K FFT length: 7.94 ms., avg: 8.03 ms. Best time for 2048K FFT length: 8.82 ms., avg: 8.90 ms. Best time for 2560K FFT length: 11.38 ms., avg: 11.43 ms. Best time for 3072K FFT length: 13.86 ms., avg: 13.91 ms. Best time for 3584K FFT length: 16.95 ms., avg: 17.00 ms. Best time for 4096K FFT length: 18.78 ms., avg: 18.84 ms. Best time for 5120K FFT length: 24.69 ms., avg: 24.74 ms. Best time for 6144K FFT length: 29.37 ms., avg: 29.49 ms. Best time for 7168K FFT length: 35.22 ms., avg: 35.30 ms. Best time for 8192K FFT length: 41.07 ms., avg: 41.17 ms. LL test of M77000003 (4M FFT): One worker = 18.8 ms Two workers = 19.1 ms Three workers = 20.1 ms Four workers = 22.7 ms Small torture test temps (4 cores): 78 / 74 / 72 / 70 |
|
|
|
|
|
#182 |
|
P90 years forever!
Aug 2002
Yeehaw, FL
165678 Posts |
Now, with FMA3 enabled code (version 28.1). I still need to add FMA3 to the carry propagation code, so timings should get a smidge better.
Code:
Intel(R) Core(TM) i5-4670K CPU @ 3.40GHz CPU speed: 4200.00 MHz Prime95 64-bit version 28.1, RdtscTiming=1 Best time for 768K FFT length: 2.655 ms., avg: 2.689 ms. Best time for 896K FFT length: 3.158 ms., avg: 3.266 ms. Best time for 1024K FFT length: 3.605 ms., avg: 3.808 ms. Best time for 1280K FFT length: 4.675 ms., avg: 4.707 ms. Best time for 1536K FFT length: 5.849 ms., avg: 5.908 ms. Best time for 1792K FFT length: 6.841 ms., avg: 6.855 ms. Best time for 2048K FFT length: 7.836 ms., avg: 7.863 ms. Best time for 2560K FFT length: 9.985 ms., avg: 10.015 ms. Best time for 3072K FFT length: 11.827 ms., avg: 11.884 ms. Best time for 3584K FFT length: 14.141 ms., avg: 14.161 ms. Best time for 4096K FFT length: 16.585 ms., avg: 16.650 ms. Best time for 5120K FFT length: 20.244 ms., avg: 20.795 ms. Best time for 6144K FFT length: 25.213 ms., avg: 25.235 ms. Best time for 7168K FFT length: 29.142 ms., avg: 29.162 ms. Best time for 8192K FFT length: 36.826 ms., avg: 36.936 ms. M77000003 One worker = 16.6 ms Two workers = 17.2 ms Three workers = 18.6 ms Four workers = 21.8 ms Small torture test temps: 86 / 83 / 80 / 78 Notice the temp increase on the small FFT torture test! These small FFTs operate out of the L2 cache. I'm going to try a really small torture test that operates out of the L1 cache to see if I can get the temps even higher. |
|
|
|
|
|
#183 | |
|
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
251916 Posts |
Quote:
See the difference - http://xkcd.com/242/ |
|
|
|
|
|
|
#184 |
|
Bemusing Prompter
"Danny"
Dec 2002
California
1001011000002 Posts |
Very nice!
I wonder how DDR4 will affect the performance once consumer chips start supporting it. |
|
|
|
|
|
#185 |
|
Dec 2011
After milion nines:)
145110 Posts |
And what will be performance increase with AVX2? ( if it will be any)
|
|
|
|
|
|
#186 |
|
May 2013
East. Always East.
11·157 Posts |
Any ideas as to why the time per iteration is going up every time you add a worker? When I did my tests, I had 0.011 seconds at one, two or three workers and only saw the drop to 0.014 seconds when I added the fourth.
|
|
|
|
|
|
#187 |
|
"Mr. Meeseeks"
Jan 2012
California, USA
23×271 Posts |
Memory bottleneck.
|
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Haswell-E Prelim. Benchmark | sdbardwick | Hardware | 37 | 2015-02-10 18:49 |
| Prime95 and Haswell | Pleco | Information & Answers | 22 | 2014-07-13 16:03 |
| Haswell Rig | Mini-Geek | Hardware | 64 | 2014-05-27 13:22 |
| Prime95 version 27.1 early preview, not-even-close-to-beta release | Prime95 | Software | 126 | 2012-02-09 16:17 |
| Missing mouse-over preview text | retina | Forum Feedback | 1 | 2011-09-12 15:32 |