![]() |
|
|
#254 | |
|
Dec 2002
5·163 Posts |
Quote:
Of course the Intel processors of today carry an enormous amount of legacy with them. It is about the cost of recompiling all existing software in the world again and redistribute that over the costumers of all software publishers versus the costs of carrying the legacy code in the processor. And also there are some commercial reasons that came to light when Intel tried to introduce a new architecture, Itanium, and failed to line up the customers behind a deal that found not enough market support. It gave AMD a lot of leeway to attract new customers. |
|
|
|
|
|
|
#255 |
|
Dec 2002
14578 Posts |
I have been running 27.9 now for five hours on the same worktodo test case. So far, no error showed up.
Apart from freezing errors showing up more frequently after restarting the machine, did errors show up more frequently when the machine was used for other purposes, like browsing the internet or so, or did the errors show up as frequently when the machine was left alone to only churn on prime95/mprime? George, can you write a version of 27.9 that outputs some potentially interesting data, like memory addresses used each iteration or so, that we could run? Of course such a modification will be very time consuming for the processor. Even worse may possibly actually prevent the error from showing up, but If I can make my machine to exhibit the freezing I would be willing to run such a version. |
|
|
|
|
|
#256 | |
|
Nov 2015
5010 Posts |
I recommend that you don't use your worktodo test case if you want to reproduce the error.
Is freezing equal to a worker stopped for you? There is another bug which will actually freeze the whole system: http://www.tomshardware.co.uk/forum/...-crashing.html Quote:
|
|
|
|
|
|
|
#257 | |||
|
Dec 2015
23×3 Posts |
Quote:
Code:
gwnum s = gwalloc(gwdata); double *startAddress = addr(gwdata, s, 0); unsigned long dataSize = gwnum_datasize(gwdata); So I need a confirmation that this code is right for the job. Quote:
Quote:
First I am trying to exclude that the *startAddress and dataSize variables do not touch any memory beyond their allocated space and they contain the entire FFT mem-work-space. Otherwise they will interfere with unallocated memory possibly used by VS. So I need some help with this. Thank you! Last fiddled with by megabit8 on 2016-01-02 at 12:36 |
|||
|
|
|
|
|
#258 |
|
Dec 2015
1816 Posts |
If "The addr function is used when looping from 0 to fftlen-1" then
Code:
unsigned long dataSize = (unsigned long)addr(gwdata, s, asm_data->FFTLEN - 1) + sizeof(double) - (unsigned long)startAddress; Code:
unsigned long dataSize = gwnum_datasize(gwdata);
I have also called this function once at the beginning: Code:
_declspec(dllexport) void __cdecl InitializeCPU()
{
guessCpuType();
guessCpuSpeed();
CPU_FLAGS &= ~CPU_FMA;
}
I am wondering what code VS 2015 executes there so that the FFT memory space gets altered. Last fiddled with by megabit8 on 2016-01-02 at 14:29 Reason: Add more info |
|
|
|
|
|
#259 | |
|
Einyen
Dec 2003
Denmark
2×1,579 Posts |
Quote:
But if you really want to know if your system has the error or not you should first replicate the exact circumstances, see for example post #184 for the summary. Last fiddled with by ATH on 2016-01-02 at 14:34 |
|
|
|
|
|
|
#260 |
|
Dec 2015
110002 Posts |
I've got some bad news. The same errors happen on another more dummy, slower but AVX optimized FFT of 768K real points. With the Debug Break and continue only on Skylake. It seems to be VS 2015 and Skylake related since _mm256_load_pd loads only half of the register when stepping over. Can you please tell me which compiler do you use for producing the Prime95 exe. Is it VS 2005 since the .SLN file is 2005 ?
Thank you! |
|
|
|
|
|
#261 | |
|
P90 years forever!
Aug 2002
Yeehaw, FL
19×397 Posts |
Quote:
|
|
|
|
|
|
|
#262 |
|
Dec 2002
5×163 Posts |
Yes, I have hyperthreading on and both versions of mprime reported that the cores 1 & 5, 2 & 6, 3 & 7, and 4 & 8 were working on the respective exponents.
I will finish the current test, which will be another two hours. I will then start a new test with 8 threads working concurrently on the following worktodo.txt test case: Code:
[Worker #1] Test=N/A,14942209,67,1 [Worker #2] Test=N/A,14942267,67,1 [Worker #3] Test=N/A,14942293,67,1 [Worker #4] Test=N/A,14942437,67,1 [Worker #5] Test=N/A,14942497,67,1 [Worker #6] Test=N/A,14942539,67,1 [Worker #7] Test=N/A,14942563,67,1 [Worker #8] Test=N/A,14942567,67,1 The current test and other tests done so far will prove the stability of the system so far. I have a GTX 580 and a 590 for this system, but I am not turning them on yet as I don't want to interfere with these tests. Last fiddled with by tha on 2016-01-02 at 15:33 |
|
|
|
|
|
#263 |
|
Dec 2002
5×163 Posts |
@ George:
If I remember correctly each time a LL test is started a random offset is chosen to eliminate potential errors in the design of the processor. Can we somehow force mprime (or prime95) to use no offset or at least a reproducible offset? I would like to try to create a case where we can tell beforehand exactly at which point the processor will be thrown off course. Also I assume you have access to a Skylake system yourself? |
|
|
|
|
|
#264 |
|
Einyen
Dec 2003
Denmark
2·1,579 Posts |
But have you tried the actually torture test yet? So far it is the only reported source of the bug, just to test if your system even has the error. It seems premature to test new ways to generate it before you know if the only known way works.
Last fiddled with by ATH on 2016-01-02 at 15:47 |
|
|
|
![]() |
| Thread Tools | |
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Skylake vs Kabylake | ET_ | Hardware | 17 | 2017-05-24 16:19 |
| Skylake and RAM scaling | mackerel | Hardware | 34 | 2016-03-03 19:14 |
| So does skylake-nonXeon actually get us anything? | fivemack | Hardware | 36 | 2015-09-08 01:42 |
| Skylake processor | tha | Hardware | 7 | 2015-03-05 23:49 |
| Skylake AVX-512 | clarke | Software | 15 | 2015-03-04 21:48 |