mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Software

Reply
 
Thread Tools
Old 2016-01-01, 22:37   #232
Aurum
 
Aurum's Avatar
 
Nov 2015

2·52 Posts
Default

It's hard to remember all websites/sources.

Last fiddled with by Aurum on 2016-01-01 at 22:38
Aurum is offline   Reply With Quote
Old 2016-01-01, 22:39   #233
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2×67×73 Posts
Default

Quote:
Originally Posted by Aurum View Post
It's hard to remember all websites/sources.
Have you heard about Google? It's a little start-up which might help you find things you think you've read....
chalsall is offline   Reply With Quote
Old 2016-01-01, 22:41   #234
Aurum
 
Aurum's Avatar
 
Nov 2015

628 Posts
Default

I even searched my bookmarks + history ... I don't even remember if it was a german or english website. This is by far not the only thing I'm working on.
Aurum is offline   Reply With Quote
Old 2016-01-01, 22:43   #235
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2×67×73 Posts
Default

Quote:
Originally Posted by Aurum View Post
I even searched my bookmarks + history ... I don't even remember if it was a german or english website. This is by far not the only thing I'm working on.
OK... Understood.

But please understand that making a claim, and then not being able to support it, doesn't go down well around here.
chalsall is offline   Reply With Quote
Old 2016-01-01, 22:52   #236
Aurum
 
Aurum's Avatar
 
Nov 2015

2×52 Posts
Default

The article also refereed to the sandy brige sata bug and said that a minor design change takes 8 weeks. That's why the workarounds in the microcode are not fixed in the hardware itself. Maybe someone else knows the source I'm talking about.
Aurum is offline   Reply With Quote
Old 2016-01-01, 22:57   #237
science_man_88
 
science_man_88's Avatar
 
"Forget I exist"
Jul 2009
Dumbassville

26·131 Posts
Default

Quote:
Originally Posted by Aurum View Post
It's hard to remember all websites/sources.
<never mind posted without thought again.>
is it any of these ?

https://www.google.ca/webhp?sourceid...%22sata+bug%22

Last fiddled with by science_man_88 on 2016-01-01 at 22:59
science_man_88 is offline   Reply With Quote
Old 2016-01-01, 22:58   #238
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2·67·73 Posts
Default

Quote:
Originally Posted by Aurum View Post
The article also refereed to the sandy brige sata bug and said that a minor design change takes 8 weeks. That's why the workarounds in the microcode are not fixed in the hardware itself. Maybe someone else knows the source I'm talking about.
Anyone? Anyone at all...

Please forgive me for this, but we often have people entering our space who try to distract rather than converge.

It is important to have one's "signal to noise ratio" filter set to stun....
chalsall is offline   Reply With Quote
Old 2016-01-01, 23:30   #239
Aurum
 
Aurum's Avatar
 
Nov 2015

2·52 Posts
Default

I found it in my history: http://www.computerbase.de/2015-12/a...zessordesigns/
Aurum is offline   Reply With Quote
Old 2016-01-01, 23:35   #240
kladner
 
kladner's Avatar
 
"Kieren"
Jul 2011
In My Own Galaxy!

236568 Posts
Default

Quote:
Originally Posted by Aurum View Post

Warning! Google Translate link!

https://translate.google.com/transla...%2F&edit-text=

Last fiddled with by kladner on 2016-01-01 at 23:36
kladner is offline   Reply With Quote
Old 2016-01-02, 00:02   #241
chalsall
If I May
 
chalsall's Avatar
 
"Chris Halsall"
Sep 2002
Barbados

2·67·73 Posts
Default

Quote:
Originally Posted by Aurum View Post
OK... But that article is talking about AMD CPUs.

We are talking about Intel CPUs here.

Please do try to keep up....

(Just to be clear, this is intentionally confrontational. Deal with it.)
chalsall is offline   Reply With Quote
Old 2016-01-02, 02:05   #242
megabit8
 
Dec 2015

23·3 Posts
Default I managed to make the problem reproduce more quickly

I had an idea to use the Prime 95 v27.9 code to perform a 768K FFT per thread. The input of the FFT is always some fixed random data. So the hash result of the transformed fixed data should be always the same. And it is, most of the time on Skylake. Except that when the project is run from Visual Studio 2015 in Debug mode and the "Pause button is hit", then step through some lines of code then continue. Errors appear immediately. It is spectacular that on Ivy the Pause and step operations work fine and no error appears.

I have isolated this code below and exported it from a c++ dll. I need some expert advice from the developers that the following code performs a FFT of size 768K, exactly as in the v27.9 torture test:
#define norm_routines 10
#define gw_fft(h,a) (*(h)->GWPROCPTRS[0])(a)

_declspec(dllexport) void* __cdecl AllocPrime95Handle()
{
gwhandle *gwdata = new gwhandle();
unsigned long fftlen = 768 * 1024;
unsigned long p = 14942209; //does not matter, used just to initialize. (see the Prime95FFT function).

gwinit(gwdata);
gwset_specific_fftlen(gwdata, fftlen);
gwsetup(gwdata, 1.0, 2, p, -1);

return gwdata;
}

_declspec(dllexport) void __cdecl Prime95FFT(void *handle, __int64 *fastHashOutput) //fastHashOutput has 32 bytes
{
gwhandle *gwdata = (gwhandle*)handle;

int seed = 7; //Use the same calculations.
gwnum s = gwalloc(gwdata);

struct gwasm_data *asm_data = (struct gwasm_data *) gwdata->asm_data;
asm_data->NORMRTN = gwdata->GWPROCPTRS[norm_routines + gwdata->NORMNUM];
asm_data->DESTARG = s;
asm_data->DIST_TO_FFTSRCARG = 0;
asm_data->DIST_TO_MULSRCARG = 0;
asm_data->ffttype = 2; //type 2 = square.

double *startAddress = addr(gwdata, s, 0);
unsigned long dataSize = gwnum_datasize(gwdata);
int n = dataSize / sizeof(double); //n = 808952

__int64 v = (__int64)seed;
for (int i = n; --i >= 0; )
{
v = v * 0x2345987094395 + 1;
startAddress[i] = (double)v; //this is the fixed-random data that is written all the time.
}

gw_fft(gwdata, asm_data);

//sha1::calc(startAddress, dataSize, (unsigned char*)fastHashOutput);

__int64 *startAddressInt64 = (__int64*)startAddress;
__int64 hashResults[4] = { 10000, 20000, 30000, 40000 };
int shifts[4] = { 1, 2, 4, 8 }; //for primes: 3, 5, 17 and 256.
for (int i = n; --i >= 0; )
{
hashResults[i & 3] += (hashResults[i & 3] << shifts[i & 3]) + startAddressInt64[i];
}

memcpy(fastHashOutput, hashResults, sizeof(hashResults));

gwfree(gwdata, s);
}
I also need to know if I use the functions: addr and gwnum_datasize correctly so that this code does not touch memory zones outside the transform.

I have linked gwnum64.lib (non-debug library) to this project and included all the *.h files from the prime95 v27.9 "gwnum" folder.

Thank you!
megabit8 is offline   Reply With Quote
Reply

Thread Tools


Similar Threads
Thread Thread Starter Forum Replies Last Post
Skylake vs Kabylake ET_ Hardware 17 2017-05-24 16:19
Skylake and RAM scaling mackerel Hardware 34 2016-03-03 19:14
So does skylake-nonXeon actually get us anything? fivemack Hardware 36 2015-09-08 01:42
Skylake processor tha Hardware 7 2015-03-05 23:49
Skylake AVX-512 clarke Software 15 2015-03-04 21:48

All times are UTC. The time now is 04:33.


Fri Aug 6 04:33:46 UTC 2021 up 13 days, 23:02, 1 user, load averages: 2.30, 3.26, 4.38

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2021, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.