![]() |
![]() |
#45 | |||
"Ben"
Feb 2007
22·941 Posts |
![]() Quote:
Quote:
Quote:
cd asm/ make liblasieve.a CC=icc AVX512_TD=1 make liblasieveI11.a CC=icc AVX512_TD=1 make liblasieveI12.a CC=icc AVX512_TD=1 make liblasieveI13.a CC=icc AVX512_TD=1 make liblasieveI14.a CC=icc AVX512_TD=1 make liblasieveI15.a CC=icc AVX512_TD=1 make liblasieveI16.a CC=icc AVX512_TD=1 cd .. cp asm/liblasieve*.a . make all CC=icc AVX512_ALL=1 LASTATS=1 That last one is optional; it will provide timing for lasched and more accurate timings for the other categories if you run with -v. 4 |
|||
![]() |
![]() |
![]() |
#46 |
Apr 2020
2·33·19 Posts |
![]() |
![]() |
![]() |
![]() |
#47 | |
"Ben"
Feb 2007
22·941 Posts |
![]() Quote:
Is this it? Or is there somewhere else? |
|
![]() |
![]() |
![]() |
#48 |
"Ben"
Feb 2007
1110101101002 Posts |
![]()
I discovered that the missing factors in tinyecm processing of lpbr/a > 32 jobs are all a fairly specific class of inputs... namely 2LP's that are composed of two factors >= 32 bits, such that the input large factor is greater than 64 bits but <= lpbr/a*2 in size.
Fortunately, these are easy to identify and split using either mpqs or more effort in tinyecm. Now we find almost all the factors that pure mpqs does, still at a small fraction of the effort. Very large 3LP's may still be missed here and there, but I expect this factor finding rate should largely hold. Code:
time ./gnfs-lasieve4I16e -v -f 316000000 -c 1000 -a R1340L_poly.txt -o R1340L_16e_a_316000000_316001000.out.12 gnfs-lasieve4I16e (with asm64,avx-512 mmx-td,avx-512 lasetup,avx-512 lasched,avx-512 sieve1,avx-512 ecm): L1_BITS=15 Warning: lowering FB_bound to 315999999. FBsize 26351441+0 (deg 8), 26355865+0 (deg 1) total yield: 1242, q=316001009 (0.77841 sec/rel) ETA 0h00m) 48 Special q, 369 reduction iterations reports: 239715573->22542070->20471524->18368663->7200755->2605199 Number of relations with k rational and l algebraic primes for (k,l)=: Total yield: 1242 0/0 mpqs failures, 1108/20196 vain mpqs milliseconds total: Sieve 210330 Sched 416710 medsched 840 TD 161120 (Init 4220, MPQS 30740) Sieve-Change 30, lasieve_setup 177760 TD side 0: init/small/medium/large/search: 2420 32510 900 22630 12730 sieve: init/small/medium/large/search: 3370 50940 1320 34640 11040 TD side 1: init/small/medium/large/search: 3110 22690 1120 21560 5470 sieve: init/small/medium/large/search: 3810 68060 1130 33790 2230 953.632u 15.924s 16:10.01 99.9% 0+0k 2104+312io 1pf+0w |
![]() |
![]() |
![]() |
#49 |
"Oliver"
Sep 2017
Porta Westfalica, DE
7×223 Posts |
![]()
Would you mind trying to build it as C99, so that your compiler complains about implicit declarations (maybe with -Werr)? I then can give it a try with ICX since it will eliminate a lot of guesswork. Thanks.
![]() |
![]() |
![]() |
![]() |
#50 | |
"Ben"
Feb 2007
376410 Posts |
![]() Quote:
![]() |
|
![]() |
![]() |
![]() |
#51 |
"Oliver"
Sep 2017
Porta Westfalica, DE
30318 Posts |
![]()
At least for the YAFU code, I should have resolved the vast majority of them in my other thread.
Great to hear you trying this! |
![]() |
![]() |
![]() |
#52 |
"Ben"
Feb 2007
22×941 Posts |
![]()
The sievers should now build with CC=icx with all of the new AVX512 code. To others that may not know, icx can be downloaded for free from Intel. I was not aware of this until a few days ago.
If you wouldn't mind doing some sanity checking by comparing small runs with the new versions against the old/original sievers I would appreciate it. |
![]() |
![]() |
![]() |
#53 | |
"Ben"
Feb 2007
22·941 Posts |
![]() Quote:
Code:
./gnfs-lasieve4I16e -v -f 316000000 -c 1000 -a R1340L_poly.txt -J 16 -o R1340L_16e_a_316000000_316001000.out gnfs-lasieve4I16e (with asm64,avx-512 mmx-td,avx-512 lasetup,avx-512 lasched,avx-512 sieve1,avx-512 ecm): L1_BITS=15 Warning: lowering FB_bound to 315999999. FBsize 26351441+0 (deg 8), 26355865+0 (deg 1) Recurrence init: ub=32768 exceeds 16384 |
|
![]() |
![]() |
![]() |
#54 | |
"Bo Chen"
Oct 2005
Wuhan,China
2·3·31 Posts |
![]() Quote:
I could build that source 5 years ago, but it is a little strange that now I cann't compile it. Also there is a github Greg commit, (search lasieve5 on github, you will found Greg's github), perhaps it is newer, Greg's github still not support degree 8, but I think he already finish the code, since NFS@home could tackle degree 8 normally. |
|
![]() |
![]() |
![]() |
#55 | |
Apr 2020
2×33×19 Posts |
![]() Quote:
|
|
![]() |
![]() |
![]() |
Thread Tools | |
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
yafu ignoring yafu.ini | chris2be8 | YAFU | 9 | 2022-02-17 17:52 |
YAFU + GGNFS Confirmation | nivek000 | YAFU | 1 | 2021-12-10 22:35 |
Running YAFU via Aliqueit doesn't find yafu.ini | EdH | YAFU | 8 | 2018-03-14 17:22 |
GGNFS or something better? | Zeta-Flux | Factoring | 1 | 2007-08-07 22:40 |
ggnfs | ATH | Factoring | 3 | 2006-08-12 22:50 |