mersenneforum.org  

Old 2009-09-19, 15:52   #34
fivemack
(loop (#_fork))
 
Feb 2006
Cambridge, England

2×7×461 Posts

ah, sorry:

http://www.chiark.greenend.org.uk/~t.../gnfs-batalov/

is what you want, and you want lasieveAsm64latest-updated16e.zip from under that
Old 2009-09-19, 16:10   #35
Andi47
 
Oct 2004
Austria

2×17×73 Posts

Quote:
Originally Posted by fivemack View Post
ah, sorry:

http://www.chiark.greenend.org.uk/~t.../gnfs-batalov/

is what you want, and you want lasieveAsm64latest-updated16e.zip from under that
Thanks. I will try them as soon as the postprocessing of 2,1766M finishes.
Old 2009-09-19, 16:59   #36
Brian Gladman
 
May 2008
Worcester, United Kingdom

7²·11 Posts

Quote:
Originally Posted by Joe O View Post
I'm doubtful, but it might be possible if the code were structured in a way that supports this approach. This is unlikely unless it was planned in advance. Agner uses this technique to write x64 assembler for both environments, and I have also done this in my x64 assembler code for some cryptographic algorithms.

One thing that might work is if the code was put through M4 to produce raw assembler code since this might produce code that could be converted for Windows with fewer difficulties. But I am not familiar enough with M4 to do this.

Last fiddled with by Brian Gladman on 2009-09-19 at 17:12
Old 2009-09-19, 19:54   #37
Brian Gladman
 
May 2008
Worcester, United Kingdom

21B₁₆ Posts

Quote:
Originally Posted by Brian Gladman View Post
I'm doubtful, but it might be possible if the code were structured in a way that supports this approach. This is unlikely unless it was planned in advance. Agner uses this technique to write x64 assembler for both environments, and I have also done this in my x64 assembler code for some cryptographic algorithms.

One thing that might work is if the code was put through M4 to produce raw assembler code since this might produce code that could be converted for Windows with fewer difficulties. But I am not familiar enough with M4 to do this.
Answering my own post, this strategy works quite well and the YASM assembler that I use can handle the resulting code. So all that stands in the way is sorting out what stubs are needed and what other code changes are necessary (I already know that there are quite a few because YASM reports quite a few errors).

It is still a fair bit of work but a lot less than I feared. I don't have time to do it all myself but I would help out as a part of a team effort. In particular all the C prototypes for the assembler functions need to be extracted from the GGNFS code in order to change the calling conventions.
Old 2009-09-19, 20:31   #38
Batalov
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

3²·1,117 Posts

I can make the .s files by m4, if it would help. I have previously unrolled the loops (which is mostly what m4 does in that code) to find the 16e discrepancy (for 16e, the loop is unrolled once and actually some registers may be saved, but I didn't figure it out... but I found a wrong instruction right then). I don't really know this assembly dialect at all!
Old 2009-09-19, 20:52   #39
Brian Gladman
 
May 2008
Worcester, United Kingdom

1000011011₂ Posts

Quote:
Originally Posted by Batalov View Post
I can make the .s files by m4, if it would help. I have previously unrolled the loops (which is mostly what m4 does in that code) to find the 16e discrepancy (for 16e, the loop is unrolled once and actually some registers may be saved, but I didn't figure it out... but I found a wrong instruction right then). I don't really know this assembly dialect at all!
Thank you for the offer, which I appreciate, but I have already done this bit, which turned out to be easy. My version of M4 does however give some errors so it may well be useful to have alternative conversions. Some of the M4 output files for unrolling loops are huge so I may ask for just a part of some of these files. A positive note is that YASM gives just one error on a 50 Mbyte source file output by M4.

The hard bit is making the changes for the different calling conventions and the bit here that is needed as a starting point is the concise documentation of all the C prototypes for every function implemented in the assembler code. Unfortunately the few I have looked at have a lot of parameters, which is the more difficult conversion scenario.
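To illustrate why a long parameter list is the harder case: the two x64 calling conventions pass the leading integer/pointer arguments in different registers, and the Microsoft convention has only four argument registers where the System V convention used on Linux has six. Below is a minimal Python sketch of that mapping. The helper names are made up for illustration and are not part of GGNFS; only integer/pointer arguments are considered, and stack offsets are relative to rsp at function entry.

```python
# Where the n-th integer/pointer argument lives at function entry under the
# two x64 calling conventions. Illustrative helper, not part of GGNFS.

SYSV_REGS = ["rdi", "rsi", "rdx", "rcx", "r8", "r9"]   # Linux / System V AMD64
WIN64_REGS = ["rcx", "rdx", "r8", "r9"]                 # Microsoft x64


def arg_location(index, abi):
    """Return the register or stack slot of argument `index` (0-based)."""
    if abi == "sysv":
        if index < len(SYSV_REGS):
            return SYSV_REGS[index]
        # Stack arguments start just above the 8-byte return address.
        return "[rsp+%d]" % (8 + 8 * (index - len(SYSV_REGS)))
    if abi == "win64":
        if index < len(WIN64_REGS):
            return WIN64_REGS[index]
        # Return address (8 bytes) plus 32 bytes of shadow space come first.
        return "[rsp+%d]" % (8 + 8 * index)
    raise ValueError("unknown ABI: %r" % abi)


def translation_table(nargs):
    """List (index, sysv_location, win64_location) for each argument."""
    return [(i, arg_location(i, "sysv"), arg_location(i, "win64"))
            for i in range(nargs)]


if __name__ == "__main__":
    for i, sysv, win in translation_table(8):
        print("arg %d: %-9s -> %s" % (i, sysv, win))
```

For a function with up to four arguments a stub only has to move registers; from the fifth argument on, values that arrive in r8/r9 under System V have to be fetched from the stack under the Microsoft convention. In addition, rdi and rsi (and xmm6 to xmm15) are callee-saved under the Microsoft convention but not under System V, so a stub that shuffles registers also has to preserve them.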
Old 2009-09-19, 21:48   #40
Batalov
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

3²×1,117 Posts

Right, once you have m4, it takes no effort.
We could exclusively concentrate on 16e (.s files will be minuscule) and 15e (the prototypical small unroll for later). Then all others.
One thing that I've personally gotten to appreciate only very late was that all comments are in the .w files. I've even spent some time with 'cmagle' (or ctangle not to skip comments), but didn't get it to a good readable state.

And remember, there were the new sources contributed by Thorsten via Alex. I've decided to compile all relevant threads in one place, again:
SourceForge
http://mersenneforum.org/showthread.php?t=11660
http://mersenneforum.org/showthread.php?t=11430
http://mersenneforum.org/showthread.php?t=11160


(all divisible by 10! coincidence? I don't think so. )
Old 2009-09-23, 02:52   #41
Batalov
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

10053₁₀ Posts
A67-68M

Quote:
Originally Posted by Batalov View Post
I figured that the deposition site is going to be the same fivemack.dyndns.org, M941/ directory.
...and there goes the first pancake.
R67-68M.bz2 2,122,258 relns (of which 3,659 are duplicates)
Sieved in 2.0Ms on Phenom940.

No problems with 16e found; there's no yield dip and there were no badscheds. Same for 67M-a which is almost finished.
Planted the A67-68M.bz2 as well. All files are raw (duplicates not removed). Local duplication (in 1Mq windows) is very low but I checked it again: this one has 2,156,943 relations (of which 3,590 are duplicates).
Note: Two raw files combined produced 18,196 duplicates (and 4.21M unique); with more data duplication starts to show.
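The raw/unique/duplicate figures above can be reproduced along these lines. This is a minimal sketch that assumes each relation is one text line beginning with an `a,b:` pair and that two relations count as duplicates when their `a,b` pairs match; the function name and the sample lines are made up for illustration.

```python
def count_duplicates(lines):
    """Return (raw, unique, duplicates) for an iterable of relation lines.

    Assumption: a relation is identified by the text before the first ':'
    (its a,b pair). Blank lines and lines starting with '#' are skipped.
    """
    seen = set()
    raw = 0
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        raw += 1
        seen.add(line.split(":", 1)[0])
    unique = len(seen)
    return raw, unique, raw - unique


if __name__ == "__main__":
    sample = [
        "-5,7:1a,2b:3c,4d",
        "11,3:1a,5:7,9",
        "-5,7:1a,2b:3c,4d",   # repeat of the first relation
    ]
    print(count_duplicates(sample))
```

Running this over two files chained together (for example with `itertools.chain`) also catches relations repeated across files, which per-file counts miss. That is consistent with the combined count of 18,196 being larger than the per-file counts added up (3,659 + 3,590 = 7,249).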

Here's an interesting bit. When counted in 2Kq bins, we get on average 4200 relations, but look at the deviation patterns:
[Attached image: A-R-67-68M.PNG]
Algebraic side has about 2.5x rmsd of the rational side.
Just a note about difficulties of estimates in small -a intervals. (Incidentally, I am preparing an snfs project with -a side sieving and this property was important for me to remember.)
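The per-bin comparison described above can be sketched as follows. It assumes the special-q of each relation has already been extracted into a list (how to do that depends on the siever output and is not shown); `bin_width=2000` corresponds to the 2Kq bins, and all names are illustrative.

```python
import math
from collections import Counter


def bin_counts(special_qs, q_start, q_end, bin_width=2000):
    """Count relations per special-q bin over [q_start, q_end)."""
    nbins = (q_end - q_start + bin_width - 1) // bin_width
    hits = Counter((q - q_start) // bin_width
                   for q in special_qs if q_start <= q < q_end)
    # Include empty bins so they contribute to the statistics.
    return [hits.get(i, 0) for i in range(nbins)]


def mean_and_rmsd(counts):
    """Mean and root-mean-square deviation from the mean."""
    mean = sum(counts) / len(counts)
    rmsd = math.sqrt(sum((c - mean) ** 2 for c in counts) / len(counts))
    return mean, rmsd
```

Computing `mean_and_rmsd` for each side over the same q range and dividing the two RMSD values gives the kind of ratio quoted above. The larger the RMSD, the further a yield estimate taken from a short test interval can be from the true average.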

-S
Old 2009-09-23, 03:05   #42
bsquared
 
"Ben"
Feb 2007

7×13×41 Posts

Quote:
Originally Posted by Batalov View Post
Algebraic side has about 2.5x rmsd of the rational side.
Just a note about difficulties of estimates in small -a intervals. (Incidentally, I am preparing an snfs project with -a side sieving and this property was important for me to remember.)

-S
I've seen this in every snfs job I've done which sieved on the algebraic side... makes for more difficult jobsize estimation, yes.
Old 2009-09-25, 14:19   #43
fivemack
(loop (#_fork))
 
Feb 2006
Cambridge, England

2×7×461 Posts

61-62 finished both sides. 2212705+2127775 relations, 2.6M+2.7M seconds (1s/r on core2 hardware at around 3GHz, 1.4s/r on K8/2400 in a 2x2-core system, 1.3s/r on K8/2200 in a single-socket system)
Old 2009-09-26, 07:38   #44
Batalov
 
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2

3²·1,117 Posts
A68-69M

finished A68-69M, too. Again 2.0Ms on k10/3000.
The duplicates count is:
2,137,457 raw = 2,133,980 unique + 3478 duplicate.
All times are UTC.



Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.
