mersenneforum.org  

Go Back   mersenneforum.org > Great Internet Mersenne Prime Search > Hardware

Reply
 
Thread Tools
Old 2014-04-19, 03:30   #1
ewmayer
2ω=0
 
ewmayer's Avatar
 
Sep 2002
República de California

22·2,939 Posts
Default The Mill CPU architecture

Friend sent me this link (#7 of a series of 9 talks) on a new security-and-performance-oriented CPU architecture called The Mill:



Some interesting comments in the first 20 mins on the role (and power-hungriness) of the TLB in current arches such as x86: "The power consumption of the TLB in a processor like Haswell is truly phenomenal."
ewmayer is offline   Reply With Quote
Old 2014-04-19, 08:56   #2
retina
Undefined
 
retina's Avatar
 
"The unspeakable one"
Jun 2006
My evil lair

6,793 Posts
Default

It will be interesting to see if they can get enough software backing to make it a major player.

I notice that it has no flags or equivalent state information. So how does one do carry propagation in multiprecision code? I hope that even low level coders are not forced to use the ugly C style constructs of comparing the result to the input value to simulate a carry flag. Ew.

And just like a GPU and Java you don't actually generate binaries from the compilation process, it is all a layer above with some intermediate code and the CPU-specific-software-loader will then create the compatible binary code at runtime. Seems to be just another place to have bugs present.
retina is online now   Reply With Quote
Old 2014-04-20, 16:44   #3
jasonp
Tribal Bullet
 
jasonp's Avatar
 
Oct 2004

5·23·31 Posts
Default

There are some very neat ideas incorporated into this design. One of the earlier talks said they had been at it for 10 years.
jasonp is offline   Reply With Quote
Old 2014-05-03, 07:49   #4
retina
Undefined
 
retina's Avatar
 
"The unspeakable one"
Jun 2006
My evil lair

6,793 Posts
Default

Quote:
Originally Posted by retina View Post
I notice that it has no flags or equivalent state information. So how does one do carry propagation in multiprecision code? I hope that even low level coders are not forced to use the ugly C style constructs of comparing the result to the input value to simulate a carry flag. Ew.
The forum has a comment about this:
Quote:
There are no condition codes as such; it’s kind of hard to do when you can have half-a-dozen CC-setting operations in each instruction, although some architectures try, using multiple CC registers.

To have the effect of a carry bit, if you know that no argument was a Nar and you do an addux and get a NaR out, then you must have gotten an overflow, but that’s pretty kludgy. Or you could do an adduw and test the first bit in the extended data, but that’s pretty kludgy too.

However, there is an alternative: the current operation set includes addc and subc, with two results: a wrapped value and a carry/borrow. It is not clear whether these ops will stay around though; the only use is in an arbitrary-precision library, and for quad arithmetic on members that don’t support native quad; neither of these seem that common, and it’s not clear that the ops (which would have two-cycle latency) are worth it.

We expect a general triage of the operations set later in the process of an FPGA implementation.
And further down:
Quote:
It would certainly be possible to have a carry metabit in each byte, and we considered it. After all, there is already a NaR bit in each byte. However, we could not find enough usage for such a design to justify the cost. Perhaps we overlooked some important use for carry ...
My first thought was TLS, and crypto primitives in general, make extensive use of arbitrary-precision code. Maybe I've overlooked something also?
retina is online now   Reply With Quote
Old 2014-05-03, 17:39   #5
sanaris
 
"Yury Vorobyov"
Jul 2013
Chelyabinsk

19 Posts
Default

Now, when casual users quit 100 watt processor range for tablets and iphones, industry could do some really nice computing devices in terms of floating point ops per clock.

Problem is, why only Intel has the balls to do it?
sanaris is offline   Reply With Quote
Old 2014-05-03, 19:58   #6
xilman
Bamboozled!
 
xilman's Avatar
 
"𒉺𒌌𒇷𒆷𒀭"
May 2003
Down not across

2·17·347 Posts
Default

Quote:
Originally Posted by sanaris View Post
Problem is, why only Intel has the balls to do it?
s/balls/money/
xilman is offline   Reply With Quote
Reply



Similar Threads
Thread Thread Starter Forum Replies Last Post
P1 on non-x86 architecture. Lorenzo Information & Answers 10 2016-03-16 07:42
Optimizing for CPU architecture: unknown Intel almostfrugal Information & Answers 1 2012-09-19 14:22
RIP 32 bit architecture... chalsall Lone Mersenne Hunters 2 2010-09-27 19:18
39 - 39.25 Mill to 2^61 moo Lone Mersenne Hunters 7 2005-04-30 02:08
Trouble at t' mill Wacky NFSNET Discussion 6 2004-01-27 17:56

All times are UTC. The time now is 13:24.


Fri Jul 7 13:24:11 UTC 2023 up 323 days, 10:52, 0 users, load averages: 1.41, 1.22, 1.17

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2023, Jelsoft Enterprises Ltd.

This forum has received and complied with 0 (zero) government requests for information.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.
A copy of the license is included in the FAQ.

≠ ± ∓ ÷ × · − √ ‰ ⊗ ⊕ ⊖ ⊘ ⊙ ≤ ≥ ≦ ≧ ≨ ≩ ≺ ≻ ≼ ≽ ⊏ ⊐ ⊑ ⊒ ² ³ °
∠ ∟ ° ≅ ~ ‖ ⟂ ⫛
≡ ≜ ≈ ∝ ∞ ≪ ≫ ⌊⌋ ⌈⌉ ∘ ∏ ∐ ∑ ∧ ∨ ∩ ∪ ⨀ ⊕ ⊗ 𝖕 𝖖 𝖗 ⊲ ⊳
∅ ∖ ∁ ↦ ↣ ∩ ∪ ⊆ ⊂ ⊄ ⊊ ⊇ ⊃ ⊅ ⊋ ⊖ ∈ ∉ ∋ ∌ ℕ ℤ ℚ ℝ ℂ ℵ ℶ ℷ ℸ 𝓟
¬ ∨ ∧ ⊕ → ← ⇒ ⇐ ⇔ ∀ ∃ ∄ ∴ ∵ ⊤ ⊥ ⊢ ⊨ ⫤ ⊣ … ⋯ ⋮ ⋰ ⋱
∫ ∬ ∭ ∮ ∯ ∰ ∇ ∆ δ ∂ ℱ ℒ ℓ
𝛢𝛼 𝛣𝛽 𝛤𝛾 𝛥𝛿 𝛦𝜀𝜖 𝛧𝜁 𝛨𝜂 𝛩𝜃𝜗 𝛪𝜄 𝛫𝜅 𝛬𝜆 𝛭𝜇 𝛮𝜈 𝛯𝜉 𝛰𝜊 𝛱𝜋 𝛲𝜌 𝛴𝜎𝜍 𝛵𝜏 𝛶𝜐 𝛷𝜙𝜑 𝛸𝜒 𝛹𝜓 𝛺𝜔