QUOTE(Thunderbolt @ Apr 23 2007, 02:58 AM)
Then what are u referring to?It is either Kentsfield, Penryn or Barcelona only.
Intel Penryn 40% faster?
|
|
Apr 23 2007, 09:46 AM
Return to original view | Post
#1
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
|
|
|
|
|
|
Apr 23 2007, 11:55 PM
Return to original view | Post
#2
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
|
|
|
Apr 24 2007, 09:00 AM
Return to original view | Post
#3
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
QUOTE(kapitan @ Apr 24 2007, 12:03 AM) There is no 'real quad' and 'fake quad'. Barcelona, Penryn and Kentsfield are all Quad Cores. If seriously wanna define 'Fake Quad', Pentium D EE with 2 physical cores and 2 logical cores is a good example.QUOTE(Thunderbolt @ Apr 24 2007, 04:04 AM) You are the one who starts : "In fact there is: True Quad doesnt share the cache". So i was really interested how True Quad can be defined this way.In fact, your statement said : Penryn / Kentsfield is a 'true' quad because they don't share cache between sites. Barcelona shared their cache among all cores, so it is not a 'true' quad. This is a misleading info dude. |
|
|
Apr 24 2007, 10:35 AM
Return to original view | Post
#4
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
there is no 'true' and 'fake' quad. If a CPU has 4 physical cores in the same package, it is a quad core.
Intel uses non-monolithic approach to simplify the design with some sacrifice in performance, while AMD uses monolithic approach to have higher performance but takes longer time to design. |
|
|
Apr 24 2007, 12:26 PM
Return to original view | Post
#5
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
QUOTE(cks2k2 @ Apr 24 2007, 11:40 AM) I would say design time would be pretty much the same - it's the manufacturing that's the problem. No, the design time will not be the same. In terms of architecture, you need a redesign the intercore communications and cache if it is a monolithic approach, not to mention about circuit and layout optimization. For Kentsfield, all you need is to pack 2x C2D into a package, where the architecture and circuit are almost identical. Probably the power management logic and coherency/EBL need some changes, that's all. MCM makes sense to save cost and design effort, but sacrificing the performance & efficiency. This is why Intel can debut the 1st quad core much faster than AMD.4-cores in 1 die == larger die size == higher defect potential. Also larger die size == less dies per wafer == less cost effective. Binning will be another problem -> you can only sell at the lowest common clock speed. MCM makes sense until you move to a mature smaller process. This post has been edited by charge-n-go: Apr 24 2007, 01:02 PM |
|
|
Apr 24 2007, 06:05 PM
Return to original view | Post
#6
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
QUOTE(BlackThyra87 @ Apr 24 2007, 05:34 PM) As I said, kentsfield and barcelona are real quad cores, as they has 4 physical cores. The 4 processing cores ARE there. In engineering term, it is called the Monolithic vs Non-monolithic approach. The real or fake stuff are just some marketing terms to confuse the public. QUOTE TRUE QUAD / NATIVE QUAD core means: there are 4 individual cores stick together (will look as one) in one processor die, has its own cache on each core. Just like the Barcelona. The real quad core is the Barcelona. No doubt about that. What are u trying to say? Kentsfield has its own cache in individual core also. The definition of native quad core doesnt care about the cache at all. The cache system in Barcelona is simply implementation specific but has nothing to do with Native Quad Core design. The word "true" is misleading, but "native" describes the monolithic approach best. QUOTE what intel has done on their Quad core is that they just stick 2x dual core in one processor die. Of course there are performance inprovement over some applications but the app must be specifically optimized for intel's Quad core paths. you are wrong again. intel doesnt stick 2x dual core into one processor die. they are simply packaged together as single CPU unit. the word 'die' means 1 silicon --> the monolithic approach. As long as the applications have multi-threading support, it will have performance increment for sure. No special optimization is needed. |
|
|
|
|
|
Apr 24 2007, 06:36 PM
Return to original view | Post
#7
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
QUOTE(s[H) sIkuA,Apr 24 2007, 06:24 PM]He himself is the article, he is an engineering student btw, hence some technical term used that most people not familiar with , am I right charge-n-go? haha, don't say tat. i m also referring to some internal documentations to answer BlackThyra87QUOTE(empire23 @ Apr 24 2007, 06:25 PM) It is proven that even AMD needs to go back to the NB for Core to Core transfers, so that crossbar seems pretty useless lol. haha, although i worked in intel, but there are some amd overview classes over here.The fact he works for Intel? You are right, the core to core transfer is done via crossbar. It is just like the D-link or whatever intranet switch we use at home. NB is the centralize controller to determine what data to be transferred via crossbar, and serves all requests from the cores. Well, i think the fact that NB is built-in into K8 doesn't cause as much penalty as C2D / P4 where the NB is external. from the attached image, K8's NB is called the System Request Interface. ^ oh shooot, this is a single core K8. a dual core K8 has the same structure btw, js adding another core side by side with the 1st core. Image taken from Mindshare Slides (www.mindshare.com) This post has been edited by charge-n-go: Apr 24 2007, 06:42 PM |
|
|
Apr 25 2007, 08:16 AM
Return to original view | Post
#8
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
QUOTE(ikanayam @ Apr 24 2007, 11:09 PM) As empire said, it has been proven a while ago that the crossbar in the current x2 processors is NOT used to transfer data between cores. Which is pretty dumb IMO, but let's see if they change/fix it in Barcelona. Probably requires a change in coherency protocol to achieve, which is why they didn't do it. It does go through the crossbar in trasferring data between cores, unless the course given here is misleading |
|
|
Apr 25 2007, 03:19 PM
Return to original view | Post
#9
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
QUOTE(empire23 @ Apr 25 2007, 02:03 PM) Xbitlabs did some testing and they proved it went through the FSB by looking at the latency numbers IIRC. Hey dude, still FSB? I think you Intel too much lately Anyway, even the data is taken from RAM, it still goes through the crossbar in this fashion: RAM -> DRAM controller -> mem controller -> crossbar -> SRI -> cache. MOESI / MESI is so stupid that it wont transfer data from core 0 to core 1 in a more direct way. (when going from 'E' state to 'S' state). This post has been edited by charge-n-go: Apr 25 2007, 03:19 PM |
|
|
Apr 25 2007, 03:52 PM
Return to original view | Post
#10
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
QUOTE(ikanayam @ Apr 25 2007, 03:30 PM) ^ it's more like the coherency controller wasn't smart enough to take advantage of MOESI than a weakness in the protocol itself. It can be done with MOESI given the right implementation (i.e. how it's done in Pirahna). yeah you got the point. The states can actually remain the same, but the implementation could be different. Hopefully barcelona takes care of this. It does have a shared L3 after all, which should make things a lot easier if they do it right. Hmm.. mind to post up the piranha details in engineering thread? |
|
|
May 8 2007, 06:44 PM
Return to original view | Post
#11
|
![]() ![]() ![]() ![]() ![]() ![]() ![]()
Senior Member
4,060 posts Joined: Jan 2003 From: Penang / PJ |
SSE are instruction sets, that means you need to use the assembly instruction in order to use the feature supported by SSE.
|
| Change to: | 0.0181sec
0.56
7 queries
GZIP Disabled
Time is now: 21st December 2025 - 02:38 AM |