Welcome Guest ( Log In | Register )

115 Pages  1 2 3 > » Bottom

Outline · [ Standard ] · Linear+

 AMD Bulldozer & Bobcat

views
     
TSjinaun
post Aug 25 2010, 05:07 PM, updated 16y ago

where are my stars???
Group Icon
Elite
6,139 posts

Joined: Jan 2003
Next year, AMD plans to ship products based on a new processor architecture code-named Bulldozer, and in the world of big, x86-compatible CPUs, that's huge news. In this arena, the question of how truly "new" a chip architecture is can be vexingly complicated, because technologies, ideas, and logic are often carried over from one generation to the next. But it's probably safe to say Bulldozer is AMD's first all-new, bread-and-butter CPU architecture since the introduction of the K7 way back in 1999. The firm has made notable incremental changes along the way—K8 brought a new system architecture, Barcelona integrated four cores together—but the underlying microarchitecture hasn't changed too much. Bulldozer is something very different, a new microarchitecture incorporating some novel concepts we've not seen anywhere else.

Today, at the annual Hot Chips conference, Mike Butler, AMD Fellow and Chief Architect of the Bulldozer core, gave the first detailed public exposition of Bulldozer. We didn't attend his presentation, but we did talk with Dina McKinney, AMD Corporate Vice President of Design Engineering, who led the Bulldozer team, in advance of the conference. We also have a first look at some of the slides from Butler's talk, which reveal quite a bit more detail about Bulldozer than we've seen anywhere else.

The first thing to know about the information being released today is that it's a technology announcement, and only a partial one at that. AMD hasn't yet divulged specifics about Bulldozer-based products yet, and McKinney refused to answer certain questions about the architecture, too. Instead, the company intends to release snippets of information about Bulldozer in a directed way over time in order to maintain the buzz about the new chip—an approach it likens to "rolling thunder," although I'd say it feels more like a leaky faucet.

The products: New CPUs in 2011

Regardless, we know the broad outlines of expected Bulldozer-based products already. Bulldozer will replace the current server and high-end desktop processors from AMD, including the Opteron 4100 and 6100 series and the Phenom II X6, at some time in 2011. A full calendar year is an awfully big target, especially given how close it is, but AMD isn't hinting about exactly when next year the products might ship. We do know that the chips are being produced by GlobalFoundries on its latest 32-nm fabrication process, with silicon-on-insulator tech and high-k metal gate transistors. McKinney told us the first chips are already back from the fab and up and running inside of AMD, so Bulldozer is well along in its development. Barring any major unforeseen problems, we'd wager the first products based on it could ship well before the end of 2011, which would be somewhat uncommon considering that these product launch time windows frequently get stretched to their final hours.

One advantage that Bulldozer-based products will have when they do ship is the presence of an established infrastructure ready and waiting for them. AMD says Bulldozer-based chips will be compatible with today's Opteron sockets C32 and G34, and we expect compatibility with Socket AM3 on the desktop, as well, although specifics about that are still murky.

AMD has committed to three initial Bulldozer variants. "Valencia" will be an eight-core server part, destined for the C32 socket with dual memory channels. "Interlagos" will be a 16-core server processor aimed at the G34 socket, so we'd expect it to have quad memory channels. In fact, Interlagos will likely be comprised of two Valencia chips on a single package, in an arrangement much like the present "Magny-Cours" Opterons. The desktop variant, "Zambezi", will have eight cores, as well. All three will quite likely be based on the same silicon.

The concept: two 'tightly coupled' cores

Attached Image

The specifics of that silicon are what will make Bulldozer distinctive. The key concept for understanding AMD's approach to this architecture is a novel method of sharing resources within a CPU. Butler's talk names a couple of well-known options for supporting multiple threads. Simultaneous multithreading (SMT) employs targeted duplication of some hardware and sharing of other hardware in order to track and execute two threads in a single core. That's the approach Intel uses its current, Nehalem-derived processors. CMP, or chip-level multiprocessing, is just cramming multiple cores on a single chip, as AMD's current Opterons and Phenoms do. The diagram above depicts how Bulldozer might look had AMD chosen a CMP-style approach.

Attached Image

AMD didn't take that approach, though. Instead, the team chose to integrate two cores together into a fundamental building block it calls a "Bulldozer module." This module, diagrammed above, shares portions of a traditional core—including the instruction fetch, decode, and floating-point units and L2 cache—between two otherwise-complete processor cores. The resources AMD chose to share are not always fully utilized in a single core, so not duplicating them could be a win on multiple fronts. The firm claims a Bulldozer module can achieve 80% of the performance of two complete cores of the same capability. Yet McKinney told us AMD has estimated that including the second integer core adds only 12% to the chip area occupied by a Bulldozer module. If these claims are anywhere close to the truth, Bulldozer should be substantially more efficient in terms of performance per chip area—which translates into efficiency per transistor and per watt, as well.

One obvious outcome of the Bulldozer module arrangement, with its shared FPU, is an inherent bias toward increasing integer math performance. We've heard several explanations for this choice. McKinney told us the main motivating factor was the presence of more integer math in important workloads, which makes sense. Another explanation we've heard is that, with AMD's emphasis on CPU-GPU fusion, floating-point-intensive problems may be delegated to GPUs or arrays of GPU-like parallel processing engines in the future.

Attached Image

In our talk, McKinney emphasized that a Bulldozer module would provide more predictable performance than an SMT-enabled core—a generally positive trait. That raised an intriguing question about how the OS might schedule threads on a Bulldozer-based processor. For an eight-threaded, quad-core CPU like Nehalem, operating systems generally tend to favor scheduling a single thread on each physical core before adding a second thread on any core. That way, resource sharing within the cores doesn't come into play before necessary, and performance should be optimal. We suggested such an arrangement might also be best for a Bulldozer-based CPU, but McKinney downplayed the need for any special provisions of that nature on this hardware. She also hinted that scheduling two threads on the same module and leaving the other three modules idle, so they cold drop into a low-power state, might be the best path to power-efficient performance. We don't yet know what guidance AMD will give operating system developers regarding Bulldozer, but the trade-offs at least shouldn't be too painful.


Sos: http://www.techreport.com/articles.x/19514

Bulldozer 20 Questions


QUOTE
“Will Bulldozer implement new versions of Hypertransport?” – Rheo

No, Bulldozer takes advantage of the same version of HyperTransport™ (HT) technology as our existing AMD Opteron™ 4000 and 6000 series processors, HyperTransport 3.1.

“Is there any”programmable-tangible” improvement in synchronization between cores in the same module? In other words, will I get tangible performance improvement if I can partition my multi-threaded algorithm to pairs of closely interacting threads, and schedule each pair to a module?” – Edward Yang

That is a very interesting question.

For the majority of software, the OS will work in concert with the processor to manage the thread to core relationships. We are collaborating with Microsoft and the open source software community to ensure that future versions of Windows and Linux operating systems will understand how to enumerate and effectively schedule the Bulldozer core pairs.   The OS will understand if your machine is setup for maximum performance or for maximum performance/watt which takes advantage of Core Performance Boost.

However, let’s say you want to explore if you can get a performance advantage if your threads were scheduled on different modules.  The benefit you can gain really depends on how much sharing the two threads are going to do.

Since the two integer cores are completely separate and have their own execution clusters (pipelines) you get no sharing of data in the L1 – and there is no specific optimizations needed at the software level. However, at the L2 cache level there could be  some  benefits.  A shared L2 cache means that both cores have access to read the same cache lines – but obviously only one can write any cache line at any time. This means that if you have a workload with a main focus of querying data and your two threads are sharing a data set that fits in our L2, then having them execute in the same module could have some advantages. The main advantage we expect to see is an increase in the power efficiency of the cores that are idle.  The more idle other cores are, the better chance the busy cores will have to boost.

However, there is another consideration to this which is how available other cores are.  You need to weigh the benefits of data sharing with the benefit of starting the thread on the next available core. Stacking up threads to execute in proximity means that a thread might be waiting in line while an open core is available for immediate execution.    If your multi-threaded application isn’t optimized to target the L2 (or possibly the L3 cache), or you have distinctly separate applications to run, and you don’t need to conserve power, then you’ll likely get better performance by having them scheduled on separate modules.   So it is important to weigh both options to determine the best execution.

“How much extra performance will we see when running two-threaded applications on one Bulldozer Module compared to two cores in different modules?” – Simon

Without getting too specific around actual scaling across cores on the processor, let me share with you what was in the Hot Chips presentation.  Compared to CMP (chip multiprocessing – which is, in simplistic terms building a multicore chip with each core having its own dedicated resources) two integer cores in a Bulldozer module would deliver roughly 80% of the throughput.  But, because they have shared resources, they deliver that throughput at low power and low cost.  Using CMP has some drawbacks, including more heat and more die space. The heat can limit performance in addition to consuming more power. Ask yourself, would you rather have a 4-cylinder engine that delivered 300HP or a 6-cylinder engine that delivered 360HP and consumed less gas?  The cylinder to horsepower ratio for 4-cylinder is obviously higher (75HP/cylinder vs. the V6’s 60HP/cylinder), meaning that each cylinder can give you more performance.  However, looking at the overall enginge, you are getting less total output; and you are getting that lower output at a higher cost (higher gas consumption).

“Current and forthcoming Nehalem EX based servers from IBM and HP top out at 8 sockets and 64 cores. What kind of vertical scalability can we expect from Bulldozer-based servers?” – David Roff

Bulldozer will fit into the current “Maranello” and “San Marino/Adelaide” platforms. “Maranello” is our high performance platform that will support up to 4 CPUs.  Combining a “Maranello” platform with the upcoming 16-core “Interlagos” processors, the total core density of a 4P system will reach as many as 64 cores.

The 8P x86 market today is pretty small. According to IDC, last year it accounted for roughly 7,915 total servers, down 26% from the year before (Source: IDC Quarterly Server Tracker, Q4 2009). If you want to say that 2009 was a bad year, from 2007 to 2008 the 8P x86 market was essentially flat as well, so that isn’t a growth engine. Part of what is impacting that market is the core and memory densities of today’s systems.  People bought 8P servers to get to 48 cores (8 x 6-core) or to get to large memory footprints. Today’s 4P systems are meeting those needs at a lower price, with lower power consumption and lower latency. When we get to 2011 with “Bulldozer,” you’ll see an increase up to 64 cores, and we expect the total memory footprint will increase again.

The bottom line is, you’ll get the 64 cores that you want, you’ll just have to spend a lot less to get them; is that OK?

“As far as power usage goes, from what I understand BD is supposed to be taking power management features to a level of granularity that hasn’t been seen yet with consumer/business grade CPUs. Will those new features be available to current MC users or will a platform upgrade be necessary? Can you elaborate on any new power saving features that would make a business want to consider BD at this time?” – Jeremy Stewart

Current “Maranello” platforms with AMD Opteron™ 6100 Series processors already have the hooks embedded in them for any “Bulldozer”-level power efficiency features.  When we specified the platforms for today’s processors, we did so with “Bulldozer” in mind.

As we have said already in this blog, we expect the shared architecture to provide us with a great deal of power savings – there are a lot of circuits that are essentially being duplicated in today’s multicore processors. Having a new “from the ground up” design allowed us to take a very close look at the circuits and determine which ones are ripe for consolidation and which ones really need their own dedicated resources.

We started with inherently power-efficient microarchitecture and implementation that included dynamic sharing of shared resources, minimized data movement and took advantage of extensive clock and power gating. From there, we added active management support that allows us to digitally measure activity in order to estimate power. Support for chip-level core power gating was also added to the processor.

These new features join existing AMD Opteron processor technologies such as AMD PowerNow!™, AMD CoolCore™, low voltage DDR-3 memory support and more, all working in concert to help create a power efficient system.

Even though you’ll see processors with 33% more cores and larger caches than the previous generation, we’ll still be fitting them into the same power and thermal ranges that you see with our existing 12-core processors.

http://blogs.amd.com/work/2010/08/10/20-qu...ulldozer-style/

http://blogs.amd.com/work/2010/08/23/%E2%8...ions-round-one/

http://blogs.amd.com/work/2010/08/30/bulld...2%80%93-part-2/

This post has been edited by jinaun: Sep 5 2010, 12:41 AM
TSjinaun
post Aug 25 2010, 05:08 PM

where are my stars???
Group Icon
Elite
6,139 posts

Joined: Jan 2003
placeholder
AMDAthlon
post Aug 25 2010, 06:07 PM

The future is Fusion
*******
Senior Member
5,221 posts

Joined: Aug 2007
From: Deneb star


Sure..

http://www.youtube.com/watch?v=VIs1CxuUrpc&feature=sub

user posted image
xenon_aniki
post Aug 26 2010, 10:00 AM

Regular
******
Senior Member
1,069 posts

Joined: Nov 2008
From: Kuala Lumpur


im hoping it can head to head with i7 extreme. now thats call badass
davidbilly87
post Aug 26 2010, 10:14 AM

On my way
****
Senior Member
513 posts

Joined: Jul 2007



Only have 2 cores?
billytong
post Aug 26 2010, 01:21 PM

Lord Sauron
*******
Senior Member
4,522 posts

Joined: Jan 2003
From: Mordor, Middle Earth.


QUOTE(xenon_aniki @ Aug 26 2010, 10:00 AM)
im hoping it can head to head with i7 extreme. now thats call badass
*

no use comparing with i7,

they should be comparing with Intel Sandy bridge, if they cant beat Sandy bridge, it is a FAIL product.
SUSMatrix
post Aug 26 2010, 03:24 PM

King of Char Siew!
********
Senior Member
15,022 posts

Joined: Jan 2003
From: Damansara Jaya/Bandar Utama


QUOTE(billytong @ Aug 26 2010, 01:21 PM)
no use comparing with i7,

they should be comparing with Intel Sandy bridge, if they cant beat Sandy bridge, it is a FAIL product.
*
Not neccessary. If they can match 80% of Intel performance at 50% of the price, I'm sold. tongue.gif
Peterbigeyes
post Aug 26 2010, 03:38 PM

Getting Started
**
Junior Member
205 posts

Joined: Jul 2010


QUOTE(Matrix @ Aug 26 2010, 03:24 PM)
Not neccessary. If they can match 80% of Intel performance at 50% of the price, I'm sold.  tongue.gif
*
I sooo agree with that

This post has been edited by Peterbigeyes: Aug 26 2010, 03:38 PM
billytong
post Aug 26 2010, 05:16 PM

Lord Sauron
*******
Senior Member
4,522 posts

Joined: Jan 2003
From: Mordor, Middle Earth.


QUOTE(Matrix @ Aug 26 2010, 03:24 PM)
Not neccessary. If they can match 80% of Intel performance at 50% of the price, I'm sold.  tongue.gif
*

Although I do not have high hope on this, they better be decent. Or else the another time of failure like this is gonna be hitting hard for AMD.

AMD have wake the giant up since AMD64 era, now it will be a lot harder to beat Intel. *I still remember AMD is charging RM4000+ on a piece of CPU while back then it has the best chance of grabbing the market but they got greedy charge ridiculous pricing and also got bottleneck by the Fab production, and couldnt really take on the market share. Less 30% market share with the superior AMD64 isnt really an achievement.

The launch date of the Bulldozer is way too late, I dont think I got the patient to wait. Mostly likely I will gonna be end up with Sandy bridge platform. sweat.gif

This post has been edited by billytong: Aug 26 2010, 05:17 PM
jeopardise
post Aug 26 2010, 11:07 PM

tidak memberi seketul tahi!
******
Senior Member
1,767 posts

Joined: Jan 2003
From: Bandar Seri Putra


AMD taking a risk to introduce a new architecture with 32nm silicon. Hope its works.
shinjite
post Aug 26 2010, 11:28 PM

�ŞħĬΩĵΐŦ��
********
All Stars
19,309 posts

Joined: Jan 2003
From: Klang


Well, to me its technology so hope for the best for AMD smile.gif
Najmods
post Aug 26 2010, 11:33 PM

*mutter mutter mutter mutter*
*******
Senior Member
5,211 posts

Joined: Feb 2005
From: Konohana


QUOTE(davidbilly87 @ Aug 26 2010, 10:14 AM)
Only have 2 cores?
*
No, they just wanted to show the core architecture so they just show one 'block' of the core

QUOTE(jeopardise @ Aug 26 2010, 11:07 PM)
AMD taking a risk to introduce a new architecture with 32nm silicon. Hope its works.
*
Its not a risk, they MUST create new architecture to compete with Intel CPUs. Their desktop its pretty good but their mobile segment are pretty poor as compared to Intel counterparts, both in performance and battery life

The Bulldozer architecture is pretty radical, as it is not a 'true' dual core as it share fetch and decode blocks, FPU and L2 caches and claiming its '80% performance of true dual core' to reduce thermal and power envelope. I hope there is more to that.
saturn85
post Aug 27 2010, 12:02 AM

Folding@home
*******
Senior Member
8,686 posts

Joined: Mar 2009



QUOTE(billytong @ Aug 26 2010, 05:16 PM)
AMD have wake the giant up since AMD64 era, now it will be a lot harder to beat Intel. *I still remember AMD is charging RM4000+ on a piece of CPU while back then it has the best chance of grabbing the market but they got greedy charge ridiculous pricing and also got bottleneck by the Fab production, and couldnt really take on the market share. Less 30% market share with the superior AMD64 isnt really an achievement.
*
which model of processor that hit rm4k that time? unsure.gif
LittleLinnet
post Aug 27 2010, 12:15 AM

Iophobia
*******
Senior Member
3,593 posts

Joined: Feb 2005
From: ***Penang***
QUOTE(saturn85 @ Aug 27 2010, 12:02 AM)
which model of processor that hit rm4k that time? unsure.gif
*
the FX series I believe.
shinjite
post Aug 27 2010, 12:56 AM

�ŞħĬΩĵΐŦ��
********
All Stars
19,309 posts

Joined: Jan 2003
From: Klang


I think its definitely the Athlon FX series biggrin.gif

This post has been edited by shinjite: Aug 27 2010, 12:57 AM
jeopardise
post Aug 27 2010, 06:33 AM

tidak memberi seketul tahi!
******
Senior Member
1,767 posts

Joined: Jan 2003
From: Bandar Seri Putra


QUOTE(Najmods @ Aug 26 2010, 11:33 PM)
No, they just wanted to show the core architecture so they just show one 'block' of the core
Its not a risk, they MUST create new architecture to compete with Intel CPUs. Their desktop its pretty good but their mobile segment are pretty poor as compared to Intel counterparts, both in performance and battery life

The Bulldozer architecture is pretty radical, as it is not a 'true' dual core as it share fetch and decode blocks, FPU and L2 caches and claiming its '80% performance of true dual core' to reduce thermal and power envelope. I hope there is more to that.
*
Well from your reply still sounds like AMD taking RISK, because they MUST (take risk to) create new architecture to compete with Intel's 32nm technology tongue.gif No denying about that. They also risks delaying 32nm transition where Intel alreay ahead of AMD. So far performance scaling looks promising but it is based on simulation based on architecture design.

The longer Bulldozer is delayed, the greater the chance that it'll debut into the teeth of new six-core and eight-core Sandy Bridge products.
AMDAthlon
post Aug 27 2010, 07:35 AM

The future is Fusion
*******
Senior Member
5,221 posts

Joined: Aug 2007
From: Deneb star


QUOTE(billytong @ Aug 26 2010, 05:16 PM)
Although I do not have high hope on this, they better be decent. Or else the another time of failure like this is gonna be hitting hard for AMD.

AMD have wake the giant up since AMD64 era, now it will be a lot harder to beat Intel. *I still remember AMD is charging RM4000+ on a piece of CPU while back then it has the best chance of grabbing the market but they got greedy charge ridiculous pricing and also got bottleneck by the Fab production, and couldnt really take on the market share. Less 30% market share with the superior AMD64 isnt really an achievement.

The launch date of the Bulldozer is way too late, I dont think I got the patient to wait. Mostly likely I will gonna be end up with Sandy bridge platform. sweat.gif
*
Launch date of Bulldozer and Sandy Bridge is about the same Q1-Q2..

Thats why next year is an interesting year drool.gif
Najmods
post Aug 27 2010, 11:59 AM

*mutter mutter mutter mutter*
*******
Senior Member
5,211 posts

Joined: Feb 2005
From: Konohana


QUOTE(jeopardise @ Aug 27 2010, 06:33 AM)
Well from your reply still sounds like AMD taking RISK, because they MUST (take risk to) create new architecture to compete with Intel's 32nm technology tongue.gif No denying about that. They also risks delaying 32nm transition where Intel alreay ahead of AMD. So far performance scaling looks promising but it is based on simulation based on architecture design.

The longer Bulldozer is delayed, the greater the chance that it'll debut into the teeth of new six-core and eight-core Sandy Bridge products.
*
Simulated are nothing compared to real life performance. At least I would like to see L2/L3 cache sizes and clockspeed. They delaying far too long, unless they have something good up their sleeve
jeopardise
post Aug 27 2010, 08:31 PM

tidak memberi seketul tahi!
******
Senior Member
1,767 posts

Joined: Jan 2003
From: Bandar Seri Putra


There is an article I just read last few days but still can't find it.

AMD designed the new chip and uses simulation based on its design to determine how the chip performs compared to its predecessors. Of course it doesn't reflect the real life performance.

QUOTE
Bulldozer is one risky architecture for AMD. At the same time AMD is making its first 32nm CPU on an unproven, new and risky process and the company is making a completely new core, something that it hasn't done for years. When you do both of these things at once, you kind of have a recipe for potential disaster. AMD might get lucky this time and we're hoping that it will have more luck than with its transition to 65nm and the K10 native quad-core. As you probably remember, that transition was anything but smooth.

source: http://www.fudzilla.com/processors/process...mds-llano-is-k8


Performance/Power/Price would always be attractive at AMD side. brows.gif Hoping that their will manage to increase efficiency even closer to Intel's 32nm.
kingkingyyk
post Aug 29 2010, 10:20 AM

10k Club
Group Icon
Elite
15,694 posts

Joined: Mar 2008
29-8-2010 News :
AMD Zambezi to Use AM3+ Platforms
Advanced Micro Devices said that its next-generation desktop processors code-named Zambezi will use socket AM3+ platforms, which will be backwards compatible with the firm's existing AM3 products. While the latter is an advantage for the platform, it may be a disadvantage for eight-core processors based on Bulldozer micro-architecture.

"The existing G34 and C32 server infrastructure will support the new Bulldozer-based server products. In order for AMD’s desktop offering to fully leverage the capabilities of Bulldozer, an enhanced AM3+ socket will be introduced that supports Bulldozer and is backward-compatible with our existing AM3 CPU offerings," an official from AMD said in an interview with Planet3DNow web-site.

Apparently, it was possible for AMD to make Bulldozer microprocessors compatible with existing AM3 infrastructure, but in order to do that, the company would have to sacrifice certain important features of the new core.

"When we initially set out on the path to Bulldozer we were hoping for AM3 compatibility, but further along the process we realized that we had a choice to make based on some of the features that we wanted to bring with Bulldozer. We could either provide AM3 support and lose some of the capabilities of the new Bulldozer architecture or, we could choose the AM3+ socket which would allow the Bulldozer-base Zambezi to have greater performance and capability," the official said.

The compatibility with older microprocessors allows AMD to simplify transition to the new micro-architecture and process design since the new AM3+ platform will support inexpensive chips from day one. However, such compatibility also means that AMD Zambezi processors will only support dual-channel memory controller. Considering the fact that all modern high-end Intel Core i7 processors with up to six cores feature triple-channel memory controller, it is unclear how AMD plans to "feed" eight cores of Zambezi with dul-channel DDR3 without creating bottlenecks.

AMD Bulldozer-based processor code-named Zambezi will have up to eight cores along with a new TurboCore dynamic acceleration technology. Thanks to the new micro-architecture the chip promises to be faster than existing AMD products.

Yup, they are going to AM3+, but AM3+ mobo still can support AM3 processors.
It is good for your pocket biggrin.gif

115 Pages  1 2 3 > » Top
 

Change to:
| Lo-Fi Version
0.0218sec    0.48    6 queries    GZIP Disabled
Time is now: 27th November 2025 - 12:13 AM