Friday February 10, 2012 5:07 PM AEST

CPU and GPU now, the convergence goes on

By The Inquirer
10:02 Nov 2, 2009 | 6 Comments
Tags: CPU | GPU | CPGPU | graphics | processor | news
 »
CPU and GPU now, the convergence goes on

Part 1: Fusion or fission?

Over the past two years, CPU and GPU capabilities have started to converge, but at a snail's pace. While GPUs can now handle branching code and double-precision IEEE floating-point operations, we still don't have a GPU that can run generic C or Fortran code, at least not until the Nvidia GT300 comes out, or that can access system memory and support the paged virtual memory model to boot a modern OS, and not even the GT300 will be able to do that.

On the other hand, any attempts by CPUs to handle real time 3D graphics really quickly have, up to now, faltered miserably. So, neither have CPUs replaced GPUs, nor have GPUs come much closer to replacing CPUs. How far along is each camp now? And how will they perform within our crystal ball's prediction capacity horizon of, say six months from now? Let's look at the most favourite common metric between the two camps - the GFLOPS peak floating-point instruction rate, in double-precision of course.


CPU speedups
Intel's 32nm Westmere 6-core chip is the next major step in Chipzilla's roadmap. The flagbearer Westmere, in its Gulftown-EP dual CPU configuration and, a month or two later, single CPU desktop configuration, will provide 50 per cent more cores matched by 50 per cent more L3 cache at 12 MB, an improved memory controller able to support DDR3-1600MHz even as server memory by default, all within roughly the same clock speed range and die size as the current Nehalem chips.

If running at the standard non-Turbo mode 3.33GHz, the single Gulftown CPU will give you 80 GFLOPS of raw double-precision floating-point power, or twice that, 160 GFLOPS, in a dual-CPU workstation configuration. I do expect 3.6GHz parts to appear in the Gulftown stable too, before mid-2010.

While this sounds far below top numbers for the current GPUs, keep in mind this is fully general purpose floating-point for any application out there, today or tomorrow. No fancy programming tricks or new code needed. And, as usual, don't be surprised to see most of these Gulftowns doing well at north of 4GHz with even simple overclocking. How about 200 GFLOPS in a dual processor workstation by your deskside? With a grand total of six channels of DDR3-1600MHz server ECC memory, or DDR3-2000MHz desktop memory, these CPUs shouldn't be waiting for main memory data for too long.

AMD's Magny Cours, as two Istanbul dies in a single chip package, will at the same time pack twelve slower cores, probably not clocked higher than 2.4GHz to start. These two dies together will pack the same amount of total L3 cache as a single Intel Gulftown, but in theory will still be able to churn out over 110 GFLOPS of total peak double-precision floating-point power, or around 55 GFLOPS per die. Hopefully in the same time frame AMD will be able to speed up the single die Istanbul to above 3GHz, especially for the eventual desktop version.

The real floating-point throughput advances for both Intel and AMD CPUs should be seen in their end-2010 next generation cores, the Sandy Bridge for Intel - yes, the one with an integrated on-die GPU for some flavours - and the Bulldozer for AMD. Both should offer double the peak double-precision floating-point performance throughput per clock, enabling roughly 200 GFLOPS peak number munching power in a 4GHz, 12-core dual chip workstation setup, for example.

 
 »
 
Behind the scenes with Mass Effect 3! GTX 560 VGA round-up! Essential Skyrim tweaks to improve your game! Plus reviews, news, hardware, more games, and easy to following modding guides for PC builders. ON SALE NOW!
6 Comments
pkroeze
Nov 2, 2009 10:20 AM
i think it will be a long time before we get an intergrated CPU/GPU because that would mean less money for all companies involved. take AMD/ATI if they do not merge these technologies they sell 2 cores for every PC but once they merge CPU and GPU they only sell 1. On the other side Nvidia and intel probably won't try it either first of all because of legal bindings between the 2 but also if Intel tried to make a cpu/gpu they would have a hard time selling them until these technologies match seperate CPU and GPU's.
Jeruselem
Nov 2, 2009 10:42 AM
I'm not a fan of absolute single point of failure given how hot CPUs and GPUs get.
tunksy
Nov 2, 2009 10:46 AM
thanks for a very intresting read.
thesorehead
Nov 2, 2009 1:01 PM
As Jerusalem said: top-of-the-line GPU and CPU parts are going to need to be separate for some time simply for the purposes of heat dissipation.

However, GPUs integrated into the MOBO have been fine for "home/office" use where all you need is the Windows desktop and a bit of Java/Flash/whatever shininess. I can see a scenario where Intel/AMD compete in this budget-conscious arena with a fully-integrated system.
CK
Nov 2, 2009 11:13 PM
How about a motherboard with 2 Separate sockets on it? One the CPU and other the GPU. Both spaced apart enough to get aftermarket heat sinks on them for cooling/overclocking, Able to both access system memory when they need it(hopefully DDR5 or something for the GPU's sake).Didn't P55 just do away with an extra chip on the motherboard? Just coincidence maybe???
omega
Nov 3, 2009 1:23 PM
CK, then you'd have things like an AMD/Intel, AMD/Nvidia (would they even...), AMD/Intel(?), Intel/ATI, Intel/Nvidia, Intel/Intel(?) motherboard options as I dont see ATI/Nvidia and Intel all sharing the same socket type for their GPU's.

It would make buying a mobo a lot harder as you dont know which company (blue, green or red) will have the best GPU in 2 years time when you upgrade.
Comments have been disabled on this article.
 
Latest Competitions
 
Atomic Magazine

Issue: 133 | February, 2012

Atomic is a magazine aimed squarely at computer enthusiasts, gamers, and serious PC upgraders.

Every month we bring you the latest reviews of new technology and PC components, in depth features on everything from overclocking to console hacking, and gaming previews and interviews.
 
Latest Comments
 
Latest User Reviews
Battlefield 3 is the new benchmark online FPS
90%
A very fun and realistic multiplayer ride.
 
Antec Kuhler 920 - liquid cool
90%
Antec Kuhler 920 silent but effientive out of the box no maintence water cooling kit
 
Antec's Lanboy Air - our new favourite case
90%
Antec Lan boy Air in red a very cool design
 
Antec's Lanboy Air - our new favourite case
90%
This product overall is awesome.
 
MSI's GT780 laptop as fast as it gets
90%
Nice laptop
 
 
Close Get the February, 2012 issue of Atomic mailed to you for $8.95, including postage.

SubscribeBuy nowDigital Version