PDA

View Full Version : nVidia GT300's Fermi architecture unveiled: 512 cores, up to 6GB GDDR5


yose3
12-01-2009, 07:12 AM
nVidia GT300 chip is a computational beast like you have never seen before. In fact, we would go as far out and state that this is as closest as GPU can be to a CPU in the whole history of graphics technology. Now, time will tell whatever GT300 was the much needed revolution.

Beside the regular NV70 and GT300 codenames [codename for the GPU], nVidia's insiders called the GPU architecture - Fermi. Enrico Fermi was an Italian physicist who is credited with the invention of nuclear reactor. That brings us to one of codenames we heard for one of the GT300 board itself - "reactor".
When it comes to boards themselves, you can expect to see configurations with 1.5, 3.0 GB and 6GB of GDDR5 memory, but more on that a little bit later.

GPU specifications
This is the meat part you always want to read fist. So, here it how it goes:

* 3.0 billion transistors
* 40nm TSMC
* 384-bit memory interface
* 512 shader cores [renamed into CUDA Cores]
* 32 CUDA cores per Shader Cluster
* 1MB L1 cache memory [divided into 16KB Cache - Shared Memory]
* 768KB L2 unified cache memory
* Up to 6GB GDDR5 memory
* Half Speed IEEE 754 Double Precision


As you can read for yourself, the GT300 packs three billion transistors of silicon real estate, packing 16 Streaming Multiprocessor [new name for former Shader Cluster] in a single chip. Each of these sixteen multiprocessors packs 32 cores and this part is very important - we already disclosed future plans in terms to this cluster in terms of future applications. What makes a single unit important is the fact that it can execute an integer or a floating point instruction per clock per thread.

TSMC was in charge of manufacturing the three billion transistor mammoth, but it didn't stop there. Just like the G80 chip, nVidia GT300 packs six 64-bit memory controllers for a grand total of 384-bit, bringing back the odd memory capacity numbers. The memory controller is a GDDR5 native controller, which means it can take advantage of built-in ECC features inside the GDDR5 SDRAM memory and more importantly, GT300 can drive GDDR5 memory in the same manner as AMD can with its really good Radeon HD 5800 series. The additional two memory interfaces will have to wait until 28nm or 22nm full node shrinks, if we get to them with an essentially unchanged architecture. You can expect that the lower-end variants of GT300 architecture will pack less dense memory controller for more cost efficiency, especially on the memory side.

GPGPU is dead, cGPU lives!
Just like we reported earlier, GT300 changed the way how the GPU is functioning. If we compare it to the old GT200 architecture, comparisons are breathtaking. Fermi architecture operates at 512 Fused Multiply-Add [FMA] operations per clock in single precision mode, or 256 FMA per clock if you're doing double precision.
The interesting bit is the type of IEEE formats. In the past, nVidia supported IEEE 754-1985 floating point arithmetic, but with GT300, nVidia now supports the latest IEEE 754-2008 floating-point standard. Just like expected, GT300 chips will do all industry standards - allegedly with no tricks.

A GPU supports C++ natively?
Ferni architecture natively supports C [CUDA], C++, DirectCompute, DirectX 11, Fortran, OpenCL, OpenGL 3.1 and OpenGL 3.2. Now, you've read that correctly - Ferni comes with a support for native execution of C++. For the first time in history, a GPU can run C++ code with no major issues or performance penalties and when you add Fortran or C to that, it is easy to see that GPGPU-wise, nVidia did a huge job.

To implement ISA inside the GPU took a lot of bravery, and with GT200 project over and done with, the time came right to launch a chip that would be as flexible as developers wanted, yet affordable.

In a nutshell, this is just baseline information about what nVidia is going to introduce in the next couple of weeks. Without any doubt, we can see that nVidia reacted to Larrabee by introducing a part that is extremely efficient, natively support key industry standards and more importantly, doesn't cost an arm and a leg.

The line-up is consisted out of high-end consumer part [GeForce], commercial [Quadro] and scientific [Tesla]. You can expect memory sizes from 1.5GB for consumer GeForce 380 to 6GB for commercial Quadro and Tesla parts.

All rights reserved To © 2009 Bright Side Of News.
by: Theo Valich

http://www.brightsideofnews.com/news/2009/9/30/nvidia-gt300s-fermi-architecture-unveiled-512-cores2c-up-to-6gb-gddr5.aspx

Radiator
12-01-2009, 11:31 AM
Uhh... I heard the GT300 wont really have the performance it's hyped up to have ( then again who knows at this point , could be false ) due to the new architecture , etc ( like the FX5200 series - new technology , not nearly enough power to run it , some poor decisions made during developing ) .
I do hope its going to be a beast so ATi's cards prices would fall down .
1.5 GB of vRAM is a bit of an overkill , though .

LycosIX
12-01-2009, 02:28 PM
The 5970 has 4 billion transistors what's the fermi

ultima
12-01-2009, 11:13 PM
3 billion

anyways...

until it comes out we won't know for sure

even if it works like it stated

it'll be like the PS3
more expensive than the XBOX yet does the same thing

yose3
12-02-2009, 02:47 AM
The 5970 has 4 billion transistors what's the fermi

well we will have to see benchmarks my friend until it comes out we can't be expeculating if u get my point...


just wait until the "monster come out"

LycosIX
12-02-2009, 04:42 AM
is it gonna to be out by march? i want it out to slash the 5770 prices

Radiator
12-02-2009, 02:03 PM
Well it SHOULD be released Q1 next year , but who knows ...

Toxict51
12-05-2009, 04:44 PM
dudes sorry to interrupt but i liked the subject the gt 300/310
is not theperformance cards jut like any first card in any geforce series
gt 210,9400 gt,310 gt,8400 gs etc.......
anyways here is a link check it out
http://techwizardz.blogspot.com/2009/12/nvidia-geforce-310-gpu-features.html
or this :
http://www.xbitlabs.com/news/video/display/20091126111148_Nvidia_Quietly_Unveils_GeForce_310_ GeForce_205_Graphics_Cards.html

-RK
12-05-2009, 08:07 PM
The "GT300" refers to the core of the gtx380, not the gt310 card.

There is really no comparison between the two.

Both NV and ATI tend to release the high end parts of a line before the low end cards.

WukaChop
12-15-2009, 03:55 PM
It would be good, but no card is ever going to use 6GB of RAM to itself, I don't even think photo realistic graphics would use that much

saintjimmy
12-16-2009, 01:37 AM
Nothing will use that much until 2011 maybe! It's all a matter of time. Heck, the poorly optimized Grand Theft Auto 4 can use 1 GB of video RAM by itself. I'm sure other games like Crysis, Call of Duty series, Fallout 3, etc... use quite a bit of vRAM, too. Besides, I think that insane amount of memory would be useful in parallel processing, drafting maybe, physics calculations, & other important stuff for big, rich, & important companies.

yose3
12-19-2009, 02:34 AM
well alot of Vram is not bad at all it mean is future proof who know what may be next in Graphics situation

Radiator
12-19-2009, 08:33 PM
Nothing will need that much vRAM ( counting out shitty console to PC ports ) until 2012 when new consoles come out = the graphics and physics in games are going to improve a lot , atleast as far as gaming goes ( talking about cards with 1.5-2GB of vRAM , 6 GB will most likely be useless for half a decade at the very least ) .
Sadly the devs aren't really going to try to improve anything by much if a console can't handle it ( there are a few exceptions to this , though ) .
However GPGPU tasks , high amounts of vRAM can prove useful .

-RK
12-20-2009, 01:54 AM
I seriously doubt that games in the near future (well, in the next year or so) will be able to utilize more than 2gb vram at normal resolutions for rendering. Maybe with ATI's eyefinity it could be justified, but I doubt we'll see it for a normal monitor.

On the other hand, we may end up seeing more and more CPU tasks being handled by the GPU - likely even in games - which would definitely justify extra ram with an extremely quick access time.

Die Auslese
01-17-2010, 05:39 AM
I'm gonna take a stab at the rest of the specs of the GTX380 (Fermi), but I know I'm gonna be wrong. Since the GTX280 had 80 TMU, and 32 ROPS (64, and 16 last gen), I'm basing my estimation at 128 TMU's (8 per Cuda Cluster), and 64 ROPS. It might be less TMU's for all I know. Maybe see a 1000mhz on the memory, and because of the 40nm manufacturing 733+ on the core, which would give it 94k Texture 46k Pixel Fill Rate, and a Memory bandwidth of 192GB/s. this is all speculation, and guessing. But if something has that much power it definitely has the potential to crush everything except the 5970.

borkus
01-18-2010, 10:52 PM
Any word on the release date? I've still yet to find anything concrete on the internet. Just loose rumors of a late February to mid-March release.

saintjimmy
01-19-2010, 12:51 AM
http://www.hardocp.com/article/2010/01/17/nvidias_fermi_gf100_facts_opinions

That's the only legit pre-release info out there right now that I know of.

yose3
01-26-2010, 05:53 AM
http://www.hardocp.com/article/2010/01/17/nvidias_fermi_gf100_facts_opinions

That's the only legit pre-release info out there right now that I know of.


yes every body locked at that page

-RK
01-27-2010, 01:50 AM
yes every body locked at that page

Hey, he posted it even before Steve got to it.