Kepler’s impressive efficiency has been considered a major achievement by press and PC gamers, but a key part of the story has never been told.

As the leader of the engineering team that worked with TSMC for three years to manufacture Kepler, I’d like to shed some light on the impact of 28nm process technology on Kepler’s efficiency.

The 28nm GeForce GTX 680 die is 294 mm².

Kepler was an ambitious project because it introduced a new architecture at the same time as a new silicon process technology node. This is a bit like designing a new jet engine using exotic materials that are still in development. Much like the engineers at Pratt & Whitney, we focused intensely on power efficiency, on delivering the best performance per energy unit (watts in our case, gallons of jet fuel in theirs).

The advancement that TSMC offered was a new optimized process technology. Kepler is manufactured using TSMC’s 28nm high performance (HP) process, the foundry’s most advanced 28nm process, which uses their first-generation high-K metal gate (HKMG) technology and second-generation SiGe (silicon germanium) straining. HKMG is a process that uses a gate insulator film with a high dielectric constant, which reduces power by reducing gate leakage compared to the previous-generation SiON gate. SiGe straining is a process that strains the silicon crystal lattice to improve carrier mobility and, with it, the effective frequency of the transistor. Both technical advances improve the performance per watt of the transistor, translating to a more power-efficient system.

Using TSMC’s 28nm HP process enabled us to reduce active power by about 15 percent and leakage by about 50 percent compared to 40nm, resulting in an overall improvement in power efficiency of about 35 percent (see chart). Let me explain why this is so critical.

Today, the primary constraint on processor performance is the power consumption budget. So our goal is always to develop solutions that deliver the highest performance within a fixed power budget. Having a more efficient process enabled us to add more processing cores, thus increasing performance. Put simply, greater efficiency equals greater performance and optimal performance per watt.
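To see roughly how the active-power and leakage figures combine under a fixed power budget, here is a small back-of-the-envelope sketch in Python. It is an illustration only, not the calculation behind the chart: the share of total 40nm power that was leakage is not given in this post, so `leakage_share` is an assumed input, and the helper names `relative_power` and `cores_in_budget` are hypothetical. It also assumes that power scales linearly with core count at a fixed clock.

```python
def relative_power(leakage_share, active_scale=0.85, leakage_scale=0.50):
    """Power on the new node relative to the old node (old node = 1.0).

    leakage_share: assumed fraction of old-node power that was leakage.
    active_scale:  about 15 percent lower active power  -> 0.85
    leakage_scale: about 50 percent lower leakage power -> 0.50
    """
    if not 0.0 <= leakage_share <= 1.0:
        raise ValueError("leakage_share must be between 0 and 1")
    active_share = 1.0 - leakage_share
    return active_share * active_scale + leakage_share * leakage_scale


def cores_in_budget(old_cores, leakage_share):
    """Cores that fit in the old power budget on the new node.

    Assumes identical cores and power proportional to core count.
    """
    return int(old_cores / relative_power(leakage_share))


# Sweep a few assumed leakage shares; none of these values come from the post.
for share in (0.2, 0.3, 0.4):
    power = relative_power(share)
    print(f"leakage share {share:.0%}: "
          f"power x{power:.3f}, perf/W x{1.0 / power:.2f}, "
          f"cores per 100 old cores: {cores_in_budget(100, share)}")
```

Under these assumptions, a leakage share of around 30 percent gives a performance-per-watt gain close to the roughly 35 percent quoted above. Other shares give different results, which is why the sketch treats the share as an input rather than a known value.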

Maximizing the efficiency of 28nm (while developing a new architecture) required us to change our silicon process development model with TSMC. In previous process nodes we had worked independently, with TSMC preparing the process and NVIDIA working on the design. TSMC engineers would do their best to build a volume process platform, and NVIDIA would implement our designs following the guidelines of process design rules and electrical performance.

For Kepler, we began working with TSMC three years before our product tape-out (when the processor design is complete and ready for manufacturing). Together we created a Production Qualification Vehicle (PQV) to allow the TSMC process engineers and our internal design engineers to optimize the process before the product tape-out. Through repeated prototyping, we were able to optimize both the process and design, creating a more efficient Kepler design rather than simply a chip in a standard 28nm process.

TSMC’s 28nm HP process, seen here under an electron microscope, is 30 percent smaller than 40nm and about 35 percent more energy efficient.

We’re extremely proud of what we accomplished with Kepler. It combines NVIDIA’s world-class GPU engineering with TSMC’s very best 28nm process. But while Kepler was a key milestone, it is one point in a continuum. We continue to improve on what we developed and continue our collaboration with TSMC. In fact, we recently received our first version of an enhanced PQV for 20nm from TSMC. That process will yield even greater efficiency for NVIDIA’s next-generation GPUs.

  • Ezequiel Gustavo Martinez

    Very nice… I’m an electromechanical technician and I’d like to know more about this type of thing.

  • BestJinjo

    Joe, thanks for providing a more in-depth explanation of how the collaboration between NVIDIA and TSMC has played a vital role in the efficiency benefits of Kepler products. I have noticed that TSMC’s 28nm HP process has brought about a 30% reduction in transistor size and about 35% better power efficiency.

    On GlobalFoundries’ website it is stated that their 28nm High-k Metal Gate process delivers about twice the gate density of 40nm and up to 60% higher transistor switching speed at comparable leakage to 40nm, with up to 50% lower energy / static power.

    ^ Is that an overly optimistic marketing stance they are taking or does this HKMG transistor offer even better benefits than the process at TSMC? 

    It would have been incredible to see how much power Kepler could have delivered within a 250W TDP envelope, with a GPU die size in the 500-550mm^2 range, however. A chip of that size could easily incorporate dedicated hardware scheduling, and allow for a very potent double-precision GPGPU monster, without giving up excellent gaming performance. Either way, I can’t wait to see what you guys can do with Maxwell on even more advanced process nodes!

  • William Snell

    Your e-retailers are pushing the price to over 1 grand for a card that you priced at 499.99. Please check with them, because there wasn’t any sort of change in the selling price.

  • E Christenson

    Seriously Joe?  You’re singing TSMC’s praises about their process when they are hosing you on yield?  I mean just today we get, “Pacific Crest said tight 28nm supply is a headwind for NVIDIA.”  I sure hope this is a relationship repair effort rather than a pat yourself on the back.  Financially 28nm is a looooong way from success for you.

  • Joe Greco

    Hi, William. Thanks for your comment.

    It is important to note that NVIDIA does not actually set pricing for any of our partners’ products. We do provide suggested e-tail pricing, and partners price their products accordingly, based on bundles, special board designs, overclocked editions, and water cooling.

    For the GTX 680, we haven’t seen pricing over $1000. Newegg, for example, shows pricing from $499 for the standard version to $699 for special overclocked editions.

    If you could provide a link to the pricing you are seeing, that would be helpful.

  • Joe Greco

    Thanks for your note BJ. I wouldn’t want to delve into the waters of competing foundry technologies here. That topic is ample enough to be a PhD-level dissertation by itself.

    We will satisfy your curiosity on your last point pretty soon. Will you be attending our GPU Technology Conference in May? Check the session S0642 – Inside Kepler hosted by Stephen Jones and Lars Nyland.