Posts: 14,914
Threads: 9,727
Thanks Received: 9,120 in 7,272 posts
Thanks Given: 9,937
Joined: 12 September 18
27 May 19, 07:27
Quote:
While the bulk of this morning’s AMD Computex keynote has been on AMD’s 3rd generation Ryzen CPUs and their underlying Zen 2 architecture, the company also took a moment to briefly touch upon its highly anticipated Navi GPU architecture and associated family of products. AMD didn’t go too deep here, but they have given us just enough to be tantalized ahead of a full reveal in the not too distant future. The first Navi cards will be the Radeon RX 5700 series, which are launching in July and on an architectural level will offer 25% better performance per clock per core and 50% better power efficiency than AMD’s current-generation Vega architecture. The products will also be AMD's first video cards using faster GDDR6 memory. Meanwhile AMD isn’t offering much in the way of concrete details on performance, but they are showing it off versus NVIDIA’s GeForce RTX 2070 in the AMD-favorable game Strange Brigade.
A Peek At the Navi GPU Architecture
Compared to today’s Ryzen information AMD is being a lot more modest – there’s no specific SKU information for a start – but they are offering us our first architectural details on Navi since we learned in 2016 that it would bring “next gen memory” support. First and foremost, AMD has been tinkering with their GPU architecture to improve both the effective IPC and memory bandwidth efficiency. In what the company is calling their Radeon DNA (RNDA) archtiecture – I should note that it’s not clear if this is a branding exercise to downplay the GCN family name or if it's a more heavily overhauled architecture – AMD has revealed that Navi’s compute units have been redesigned to improve their efficiency. AMD's press materials also note that, regardless of the above changes, the size hasn't changed: a single CU is still 64 stream processors.
Altogether, a Navi core/CU should be 25% faster than a Vega core on a clock-for-clock basis. The devil is in the details of course – AMD's data is based off of their internal testing, taking the geomean of 30 games tested at 3840x2160 with Ultra settings and 4x AA, so it's not a holistic view of just the core architecture – but it’s still potentially one of AMD’s largest GPU IPC gains in the last several years. AMD’s presentation also noted that this was part of a larger streamlining of the graphics pipeline, which is designed for both higher IPCs and high clockspeeds. For reference, on the same process AMD has been able to push Radeon VII well past its comfort/efficiency point to 1750MHz, so it will be interesting to see how Navi compares.
Feeding the beast is a new multi-level cache hierarchy. AMD is touting that Navi’s cache subsystem offers both higher performance and lower latency than Vega’s, all for less power consumption. AMD has always been hamstrung a bit by memory/cache bottlenecks, so this would be a promising development for AMD’s GPU architecture. Meanwhile for a bit of reference, Vega already implemented a more modern cache hierarchy, so it would seem unlikely that AMD is changing their cache levels or what blocks are clients of which caches.
AMD's memory controllers themselves have also been updated. Long expected, and confirmed in the press release that went out after Dr. Lisa Su's keynote, the upcoming RX 5700 series cards use GDDR6 memory, which should give AMD's cards a hearty bandwidth bump over their comparable GDDR5 Polaris cards.
It's also worth noting that, fittingly, the new Navi parts support PCI Express 4.0 as well. This will actually be AMD's second GPU with PCIe 4.0 functionality – Vega 20 was first, but it's only enabled in the Radeon Instinct parts – so this is the first time it's enabled in a consumer part. Of course to make full use of it you'll need a PCI 4.0-capable host, which AMD is happy to sell you as well. As for the performance impacts, these remain to be seen. Thus far high-end AMD/NVIDIA parts haven't been significantly bottlenecked in games by PCIe 3.0 x16 (or even x8), so it will be interesting to see how much the extra bus bandwidth helps these first generation of parts.
Continue Reading