Evaluating AMD's TrueAudio and Mantle Technologies with Thief
by Ryan Smith on March 18, 2014 1:15 AM EST
Scheduled for release today is the 1.3/AMD patch for Thief, Square Enix’s recently released stealth action game. Following last month’s Battlefield 4 patch, Thief is the second big push for AMD’s recent Radeon technology initiative, becoming the second game to support Mantle and the first game to support TrueAudio Technology.
Thief has been something of a miss from a Metacritic perspective, but from a technology perspective it’s still a very big deal for AMD and Radeon owners. It is the second game to support Mantle and the first single-player game to do so. Furthermore, for AMD it shows that Mantle has support from more developers than just EA and other Frostbite 3 users, with Square Enix joining the fray. Finally, it’s the first Unreal Engine based game to support Mantle, which can be particularly important since Unreal Engine 3 is so widely used and we expect much the same for the forthcoming Unreal Engine 4.
But more excitingly the release of this patch heralds the public release of AMD’s TrueAudio technology. Where Battlefield 4 was the launch title for Mantle, Thief is the launch title for TrueAudio, being the first game to receive TrueAudio support. At the same time it also marks the start of AMD enabling TrueAudio in their drivers, and the start of their TrueAudio promotional campaign. So along with Thief, AMD is also going to be distributing demos to showcase the capabilities of TrueAudio, but more on that later.
Diving right into matters then, at the end of last week AMD was able to get the press access to today’s patch, giving us a short opportunity to look at Thief from both a Mantle perspective and a TrueAudio perspective. As this week coincides with GDC 2014 we haven’t had a ton of time to spend with Thief, so this overview is going to be relatively brief, but it has given us enough time to play with both of AMD’s technologies.
As this is a brief overview we’re going to skip recapping the technical details behind Mantle and TrueAudio. But if you haven’t read our previous works on those subjects, you can find more details on TrueAudio and Mantle in their respective articles.
Finally, launching alongside today’s Thief patch will be the latest rendition of AMD’s Catalyst drivers, Catalyst 14.3 Beta 1 (build 13.1350.1005). We don’t have a change log for these drivers at this point – expect one to be posted alongside the drivers today – but the important point is that these are the drivers intended to be used alongside the newly patched Thief and AMD’s TrueAudio demos.
|CPU:||Intel Core i7-4960X @ 4.2GHz|
|Motherboard:||ASRock Fatal1ty X79 Professional|
|Power Supply:||Corsair AX1200i|
|Hard Disk:||Samsung SSD 840 EVO (750GB)|
|Memory:||G.Skill RipjawZ DDR3-1866 4 x 8GB (9-10-9-26)|
|Case:||NZXT Phantom 630 Windowed Edition|
|Video Cards:||AMD Radeon R9 290X, AMD Radeon R7 260X|
|Video Drivers:||AMD Catalyst 14.3 Beta 1|
|Headphones:||Sennheiser PC 360|
|OS:||Windows 8.1 Pro|
First and foremost, let’s talk about Mantle. Whereas Battlefield 4 was primarily a multiplayer game, Thief is the first single player game to gain Mantle support. So although Thief isn’t the first Mantle game, by virtue of being a single player game it presents gamers and Mantle with a very different and much more tightly structured workload to work off of. Perhaps more importantly, since it is a single player game it has a much more consistent performance profile than Battlefield 4, and better still it even has a built in benchmark to go with it.
On the whole, Thief is a better than average game from a graphics technology perspective. It is a multi-platform title based on Unreal Engine 3, and at higher quality settings it includes a number of graphical features such as tessellation, contact hardening shadows, and even supersample anti-aliasing (achieved through internally rendering at a higher resolution). However, even with those effects, Thief is much easier to CPU bottleneck than Battlefield 4. On our fastest video cards it tends to be SSAA (or very high resolutions) that leads to Thief being GPU bottlenecked; without SSAA it otherwise becomes CPU bottlenecked at 1080p.
When it comes to being CPU limited, Thief’s preferences are clear: 4 cores with as much performance per thread as you can throw at it. This leads to Thief strongly favoring Intel CPUs – first the quads and then the dual cores – with AMD’s CPUs and APUs falling into place after that. As a result of these CPU bottlenecks Thief can trend very close to being a best case scenario for Mantle, so long as it’s not outright GPU bottlenecked.
With that in mind we quickly took a look at Thief’s Mantle performance on an R9 290X (Uber mode to rule out throttling) and an R7 260X to cover both a high-end GPU and a mainstream GPU. Furthermore we tested both of those cards with a variant of the game’s Very High settings (dropping SSAA down to Low in exchange for 16x AF) as well as with the game’s Low settings. Finally we ran the above against both a high-end CPU configuration of 6 cores/12 threads at 4.2GHz, and a low-end configuration of 2 cores/4 threads at 3.3GHz.
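As a rough illustration of how many runs that test plan implies, the combinations can be enumerated in a few lines of Python. This is only a sketch: the labels below are our own shorthand for the configurations described above, not names used by the game, the driver, or the benchmark.

```python
from itertools import product

# Shorthand labels for the configurations described in the text.
# These strings are illustrative only; they are not identifiers from Thief or Catalyst.
GPUS = ["R9 290X", "R7 260X"]
SETTINGS = ["Very High (SSAA Low, 16x AF)", "Low"]
CPUS = ["6C/12T @ 4.2GHz", "2C/4T @ 3.3GHz"]
APIS = ["Direct3D", "Mantle"]


def test_matrix():
    """Return every (gpu, settings, cpu, api) combination as a list of tuples."""
    return list(product(GPUS, SETTINGS, CPUS, APIS))


runs = test_matrix()
print(len(runs))  # 2 GPUs x 2 settings x 2 CPUs x 2 APIs = 16 runs
for gpu, settings, cpu, api in runs:
    print(f"{gpu:8} | {settings:30} | {cpu:16} | {api}")
```

Each Direct3D/Mantle pair in that list corresponds to one comparison in the results that follow.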
On a quick side note, AMD included the following notes with their instructions for testing Thief. In short, Mantle is up and running for all compatible AMD cards, but multi-GPU is not yet working, and memory management is in need of further optimization.
- Mantle performance for GPUs with 2GB framebuffers will receive additional optimization in a future application patch for Thief™. Currently, these products may see limited gains in scenarios requiring large amounts of video memory (e.g. maximum detail settings with SSAA enabled).
- Multi-GPU support under the Mantle codepath will be added to Thief in a future application patch.
- As with other first-person titles, relatively smaller gains will be observed in GPU-bound scenarios
Looking first at the R9 290X, we can see that even at our modified Very High settings, there are still some small performance gains to be had from enabling Mantle. Switching out Direct3D for Mantle gets us another 3.6fps, or a 5% boost in performance. As we would expect however, a far more significant gain can be found when using Low settings. Here we can see the 290X top out at 86.8fps with D3D – indicating that our earlier Very High settings weren’t all that far from being CPU bottlenecked – while Mantle boosts that up to 117.7fps, for a gain of 30.9fps or 36%.
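For readers who want to check the arithmetic, the Low settings figures above work out as follows. The helper name `mantle_gain` is our own, used purely for illustration; the two framerates are the ones quoted in the text.

```python
def mantle_gain(d3d_fps, mantle_fps):
    """Return (absolute gain in fps, relative gain in percent) over the Direct3D result."""
    delta = mantle_fps - d3d_fps
    percent = 100.0 * delta / d3d_fps
    return delta, percent


# R9 290X, Low settings, 6 core CPU: 86.8fps (D3D) vs. 117.7fps (Mantle)
delta, percent = mantle_gain(86.8, 117.7)
print(f"{delta:.1f}fps, {percent:.0f}%")  # 30.9fps, 36%
```

Note that the percentage is always taken relative to the Direct3D baseline, which is why a similar absolute gain looks larger on a slower configuration.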
From a practical perspective we would expect most 290X owners to be playing at settings similar to Very High, so the performance gains, though appreciated, aren’t especially influential in the long run. But it does give us some idea of what to expect.
Meanwhile if we start slowing down the CPU to just 2 cores at 3.3GHz, we can see the Mantle performance advantage grow. In this CPU bottlenecked scenario the performance gains from enabling Mantle are anywhere between 33% for Very High settings to a rather sizable 49% when using Low settings. This scenario, though contrived, makes for a good reminder of how significantly the current Direct3D rendering pipeline can bottleneck a GPU in the wrong (right?) circumstances.
Moving on to the 260X, to no surprise we’re completely and utterly GPU bottlenecked with our Very High settings. The performance gains with Mantle are inconsequential at best, indicating that Mantle isn’t being used to significantly alter the rendering process on the GPU itself.
Shifting over to Low settings still leaves our setup GPU bottlenecked when testing against the 6 core setup, however we do see a very distinct performance gain on the 2 core setup. In this scenario enabling Mantle is worth an 11.1fps boost, or 25%, pushing the framerate up to 55.4fps.
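One way to reason about why the gains only show up in some of these configurations is a deliberately simple bottleneck model, sketched below. It assumes that CPU and GPU work overlap perfectly, so that frame time is set by whichever side is slower, and that Mantle only reduces the CPU side. Both are simplifying assumptions on our part, and the millisecond figures are hypothetical inputs chosen for illustration, not measurements from Thief.

```python
def fps(cpu_ms, gpu_ms):
    """Frames per second when the slower of the CPU and GPU sets the frame time."""
    return 1000.0 / max(cpu_ms, gpu_ms)


def gain_percent(cpu_ms_d3d, cpu_ms_mantle, gpu_ms):
    """Relative framerate gain from lowering CPU time per frame, GPU time unchanged."""
    before = fps(cpu_ms_d3d, gpu_ms)
    after = fps(cpu_ms_mantle, gpu_ms)
    return 100.0 * (after - before) / before


# Hypothetical numbers: the CPU cost per frame drops from 20ms to 14ms.
cpu_bound = gain_percent(20.0, 14.0, gpu_ms=10.0)  # GPU is faster than the CPU
gpu_bound = gain_percent(20.0, 14.0, gpu_ms=25.0)  # GPU is slower than the CPU

print(f"CPU bound: {cpu_bound:.1f}% gain")
print(f"GPU bound: {gpu_bound:.1f}% gain")
```

In this toy model the same CPU-side saving yields a large gain when the CPU is the limiter and no gain at all when the GPU is, which is the general shape of the 260X results: nothing at Very High, and a meaningful improvement only at Low settings on the 2 core setup.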
Despite this being an artificial test on our GPU testbed, we would consider this to be a very real scenario overall given the price of the 260X. At $119 (MSRP) the 260X is very likely to be paired with a dual-core CPU or equivalent, so to see a meaningful performance gain in this scenario is promising. Whether any other Mantle-enabled single-player games will be this badly CPU limited remains to be seen, but if other games were to behave like Thief, then we may see similar gains on lower-end setups such as this.
Ultimately we’ve only had a limited amount of time with the Thief Mantle patch, so we’ll have to take a look at the competitive landscape another day. But as a pure Mantle analysis Thief is probably the greater beneficiary from Mantle at this time. The gains at the high end aren’t worth writing home about, but since we need the CPU to churn out a fairly high framerate regardless, there’s a much greater opportunity to benefit from Mantle on lower end Intel CPUs and AMD’s CPUs/APUs.
blanarahul - Tuesday, March 18, 2014
Ryan, why don't you measure GPU power consumption like W1zzard from TechPowerUp?
Ryan Smith - Tuesday, March 18, 2014
On this article or in general?
In general we do a system wide measurement because we use a closed testbed, which means it's impractical to tap to measure the individual lines.
On this article in particular we were only doing a (relatively) short overview on performance. There wasn't time to look at much else.
TheElMoIsEviL - Thursday, March 20, 2014
"Ryan, Why don't you talk about how great nVIDIA is and how much AMD sucks? I mean I realize that when AMD had better power consumption, I cheered on the Geforce GTX 480, because I'm dishonest like that, but now that nVIDIA has better power/heat levels I have decided that these things matter"
Is how I interpreted your post blanarahul.
Le Québécois - Tuesday, March 18, 2014
When can we expect a full article/review on Mantle? So far we've seen previews for Thief and BF4, but no fully detailed review of 20 pages or so on the thing. I use it on my 7970 GHz and it seems fast and stable, but it could also just be my imagination tricking me with a "placebo" effect.
nathanddrews - Tuesday, March 18, 2014
Definitely should be some slower CPUs in the testing. Pentium G-series, maybe a C2D/Q. I've seen claims of 50-to-300% gains on forums (with numbers to back it up) for max, average, and minimum frame rates. Even AMD's press coverage made it known that the fastest CPUs wouldn't gain much (maybe 5%), so it would seem fitting to run tests using low-to-mid range hardware.
rms - Tuesday, March 18, 2014
Definitely. On my 6core phenom2 & 290X, I doubled my minimum framerate, and nearly doubled the average. Why would the only graph be with a high-end overclocked intel cpu that shows minimal gains?
Ammaross - Tuesday, March 18, 2014
I would think an AMD APU akin to the one(s) found in the XBone and PS4 would be a REQUIREMENT for any and all games testing (at least for PC-ports).
mikato - Thursday, March 20, 2014
I agree. Using a very high end CPU and just cutting cores and clock rate doesn't give us much hint of the point behind Mantle.
Ryan, you wrote the Mantle portion like you expected everyone to have a high end CPU. I know you tested for lower CPU performance, but that's the angle of the article. See below...
"The gains at the high end aren’t worth writing home about, but since we need the CPU to churn out a fairly high framerate regardless, there’s a much greater opportunity to benefit from Mantle on lower end Intel CPUs and AMD’s CPUs/APUs."
So your thinking is "does Mantle help, and how much?". That question doesn't cover the whole picture. Mantle may lead to purchasing decisions. Someone could buy a lesser CPU and still get close to higher end CPU performance because of Mantle. What about that? I agree that including some lesser CPUs in the testing would really be needed for this as well.
I also have to say, what a lovely sound demo!
savage.r - Friday, April 4, 2014
Of course it should. Today's games are made for today's hardware and APIs, which means Mantle cannot bring a performance improvement if the game itself doesn't need more. The only way is to use a CPU with much lower IPC and more cores; still, most improvements will be seen when optimized games are made from the ground up, like the Star Swarm demo, where I get 300% FPS in the RTS view. The PS4 and Xbox use an 8-core Jaguar architecture, the same one used in ultrathins and tablets! A CPU with a few times higher TDP/IPC and even fewer cores cannot be even remotely comparable to that. But it proves that today's CPUs have much more performance than we really need, since all of today's games cannot optimally use more than 1 core anyway.
Wreckage - Tuesday, March 18, 2014
Mantle is still beta and has yet to be shipped with a game. Give it a few years