The Truth About AMD Ryzen’s Performance Issues

After AMD released Ryzen, reviewers and users alike were quick to throw around theories, and the debate has gone on without a clear answer. Many people blamed the scheduler; others blamed SMT. Thanks to two unnamed theory crafters and the help of nwgat, we can now take a closer look at the actual cause. Let’s take a look, shall we? Update 2017-03-16: A user on Reddit apparently got a response from AMD confirming that there is indeed only one memory controller on Ryzen (Infinity Fabric). This confirms that there is indeed a bottleneck on the CPU itself.

The future of AMD Encoder: HEVC on RX 4xx, Pre-Pass, VBAQ and more!

AMD released version 1.4 of the AMF SDK back in January, and I quickly got to work familiarizing myself with the changes and the new sample tools. I ran some rather extreme tests with the encoder, which led me to report eight issues to the AMF Issue Tracker, mostly GPU crashes or encoding failures. With that out of the way, Patrons have now finally received the first official pre-release build, just as I promised back in December 2016. But what’s actually inside?

What is the fastest way to get an Inverse Square Root? (Part 2)

Part 1: What is the fastest way to get an Inverse Square Root? Last time I covered the fastest way for single precision floating point numbers, but what about double precision? Do they behave the same, or will we run into even more issues on older hardware? Since the market is starting to look towards 128-bit integers and quad precision floating point numbers, it’s time to test this one too while it’s still ‘fresh’.

What is the fastest way to get an Inverse Square Root?

Everyone who works with a 2D or 3D development studio knows that eventually you will run into hardware that is too slow. Whether that hardware is what you currently have or what your target demographic has doesn’t matter: you’ll hit this limit, even if you try not to. That’s why some clever people came up with different ways to make things faster: separating the FPU from the IPU, SSE and AVX, and the famous Quake III floating point hack. But which one performs the fastest, and can we make that one even faster?