Theoretical Performance
With 16 pixel pipelines and over 500MHz clock speed, Radeon X800 XT Platinum Edition should have a very large theoretical fill-rate. We'll take a look at the XT and PRO's theoretical rates in comparison to ATI's previous high end offerings:
R420 Theoretical Performance
Core Clock (MHz) | Fill-rate (Mp/s) | Texture Fill-rate (Mt/s) | Triangle (Mtris/p) | Memory Clock (MHz) | Memory Bandwidth (GB/s) | |
X800 XT PE | 520 | 8320 | 8320 | 780 | 560 | 35.8 |
X800 PRO | 475 | 5700 | 5700 | 713 | 475 | 30.4 |
9800 XT | 412 | 3296 | 3296 | 412 | 365 | 23.4 |
9800 PRO | 380 | 3040 | 3040 | 380 | 350 | 22.4 |
9700 PRO | 325 | 2600 | 2600 | 325 | 310 | 19.8 |
X800 XT PE % Difference
Core Clock | Fill-rate | Texture Fill-rate | Triangle Rate | Memory Clock | Memory Bandwidth | |
X800 PRO | 9.5% | 46.0% | 46.0% | 9.5% | 17.9% | 17.9% |
9800 XT | 26.2% | 152.4% | 152.4% | 89.3% | 53.4% | 53.4% |
9800 PRO | 36.8% | 173.7% | 173.7% | 105.3% | 60.0% | 60.0% |
9700 PRO | 60.0% | 220.0% | 220.0% | 140.0% | 80.6% | 80.6% |
X800 PRO % Difference
Core Clock | Fill-rate | Texture Fill-rate | Triangle Rate | Memory Clock | Memory Bandwidth | |
9800 XT | 15.3% | 72.9% | 72.9% | 72.9% | 30.1% | 30.1% |
9800 PRO | 25.0% | 87.5% | 87.5% | 87.5% | 35.7% | 35.7% |
9700 PRO | 46.2% | 119.2% | 119.2% | 119.2% | 53.2% | 53.2% |
When we look at the X800 XT PE, we see that due to the number of pipelines enabled, and the clock frequency it operates at, the theoretical pixel fill-rate is over 8 Giga-Pixels per second, which is 46% greater than the 12 pipelined, 475MHz X800 PRO and over 152% greater than the 9800 XT. The Theoretical Geometry rages of between the XT PE and PRO are similar as they both have the same number of Vertex Shaders enabled, hence the rates only vary according to clock speed; in comparison to the 9800 XT the X800 XT PE has nearly a 90% greater theoretical rate thanks to having 50% more units and a higher clock speed. The bandwidth differences between the X800 XT PE and the 9800 XT aren’t as large as the fill-rate differences as GDDR-3 memory is only just ramping in the lower speed ranges at the moment, but there is still over 50% more bandwidth available for the X800 XT.
Looking at the X800 PRO we see that is has a 73% fill-rate boost over the 9800 XT, and the same rate for its theoretical Vertex processing abilities. With a 475MHz memory clock rate the memory bandwidth increase over 9800 XT is about 30%.
3DMark2001SE
Pixel Fill (Mpp/s) | Texel Fill (Mtp/s) | Geometry 1 Light (M tris/s) | Geometry 8 Lights (M tris/s) | Vertex Shader (FPS) | |
X800 XT PE | 3618.8 | 8053.1 | 149.5 | 39.0 | 244.2 |
X800 PRO | 2738.0 | 5501.2 | 132.6 | 35.3 | 245.6 |
9800 XT | 2194.5 | 3241.4 | 93.5 | 20.7 | 249.8 |
9800 PRO | 2025.3 | 2984.7 | 102.4 | 19.3 | 239.7 |
9700 PRO | 1778.9 | 2561.9 | 93.0 | 16.6 | 209.5 |
X800 XT PE % Difference
Pixel Fill (Mpp/s) | Texel Fill (Mtp/s) | Geometry 1 Light (M tris/s) | Geometry 8 Lights (M tris/s) | Vertex Shader (FPS) | |
X800 XT PE | 32.2% | 46.4% | 12.7% | 10.5% | -0.6% |
X800 PRO | 64.9% | 148.4% | 59.9% | 88.4% | -2.2% |
9800 XT | 78.7% | 169.8% | 46.0% | 102.1% | 1.9% |
9800 PRO | 103.4% | 214.3% | 60.8% | 134.9% | 16.6% |
X800 PRO % Difference
Pixel Fill (Mpp/s) | Texel Fill (Mtp/s) | Geometry 1 Light (M tris/s) | Geometry 8 Lights (M tris/s) | Vertex Shader (FPS) | |
9800 XT | 24.8% | 69.7% | 41.8% | 70.5% | -1.7% |
9800 PRO | 35.2% | 84.3% | 29.5% | 82.9% | 2.5% |
9700 PRO | 53.9% | 114.7% | 42.6% | 112.7% | 17.2% |
For the X800 XT PE, the pixel fill-rate difference to the X800 PRO falls a little shy of its theoretical difference, but this is due to this particular test being very memory bandwidth limited (indeed, the performance differences more closely resemble the memory bandwidth differences) – the same can be said when comparing against the 9800 XT. However, looking at the multi-texture fill-rate test we see performance differences much closer to their theoretical rates as this test isn’t as bandwidth limited.
Looking at the fixed function Geometry tests we can see that the performance difference of the X800 XT PE to the X800 PRO is slightly greater than its theoretical rates would suggest, and the difference to the 9800 XT is lower on the single light test but about at the theoretical difference on the 8 light test. Curiously with the Vertex Shader test the performance of the X800 XT PE is slightly lower than the X800 PRO or the 9800 XT, which is likely to be indicative of some other limitation.
When looking at the X800 PRO we see that it nearly reaches its theoretical fill-rate difference to the 9800 XT with the multi-texturing test, but, again, not in the single texture test due to bandwidth limitations. The theoretical geometry rate differences between the X800 PRO and the 9800 XT are nearly achieved in the 8 light test, but again the Vertex Shader test shows a slightly lower performance for the X800 board than the 9800 XT.
Due to the high bandwidth nature of the 3DMark2001SE fill-rate test, the X800 boards both look like they may be configured with half the pixel pipelines, but with double the texture rate – even though this doesn’t make any sense of the X800 PRO configuration (6x2 would mean it was operating on one and a half quads!) we’ll check the fill-rate performances in another test, in this case MDolenc’s fill-rate tester:
MDolenc fill-rate tester
Pure fillrate | Z pixel rate | Single texture | Dual texture | Triple texture | Quad texture | |
X800 XT PE | 5884.0 | 7859.3 | 4411.7 | 2467.4 | 1749.9 | 1352.6 |
X800 PRO | 3182.7 | 5367.7 | 3046.3 | 1902.6 | 1122.8 | 884.5 |
9800 XT | 2840.9 | 2814.3 | 2747.4 | 1468.2 | 989.7 | 754.6 |
9800 PRO | 2628.7 | 2588.3 | 2538.9 | 1369.2 | 909.9 | 620.8 |
9700 PRO | 2420.7 | 2104.9 | 2273.6 | 1182.0 | 798.8 | 608.7 |
X800 XT PE % Difference
Pure fillrate | Z pixel rate | Single texture | Dual texture | Triple texture | Quad texture | |
X800 PRO | 84.9% | 46.4% | 44.8% | 29.7% | 55.8% | 52.9% |
9800 XT | 107.1% | 179.3% | 60.6% | 68.1% | 76.8% | 79.2% |
9800 PRO | 123.8% | 203.6% | 73.8% | 80.2% | 92.3% | 117.9% |
9700 PRO | 143.1% | 273.4% | 94.0% | 108.7% | 119.1% | 122.2% |
X800 Pro % Difference
Pure fillrate | Z pixel rate | Single texture | Dual texture | Triple texture | Quad texture | |
9800 XT | 12.0% | 90.7% | 10.9% | 29.6% | 13.5% | 17.2% |
9800 PRO | 21.1% | 107.4% | 20.0% | 39.0% | 23.4% | 42.5% |
9700 PRO | 31.5% | 155.0% | 34.0% | 61.0% | 40.6% | 45.3% |
As you can see, the “Pure Fill-Rate” performances, whilst still not being very close to their theoretical rates, are higher than an 8x2 or 6x2 configuration for the X800 XT PE and X800 PRO respectively could achieve, clearly marking these as 16x1 and 12x1 configurations. Again, these are the only configurations that make sense as the chips are operating on quads of pixel pipelines.
You may note that the Z fill-rate gets much closer to its theoretical performance, due to this requiring less bandwidth, however it certainly isn’t greater than the theoretical single texturing fill-rate highlighting that there isn’t an optimized Z/Stencil path in R420.