Logo   Banner   TopRight
TopUnder
Transparent
AMD Phenom X4 9850 B3 Revision
Transparent
Date: Mar 27, 2008
Section:Processors
Author: Marco Chiappetta
Transparent
Introduction and Specifications

 

It doesn't take a genius to figure out that AMD's initial Phenom processor launch didn't go exactly as planned.  The first batch of retail-ready Phenom processors were clocked lower than expectations and in the days leading up to the launch, the now infamous TLB bug reared its ugly head.  If you're not aware, all initial Phenom and Opteron quad-core processors based on the Barcelona architecture were plagued by an errata that could cause a system crash under certain circumstances.  The errata involves the processor's translation lookaside buffer or "TLB" and L3 cache.  Keep in mind, every processor has some sort of errata, but it usually isn't cause for concern, nor does it manifest itself in any real-world application performance.  In many cases processor errata can be worked-around via BIOS or software patches and end users are none the wiser.  With the Phenom's TLB errata though, working around the problem involved a significant performance penalty.

Obviously aware of the problem, AMD's engineers immediately went to work crafting a new revision of the native quad-core Phenom silicon that resolved the TLB errata.  And today that new revision is ready.  Quad-core Phenom processors based on the new B3 revision silicon should be hitting store shelves in the not too distant future.  In addition to resolving the TLB bug, AMD is also unveiling some higher clocked Phenoms and tri-core and low-power models too.

We've got AMD's latest and greatest Phenom processor, the Phenom X4 9850, on the test bench and have our findings posted for you here.  First up, the specifications...
 

 
AMD Phenom Retail Box

AMD Phenom Processors 
Specifications and Features

Model / Processor Frequency: AMD Phenom Processor Model X4 9850 / 2.5GHz
L1 Cache Sizes: 64K of L1 instruction and 64K of L1 data cache per core (512KB total L1 per processor)
L2 Cache Sizes: 512KB of L2 data cache per core (2MB total L2 per processor)
L3 Cache Size: 2MB
Memory Controller Type: Integrated 128-bit wide memory controller, capable of being configured for dual 64-bit channels for simultaneous read/writes
Memory Controller Frequency: Up to 2.0GHz with Dual Dynamic Power Management
Types of Memory: Support for unregistered DIMMs up to PC2 8500 (DDR2-1066MHz)
HyperTransport 3.0: One 16-bit/16-bit link @ up to 4000MHz full duplex
Total Processor Bandwidth: Up to 31.5 GB/s bandwidth
Packaging: Socket AM2+ 940-pin organic micro pin grid array (micro-PGA) (backward compatible with Socket AM2)
Fab location: AMD's Fab 36 wafer fabrication facilities in Dresden, Germany
Process Technology: 65nm (.065-micron) Silicon on Insulator (SOI)
Approximate Transistor count: approx. 450 million (65nm)
Approximate Die Size: 285 mm2 (65nm)
Nominal Voltage: 1.1-1.25 Volts
Max Ambient Case Temp: 70 degrees Celsius
Max TDP: 125 Watts
ACP: *to be announced after launch
Future Memory Controller Note: Future 45nm processors versions are planned to include support for DDR3 memory


  
AMD Phenom X4 9850

Save for its unique markings, the new AMD Phenom X4 9850 looks exactly like any other socket AM2+ processor outfitted with AMD's standard heat spreader.  The chip uses the same packaging and socket as current Phenom processors; it is only the silicon underneath that has changed.

The AMD Phenom X4 9850 is a 2.5GHz processor manufactured using AMD's 65nm Silicon on Insulator process technology.  The chip has a Max TDP of 125W and it is a "Black Edition" product, which means its multiplier is unlocked for more flexible overclocking.  New to the Phenom X4 9850 is full support for a 2.0GHz memory controller and HT 3.0 frequency with Dual Dynamic Power Management technology.  Although AMD had previously mentioned support for a 2GHz memory controller and HT3.0 link frequency, the memory controller in the initial batch of Phenoms clocked in at 1.8GHz.  Also note the "X" designation has returned to the product name.  With the new tri-core processors about to hit as well, AMD felt this additional designation would make it easier to discern the number of cores available in the CPU.  Another move designed to help differentiate their product offerings is the "50" in the model number.  To keep things simple, a "50" in the last two digits of the model number designates a processor based on the B3 revision core.

Transparent
Vital Signs and Overclocking



As we've mentioned, AMD’s new Phenom X4 9850 processor look just like current Phenom processors.  It utilizes the same packaging and pin configuration.  But underneath that unassuming heat-spreader is a new silicon revision that fixes the TLB bug present on initial Phenom processors.


      
Details from CPU-Z with the Phenom 9700


The latest version of CPU-Z correctly identifies and outlined the Phenom X4 9850's inner workings.  As you can see, the processor is based on the Agena core with socket AM2+ packaging.  It is clocked at 2.5GHz (12.5 x 200MHz), although the clock speed is reported as 1255MHz due to AMD's Cool'n'Quiet feature being enabled.  Our particular chip is stepping ‘DR-B3’ and supports all of the instructions sets listed in the aptly names ‘instructions’ section.  The processor’s 2MB L2 cache configuration (512KB per core) is 16-way set associative, but the processors 2MB of shared L3-cache is 32-way set associative.  With the exception of the stepping and clock speed, nothing else has changed with this revision of Phenom.

Overclocking The Phenom X4 9850
Quad-Core Flat-Out

We know many of you are wondering just how much clock speed headroom the Phenom X4 9850 has left under the hood, so we spent some time overclocking our chip using a Gigabyte 790FX-chipset based motherboard.  We simply upped the processor voltage and increased the multiplier until our test machine was no longer stable.  A basic Thermaltake air cooler, similar to AMD's stock PIB heatsink was used.


AMD Phenom X4 9850 Overclocked to 2.8GHz

With a minor bump in voltage up to 1.4v, we were able to take our Phenom X4 9850 processor up from its default clock speed of 2.5GHz to a little over 2.8GHz, an increase of about 300MHz or 12%.  We suspect that with more elaborate cooling and perhaps a little more voltage, even higher frequencies are possible.  And remember, because this is a Black Edition processors, anyone who buys one of these chips at retail will also be able to alter its multiplier exactly as we did here.  AMD's excellent Overdrive software can also be used to overclock the chip, for those less inclined to fiddle around in the system BIOS.

Transparent
Our Test Systems and SANDRA

 

How We Configured Our Test Systems: When configuring our test systems for this article, we first entered their respective system BIOSes and set each board to its "Optimized" or "High performance Defaults". We then saved the settings, re-entered the BIOS and set memory timings for either DDR2-1066 (AMD) with 5,5,5,15 timings or DDR3-1333 with 7,7,7,20 timings (Intel). The hard drives were then formatted, and Windows Vista Ultimate was installed. When the Windows installation was complete, we updated the OS, and installed the drivers necessary for our components. Auto-Updating and Windows Defender were then disabled and we installed all of our benchmarking software, defragged the hard drives, and ran all of the tests.


 HotHardware's Test Systems
 Intel and AMD - Head To Head 

System 1:
Core 2 Extreme QX9650
(3.0GHz - Quad-Core)
Core 2 Quad Q6600
(2.40GHz - Quad-Core) 

Asus P5E3 Premium
(X48 Chipset)

2x1GB Corsair DDR3-1800
CL 7-7-7-20 - DDR3-1333

GeForce 8800 GTX
On-Board Ethernet
On-board Audio

WD740 "Raptor" HD
10,000 RPM SATA

Windows Vista Ultimate
NVIDIA Forceware v163.75
DirectX Redist (November 2007)

System 2:
AMD Phenom X4 9850
(2.5GHz)
AMD Phenom 9600
(2.3GHz) 

Gigabyte GA-MA790FX-DQ6
(AMD 790FX Chipset)

2x1GB Corsair PC2-8500
CL 5-5-5-15 - DDR2-1066

GeForce 8800 GTX
On-Board Ethernet
On-board Audio

WD740 "Raptor" HD
10,000 RPM SATA

Windows Vista Ultimate
NVIDIA Forceware v163.75
DirectX Redist (November 2007)

 

 Preliminary Testing with SiSoft SANDRA XII
 Synthetic Benchmarks


We began our testing with SiSoftware's SANDRA XII, the System ANalyzer, Diagnostic and Reporting Assistant. We ran three of the built-in subsystem tests that partially comprise the SANDRA XII suite with AMD's new Phenom X4 9850 processor (CPU Arithmetic, Memory Bandwidth, and Memory Latency) and with a B2 revision Phenom clocked at the exact same frequency for reference.  All of the scores reported below were taken with the processors running a clock speed of 2.5GHz, with 2GB of DDR2-1066 RAM running in 128-bit ganged mode.  Please note however, the Phenom X4 9850 has a faster memory controller frequency of 2.0GHz, the B2 revision chip had a 1.8GHz memory controller clock.



Phenom B2 Stepping @ 2.5GHz
CPU Arithmetic
TLB Patch Enabled


Phenom B2 Stepping @ 2.5GHz

Memory Bandwidth
TLB Patch Enabled


Phenom B2 Stepping @ 2.5GHz

Memory Latency
TLB Patch Enabled


 

Phenom B3 Stepping @ 2.5GHz
CPU Arithmetic
 


 

Phenom B3 Stepping @ 2.5GHz
Memory Bandwidth
 

 


Phenom B3 Stepping @ 2.5GHz
Memory Latency
 

If you compare the perfomance of the B2 revision chip with the TLB patch enabled to the new B3 revision chip, you'll notice some dramatic differences.  SANDRA's CPU arithmetic benchmark reports similar scores for both chips.  The Memory Bandwidth test, however, shows the B3 chip with a huge advantage (8.4GB/s vs. 11.2GB/s).  And the latency test also shows the B3 chip with a huge edge in latency (232ns vs. 83ns).  These large deltas can be directly attributed to the TLB errats which has been resolved in the B3 revision Phenom X4 9850.
Transparent
PCMark Vantage

We ran a handful processors and platforms, including the new Phenom X4 9850, through Futuremark’s latest system performance metric built especially for Windows Vista, PCMark Vantage. PCMark Vantage runs through a host of different usage scenarios to simulate different types of workloads including High Definition TV and movie playback and manipulation, gaming, image editing and manipulation, music compression, communications, and productivity.  Most of the tests are multi-threaded as well, so the tests can exploit the additional resources offered by a quad-core CPU.

Futuremark PCMark Vantage
Simulated Application Performance




From this point forward in this article, we'll be comparing the performance of the new AMD Phenom X4 9850 processor to a couple of Intel quad-core processors and a trio of other Phenom configurations.  To get as close to an apples to apples comparison between the B3 and B2 revision of Phenom, we overclocked a Phenom 9600 Black Edition to 2.5GHz and ran it though our benchmark suite with the TLB patch both enabled and disabled.

As you can see, the new Phenom X4 9850 is clearly AMD's fastest CPU to date.  It significantly outpaces the Phenom 9600 and is as fast of faster than a B2 revision chip, clock for clock. The Phenom X4 9850 also outpaces the Intel Core 2 Quad Q6600 in a majority of the tests here, which bodes well for the Phenom considering its lower price point.

Transparent
LAME MT and Kribibench v1.1

In our custom LAME MT MP3 encoding test, we convert a large WAV file to the MP3 format, which is a popular scenario that many end users work with on a day-to-day basis to provide portability and storage of their digital audio content.  LAME is an open-source mid to high bit-rate and VBR (variable bit rate) MP3 audio encoder that is used widely around the world in a multitude of third party applications.

LAME MT
Audio Encoding

In this test, we created our own 223MB WAV file (a hallucinogenically-induced Grateful Dead jam) and converted it to the MP3 format using the multi-thread capable LAME MT application in single and multi-thread modes. Processing times are recorded below, listed in seconds. Once again, shorter times equate to better performance. 

The new Phenom X4 9850 and similarly clocked B2 revision chips all performed at the exact same level, whether or not the TLB patch was enabled.  As expected the higher-clocked X4 9850 was also faster than the Phenom 9600, and comes within striking distance of the performance offered by the Core 2 Quad Q6600.

Kribibench v1.1
CPU-Bound 3D Rendering

For this next batch of tests, we ran Kribibench v1.1, a 3D rendering benchmark produced by the folks at Adept Development.  Kribibench is an SSE aware software renderer where a 3D model is rendered and animated by the host CPU and the average frame rate is reported.  We used two of the included models with this benchmark: a "Sponge Explode" model consisting of over 19.2 million polygons and the test suite's "Ultra" model that is comprised of over 16 billion polys.

Once again, it is clear that the Phenom X4 9850 is AMD's fastest desktop processor to date.  It outpaces all of the other AMD-built chips to some degree, and just edges out Intel's Core 2 Quad Q6600.

Transparent
Cinebench R10 and 3DMark06

Cinebench R10 is an OpenGL 3D rendering performance test based on Cinema 4D. Cinema 4D from Maxon is a 3D rendering and animation tool suite used by 3D animation houses and producers like Sony Animation and many others.  It's very demanding of system processor resources and is an excellent gauge of pure computational throughput.

Cinebench R10
3D Rendering

This is a multi-threaded, multi-processor aware benchmark that renders a single 3D scene and tracks the length of the entire process. The time it took each test system to render the entire scene is represented in the graph below, listed in seconds.



Cinebench R10 shows the B3 revision Phenom X4 9850 performing right on par with a similarly clocked B2 revision chip, but just behind the Core 2 Duo Q6600.

Futuremark 3DMark06
Synthetic DirectX Gaming

3DMark06's built-in CPU test is a multi-threaded DirectX gaming metric that's useful for comparing relative performance between similarly equipped systems.  This test consists of two different 3D scenes that are processed with a software renderer that is dependent on the host CPU's performance.  Calculations that are normally reserved for your 3D accelerator are instead sent to the CPU for processing and rendering.  The frame-rate generated in each test is used to determine the final score.


3DMark06's CPU performance module tells essentially the same story as Cinebench.  The Phenom X4 9850 is slightly faster than the similarly clocked B2 revision chip, and just a notch behind the Core 2 Duo Q6600.

Transparent
Gaming: Crysis and F.E.A.R.


For our next set of tests, we moved on to some in-game benchmarking with Crysis and F.E.A.R. When testing processors with Crysis or F.E.A.R., we drop the resolution to 800x600, and reduce all of the in-game graphical options to their minimum values to isolate CPU and memory performance as much as possible.  However, the in-game effects, which control the level of detail for the games' physics engines and particle systems, are left at their maximum values, since these actually do place some load on the CPU rather than GPU.

Low-Resolution Gaming: Crysis and F.E.A.R.
Taking the GPU out of the Equation




In both F.E.A.R. and Crysis, the new AMD Phenom X4 9850 processor was somewhat faster than the similarly clocked B2 revision chip.  Also note the huge performance penalty associated with running a B2 revision chip with the TLB patch enabled.  With the patch enabled a Phenom 9600, clocked 200MHz below the 2.5GHz chip is actually faster to a large degree.  In comparison to the Intel processors, the Phenom X4 9850 smokes the Q6600 in F.E.A.R., but falls behind in Crysis.

Transparent
Total System Power Consumption


Before we bring this article to a close, we'd like to cover a few final data points. Throughout all of our benchmarking and testing, we monitored how much power our test systems were consuming using a power meter. Our goal was to give you all an idea as to how much power each configuration used while idling and under a heavy workload. Please keep in mind that we were testing total system power consumption at the outlet here, not just the power being drawn by the motherboards alone.

Total System Power Consumption
Tested at the Outlet





Our power consumption testing yielded some interesting results.  First off, we found that a B2 revision Phenom actually consumed more power with the TLB patch enabled, then it did disabled.  Even though a large part of the chip is essentially going unused with the patch, the processor still managed to use more power.  If we shift focus to the B3 revision Phenom X4 9850, you'll see the chip consumed less power than B2 while idling, but a bit more while under load.  We suspect this is due to the Phenom X4 9850's faster memory controller and HT link and the relative newness of B3 silicon.  Over time, we suspect power consumption to drop as AMD tweaks their manufacturing process.

Transparent
Our Summary and Conclusion



Performance Summary: The Phenom X4 9850 is AMD's most powerful desktop processor to date.  Due to the processor's higher clock speed and revised silicon that eliminates the TLB errata, the Phenom X4 9850 is measurably faster than all of AMD's previously released Phenom processors.  In comparison to Intel's closest competitor, the Phenom X4 9850 competes well, trading benchmark victories with the 2.4GHz Core 2 Quad Q6600.  Keep in mind though, Intel is updating their line-up to include Penryn-based Core 2 products across the board and higher performing, 45nm Yorkfield-based quad-core processors should be hitting store shelves very soon.  In fact, the Q9300 has already showed up at some on-line retailers for about $290.





Although the Phenom X4 9850 doesn't propel AMD to the head of the pack in terms of performance, it is a significant step forward for the company.  AMD can now put the TLB issue behind them and focus on ramping clock speeds and their impending transition to a 45nm manufacturing process.  Until all that happens, the Phenom X4 9850 allows AMD to compete better with Intel's offerings and eliminates a roadblock that was preventing some AMD faithfuls from migrating to a Phenom CPU.  And when you consider the total platform, AMD is in a pretty good position.  DDR2 RAM is dirt cheap at the moment, and AMD 770 and 790FX motherboards are priced anywhere between $100 and $220, not to mention the affordable 780G.  In addition, the list of older socket AM2 motherboards that support Phenom is growing all the time.  Drop in CPU upgrades sure are nice when a user doesn't have the funds to overhaul their entire system.




Today isn't all about the Phenom X4 9850, however.  AMD is also announcing some lower clocked Phenom 50-series processors based on B3 revision silicon, new tri-cores, and even a low-power 65E quad-core.  Pricing and availability for all of the chips is listed in the chart above.  Please note, not all of the chips will hit the channel.  Some are reserved for OEMs only.

Now that AMD has put the TLB bug behind them, we're hoping the company switches on the afterburners and ramps up Phenom clock speeds to better compete at the high end.  Although the vast majority of processors sold are at or below the Phenom X4 9850's price point, as enthusiasts, we'd still like to see AMD work some magic and make a go at the performance crown.  It will obviously take a ton of work, but they've done it before and for competition's sake, we hope they can do it again.


 

  • No More TLB Bug!
  • Solid Performance
  • Some overclocking headroom
  • Higher Clock Speed
  • Still not as fast as Intel
  • Relatively High Power Consumption



Content Property of HotHardware.com