P4E needs very long pipeline stages in order to ramp up to high clock speed in the future. It also uses larger cache to compensate the long pipeline penalty. Well, it claims that the branch prediction has improved significantly. It's normal for longer pipeline CPU to perform weaker due to the pipeline latency in every stages, if 1 stage causes 1ns latency, 31 stages causes 31ns latency compare to 20ns latency on northwood. Moreover, longer pipeline means a task can only be accomplished after 31 stages of pipeline, compare to 20 stages of northwood. Therefore, if no enhancement on cache, branch prediction or prefetching, 31 stages of pipeline will definitely slower than 20 stages pipeline CPU at the same clock speed. Main advantage of having long pipeline is to ramp up clock speed , because each task are divided into more stages, where each stage equal to 1 clock cycle.
P4 Northwood vs Prescott GUIDE, P4 "C" vs P4 "E"
Apr 9 2004, 01:41 PM
Quote
0.0165sec
0.64
7 queries
GZIP Disabled