CPU2006 license: | 4 | Test date: | May-2009 |
---|---|---|---|
Test sponsor: | SGI | Hardware Availability: | Mar-2009 |
Tested by: | SGI | Software Availability: | Feb-2009 |
Hardware | |
---|---|
CPU Name: | Intel Xeon X5570 |
CPU Characteristics: | Quad Core, 2.93 GHz, Intel Turbo Boost Technology up to 3.33 GHz |
CPU MHz: | 2933 |
FPU: | Integrated |
CPU(s) enabled: | 32 cores, 8 chips, 4 cores/chip, 2 threads/core |
CPU(s) orderable: | 1,2 chips per blade, 2-16384 blades |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 256 KB I+D on chip per core |
L3 Cache: | 8 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 192 GB (4 x 12*4GB DDR3-1066 CL7 RDIMMs) |
Disk Subsystem: | 13 TB Lustre Parallel Filesystem; 1 Metadata Server and 6 Object Storage Servers; 96 x 136 GB SAS (Seagate Cheetah, 15000 rpm) |
Other Hardware: | None |
Software | |
---|---|
Operating System: | SUSE Linux Enterprise Server 10 (x86_64) SP2 with patch Linux kernel 20080917, Kernel 2.6.16.60-0.30-smp |
Compiler: | Intel C++ and Fortran Compiler 11.0 for Linux, Build 20090131, Package ID: l_cproc_p_11.0.080, l_cprof_p_11.0.080 |
Auto Parallel: | No |
File System: | Lustre v1.6.7 over DDR InfiniBand |
System State: | Multi-user, run level 3 |
Base Pointers: | 64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | SGI ProPack 6 for Linux Service Pack 2; Binutils 2.18.50.0.7.20080502 |

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

Benchmark | Base Copies | Base Seconds (run 1) | Base Ratio (run 1) | Base Seconds (run 2) | Base Ratio (run 2) | Base Seconds (run 3) | Base Ratio (run 3) | Peak Copies | Peak Seconds (run 1) | Peak Ratio (run 1) | Peak Seconds (run 2) | Peak Ratio (run 2) | Peak Seconds (run 3) | Peak Ratio (run 3) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
410.bwaves | 64 | 1919 | 453 | 1271 | 684 | 1272 | 684 | 32 | 622 | 699 | 622 | 699 | 623 | 698 |
416.gamess | 64 | 1547 | 810 | 1544 | 812 | 1542 | 812 | 32 | 762 | 822 | 762 | 822 | 763 | 821 |
433.milc | 64 | 939 | 626 | 940 | 625 | 939 | 626 | 64 | 943 | 623 | 941 | 624 | 942 | 624 |
434.zeusmp | 64 | 714 | 815 | 714 | 816 | 716 | 813 | 64 | 706 | 825 | 702 | 830 | 708 | 823 |
435.gromacs | 64 | 585 | 781 | 582 | 785 | 583 | 783 | 64 | 563 | 812 | 565 | 809 | 563 | 812 |
436.cactusADM | 64 | 856 | 893 | 852 | 898 | 848 | 901 | 64 | 871 | 878 | 901 | 849 | 920 | 831 |
437.leslie3d | 64 | 1234 | 487 | 1234 | 487 | 1234 | 488 | 32 | 617 | 488 | 616 | 488 | 616 | 488 |
444.namd | 64 | 702 | 731 | 702 | 731 | 700 | 734 | 64 | 691 | 743 | 688 | 747 | 688 | 746 |
447.dealII | 64 | 652 | 1120 | 656 | 1120 | 656 | 1120 | 64 | 610 | 1200 | 603 | 1210 | 605 | 1210 |
450.soplex | 64 | 1004 | 532 | 1004 | 532 | 1005 | 531 | 32 | 477 | 560 | 477 | 560 | 477 | 560 |
453.povray | 64 | 321 | 1060 | 321 | 1060 | 322 | 1060 | 64 | 267 | 1280 | 267 | 1280 | 266 | 1280 |
454.calculix | 64 | 570 | 927 | 570 | 926 | 578 | 914 | 64 | 586 | 901 | 583 | 906 | 584 | 904 |
459.GemsFDTD | 64 | 1573 | 432 | 1572 | 432 | 1572 | 432 | 32 | 768 | 442 | 769 | 442 | 769 | 442 |
465.tonto | 64 | 771 | 817 | 776 | 811 | 785 | 803 | 64 | 758 | 831 | 758 | 831 | 752 | 838 |
470.lbm | 64 | 2055 | 428 | 2056 | 428 | 2056 | 428 | 32 | 991 | 444 | 991 | 444 | 991 | 444 |
481.wrf | 64 | 860 | 831 | 856 | 835 | 856 | 835 | 64 | 860 | 831 | 856 | 835 | 856 | 835 |
482.sphinx3 | 64 | 1621 | 769 | 1622 | 769 | 1627 | 767 | 64 | 1553 | 803 | 1555 | 802 | 1559 | 800 |
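
As a rough illustration of how the per-benchmark medians in the table could be combined, the sketch below takes the three base ratios of each benchmark as transcribed from the rows above, picks the median of each, and forms their geometric mean, which is how SPEC summarizes a suite into a single metric. The helper names `median_ratios` and `geometric_mean` are hypothetical, and because the ratios in the table are already rounded, a value computed this way may differ slightly from an officially reported metric.

```python
import math
from statistics import median

# Base ratios for the three runs of each benchmark, transcribed from the table above.
BASE_RATIOS = {
    "410.bwaves": (453, 684, 684),
    "416.gamess": (810, 812, 812),
    "433.milc": (626, 625, 626),
    "434.zeusmp": (815, 816, 813),
    "435.gromacs": (781, 785, 783),
    "436.cactusADM": (893, 898, 901),
    "437.leslie3d": (487, 487, 488),
    "444.namd": (731, 731, 734),
    "447.dealII": (1120, 1120, 1120),
    "450.soplex": (532, 532, 531),
    "453.povray": (1060, 1060, 1060),
    "454.calculix": (927, 926, 914),
    "459.GemsFDTD": (432, 432, 432),
    "465.tonto": (817, 811, 803),
    "470.lbm": (428, 428, 428),
    "481.wrf": (831, 835, 835),
    "482.sphinx3": (769, 769, 767),
}


def median_ratios(ratios):
    """Pick the median of the three measured ratios for every benchmark."""
    return {name: median(runs) for name, runs in ratios.items()}


def geometric_mean(values):
    """Geometric mean computed via logarithms to avoid large products."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    medians = median_ratios(BASE_RATIOS)
    for name, value in medians.items():
        print(f"{name:15s} {value}")
    print(f"geometric mean of medians: {geometric_mean(medians.values()):.1f}")
```

Using the median makes the summary insensitive to a single outlier run, such as the slow first base run of 410.bwaves.
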
The config file option 'submit' was used. A submit.pl script was used to distribute benchmark copies across the 4 blades and to pin processes to cores using dplace. Each blade runs a separate instance of the operating system.
Adjacent cache line prefetch enabled. System has 4 blades with 2 chips/blade.
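
The actual submit.pl script is not reproduced in this report. The following is only a minimal sketch of one placement policy consistent with the notes above: copies are split evenly across the 4 blades, and each copy is assigned one logical CPU on its blade. The function names and the block-wise assignment are assumptions, and the `dplace -c` prefix is shown purely for illustration. How logical CPU numbers map to cores and hardware threads depends on how the operating system enumerates them, which the report does not state.

```python
from collections import Counter

# Topology taken from the hardware description and notes above.
BLADES = 4
CHIPS_PER_BLADE = 2
CORES_PER_CHIP = 4
THREADS_PER_CORE = 2
LOGICAL_CPUS_PER_BLADE = CHIPS_PER_BLADE * CORES_PER_CHIP * THREADS_PER_CORE  # 16


def place_copy(copy_number, copies):
    """Return (blade, logical_cpu) for one benchmark copy.

    Hypothetical policy: consecutive copy numbers fill one blade before
    moving to the next, and use that blade's lowest-numbered logical CPUs.
    """
    per_blade, remainder = divmod(copies, BLADES)
    if remainder or per_blade > LOGICAL_CPUS_PER_BLADE:
        raise ValueError("copies must divide evenly across blades and fit on them")
    if not 0 <= copy_number < copies:
        raise ValueError("copy_number out of range")
    return copy_number // per_blade, copy_number % per_blade


def pin_prefix(logical_cpu):
    """Illustrative command prefix that pins a process to one logical CPU."""
    return f"dplace -c {logical_cpu}"


if __name__ == "__main__":
    for copies in (64, 32):  # the two copy counts used in the results table
        placements = [place_copy(i, copies) for i in range(copies)]
        per_blade = Counter(blade for blade, _ in placements)
        print(copies, "copies ->", dict(per_blade))
    blade, cpu = place_copy(17, 64)
    print(f"copy 17 of 64: blade {blade}, prefix '{pin_prefix(cpu)}'")
```

Because each blade runs its own operating system instance, logical CPU numbers restart at 0 on every blade, which is why the sketch returns a pair rather than a single global CPU index.
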
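Base Compiler Invocation | |
---|---|
C benchmarks: | icc |
C++ benchmarks: | icpc |
Fortran benchmarks: | ifort |
Benchmarks using both Fortran and C: | icc ifort |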
Base Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
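Base Optimization Flags | |
---|---|
C benchmarks: | -xSSE4.2 -ipo -O3 -no-prec-div -static |
C++ benchmarks: | -xSSE4.2 -ipo -O3 -no-prec-div -static |
Fortran benchmarks: | -xSSE4.2 -ipo -O3 -no-prec-div -static |
Benchmarks using both Fortran and C: | -xSSE4.2 -ipo -O3 -no-prec-div -static |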
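Peak Compiler Invocation | |
---|---|
C benchmarks (except as noted below): | icc |
482.sphinx3: | icc -m32 |
C++ benchmarks (except as noted below): | icpc |
450.soplex: | icpc -m32 |
Fortran benchmarks (except as noted below): | ifort |
437.leslie3d: | ifort -m32 |
Benchmarks using both Fortran and C: | icc ifort |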
Peak Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
Peak Optimization Flags | |
---|---|
433.milc: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -fno-alias |
470.lbm: | -xSSE4.2 -ipo -O3 -no-prec-div -static -opt-prefetch -auto-ilp32 |
482.sphinx3: | -xSSE4.2 -ipo -O3 -no-prec-div -static -unroll2 |
444.namd: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -fno-alias -auto-ilp32 |
447.dealII: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -ansi-alias -scalar-rep- |
450.soplex: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-malloc-options=3 |
453.povray: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll4 -ansi-alias |
410.bwaves: | -xSSE4.2 -ipo -O3 -no-prec-div -static -opt-prefetch |
416.gamess: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -Ob0 -ansi-alias -scalar-rep- |
434.zeusmp: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) |
437.leslie3d: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-malloc-options=3 -opt-prefetch |
459.GemsFDTD: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -Ob0 -opt-prefetch |
465.tonto: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll4 -auto |
435.gromacs: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -opt-prefetch -auto-ilp32 |
436.cactusADM: | -xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -unroll2 -opt-prefetch -auto-ilp32 |
454.calculix: | -xSSE4.2 -ipo -O3 -no-prec-div -static -auto-ilp32 |
481.wrf: | basepeak = yes |
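
Several peak entries above annotate flags with `(pass 1)` or `(pass 2)`, which reflects a two-pass, profile-guided build: the first pass compiles with `-prof-gen` to produce an instrumented binary, and the second pass recompiles with `-prof-use` and the optimization flags. The sketch below shows one way such a flag string could be split into per-pass flag lists. The function name `split_passes` is hypothetical, and treating an unannotated flag as applying to both passes is an assumption rather than something this report states.

```python
import re

# A flag starts with "-" and runs up to whitespace or "(";
# it may be followed directly by a "(pass N)" annotation.
FLAG_RE = re.compile(r"(-[^\s(]+)(?:\(pass ([12])\))?")


def split_passes(flag_string):
    """Split an annotated flag string into {1: [...], 2: [...]}."""
    passes = {1: [], 2: []}
    for flag, pass_no in FLAG_RE.findall(flag_string):
        if pass_no:
            passes[int(pass_no)].append(flag)
        else:
            # Assumption: a flag without an annotation is used in both passes.
            passes[1].append(flag)
            passes[2].append(flag)
    return passes


if __name__ == "__main__":
    # Peak optimization flags for 433.milc, copied from the entry above.
    milc = (
        "-xSSE4.2(pass 2) -prof-gen(pass 1) -ipo(pass 2) -O3(pass 2) "
        "-no-prec-div(pass 2) -static(pass 2) -prof-use(pass 2) -fno-alias"
    )
    for number, flags in split_passes(milc).items():
        print(f"pass {number}: {' '.join(flags)}")
```

Entries without any annotation, such as those for 470.lbm or 454.calculix, describe a single-pass build; under the assumption above the sketch simply returns the same flag list for both keys.
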