CPU2006 license: | 19 | Test date: | Jun-2008 |
---|---|---|---|
Test sponsor: | Fujitsu Limited | Hardware Availability: | Jul-2008 |
Tested by: | Sun Microsystems | Software Availability: | Jul-2008 |
Hardware | |
---|---|
CPU Name: | SPARC64 VII |
CPU Characteristics: | |
CPU MHz: | 2520 |
FPU: | Integrated |
CPU(s) enabled: | 64 cores, 16 chips, 4 cores/chip, 2 threads/core |
CPU(s) orderable: | 1 to 4 CMUs; each CMU contains 2 or 4 chips |
Primary Cache: | 64 KB I + 64 KB D on chip per core |
Secondary Cache: | 6 MB I+D on chip per chip |
L3 Cache: | None |
Other Cache: | None |
Memory: | 256 GB (128 x 2 GB) |
Disk Subsystem: | 805 GB RAID 0 Solaris Volume 12 x Fujitsu 73 GB 10000 RPM SAS Stripe interlace size 512 Kbytes |
Other Hardware: | None |
Software | |
---|---|
Operating System: | Solaris 10 5/08 with Patch 137111-03 |
Compiler: | Sun Studio 12 with patches 124867-06, 124861-07, 124863-05, 127000-05 (see patch information below) |
Auto Parallel: | Yes |
File System: | ufs |
System State: | Default |
Base Pointers: | 32-bit |
Peak Pointers: | 32/64-bit |
Other Software: | None |
Benchmark | Base | Peak | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
410.bwaves | 127 | 3490 | 495 | 3459 | 499 | 3455 | 499 | 127 | 3417 | 505 | 3420 | 505 | 3422 | 504 |
416.gamess | 127 | 3174 | 783 | 3141 | 792 | 3150 | 789 | 127 | 3071 | 810 | 3093 | 804 | 3018 | 824 |
433.milc | 127 | 4700 | 248 | 4703 | 248 | 4692 | 248 | 127 | 4655 | 250 | 4651 | 251 | 4646 | 251 |
434.zeusmp | 127 | 1825 | 633 | 1811 | 638 | 1805 | 640 | 127 | 1825 | 633 | 1811 | 638 | 1805 | 640 |
435.gromacs | 127 | 1067 | 850 | 1063 | 853 | 1068 | 849 | 127 | 971 | 933 | 967 | 938 | 973 | 932 |
436.cactusADM | 127 | 2122 | 715 | 2085 | 728 | 2127 | 713 | 64 | 925 | 826 | 928 | 824 | 926 | 826 |
437.leslie3d | 127 | 3660 | 326 | 3659 | 326 | 3688 | 324 | 64 | 1788 | 336 | 1787 | 337 | 1788 | 337 |
444.namd | 127 | 1068 | 954 | 1063 | 958 | 1061 | 960 | 127 | 1072 | 950 | 1059 | 962 | 1052 | 969 |
447.dealII | 127 | 1431 | 1020 | 1417 | 1030 | 1434 | 1010 | 127 | 1386 | 1050 | 1389 | 1050 | 1385 | 1050 |
450.soplex | 127 | 3749 | 283 | 3755 | 282 | 3736 | 284 | 127 | 3763 | 281 | 3725 | 284 | 3716 | 285 |
453.povray | 127 | 812 | 832 | 813 | 831 | 837 | 807 | 127 | 600 | 1130 | 591 | 1140 | 615 | 1100 |
454.calculix | 127 | 1076 | 974 | 1076 | 974 | 1077 | 973 | 127 | 1071 | 978 | 1085 | 966 | 1071 | 978 |
459.GemsFDTD | 127 | 5530 | 244 | 5536 | 243 | 5534 | 243 | 127 | 5393 | 250 | 5394 | 250 | 5392 | 250 |
465.tonto | 127 | 1891 | 661 | 1895 | 659 | 1895 | 659 | 127 | 1694 | 738 | 1708 | 732 | 1689 | 740 |
470.lbm | 127 | 6334 | 276 | 6344 | 275 | 6334 | 275 | 1 | 32.0 | 430 | 31.9 | 431 | 31.8 | 432 |
481.wrf | 127 | 2768 | 513 | 2758 | 514 | 2757 | 515 | 63 | 1335 | 527 | 1335 | 527 | 1335 | 527 |
482.sphinx3 | 127 | 5774 | 429 | 5876 | 421 | 5781 | 428 | 127 | 5528 | 448 | 5498 | 450 | 5487 | 451 |
Sun Studio compiler patches are available at http://developers.sun.com/sunstudio/downloads/patches/ss12_patches.jsp
Processes were assigned to specific processors using 'pbind' commands. The config file option 'submit' was used, along with a list of processors in the 'BIND' variable, to generate the pbind commands. (For details, please see the config file.)
Environment Variable Settings: The maximum number of threads a program can create was set with: OMP_NUM_THREADS=127 Program threads were bound to processors with: SUNW_MP_PROCBIND="1-127" Behavior of parallel threads was set with: SUNW_MP_THR_IDLE=SPIN SPIN specifies that an idle thread should spin while waiting at barrier or waiting for new parallel regions to work on. System Tunables (/etc/system parameters): tune_t_fsflushr=10 Controls how many seconds elapse between runs of the page flush daemon, fsflush. autoup=300 Causes pages older than the listed number of seconds to be written by fsflush. bufhwm=3000 Memory byte limit for caching I/O buffers segmap_percent=3 Set maximum percent memory for file system cache lpg_alloc_prefer=1 Set lgroup page allocation to strongly prefer local pages Other System Settings: The webconsole service was turned off using svcadm disable webconsole
Memory is 8-way interleaved by filling all slots with the same capacity DIMMs. This result is measured on a Sun SPARC Enterprise M8000 Server. Note that the Sun SPARC Enterprise M8000 and Fujitsu SPARC Enterprise M8000 are electrically equivalent.
cc |
CC |
f90 |
cc f90 |
-fast -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 -xalias_level=std -xprefetch_auto_type=indirect_array_access |
-xdepend -library=stlport4 -fast -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 -xalias_level=compatible |
-fast -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 |
-fast(cc) -fast(f90) -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 -xalias_level=std -xprefetch_auto_type=indirect_array_access |
-xjobs=16 -V -# |
-xjobs=16 -verbose=diags,version |
-xjobs=16 -V -v |
-xjobs=16 -V -# -v |
cc |
CC |
f90 |
cc f90 |
444.namd: | -xdepend -library=stlport4 -fast -xpagesize=4M -xalias_level=compatible -fma=fused -xprefetch=latx:7 |
447.dealII: | -xdepend -library=stlport4 -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast -xpagesize=4M -xalias_level=compatible -xipo=2 -xrestrict -fma=fused |
450.soplex: | -xdepend -library=stlport4 -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast -xpagesize=4M -xalias_level=compatible -xipo=2 -xprefetch=no -fsimple=0 -xrestrict |
453.povray: | Same as 447.dealII |
410.bwaves: | -fast -xpagesize=4M -xipo=2 -xprefetch_level=2 -fma=fused |
416.gamess: | -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast -xpagesize=4M -xipo=2 -xprefetch_level=2 -fma=fused |
434.zeusmp: | basepeak = yes |
437.leslie3d: | -fast -xpagesize=4M -fma=fused -xipo=2 -xprefetch=latx:4 -xprefetch_level=2 |
459.GemsFDTD: | -fast -xpagesize=4M -fsimple=1 -xprefetch=no -fma=fused |
465.tonto: | -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast -xpagesize=4M -xipo=2 -xprefetch=no -xarch=generic -lfast |
435.gromacs: | -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast(cc) -fast(f90) -xpagesize=4M -xipo=2 -xarch=generic -xchip=generic -fsimple=0 -xunroll=5 -xprefetch=latx:0.5 |
436.cactusADM: | -fast(cc) -fast(f90) -xpagesize=4M -xipo=2 -fma=fused |
454.calculix: | -fast(cc) -fast(f90) -xpagesize=4M -xipo=2 -xprefetch_level=3 -fma=fused -xprefetch=latx:3.0 -xalias_level=std |
481.wrf: | -fast(cc) -fast(f90) -xpagesize=4M -xipo=2 -xprefetch_level=3 -fma=fused -xunroll=8 |