{"id":2387,"date":"2024-06-04T12:26:44","date_gmt":"2024-06-04T12:26:44","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=2387"},"modified":"2024-06-07T00:45:01","modified_gmt":"2024-06-07T00:45:01","slug":"525-x264_r","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/cpu2017\/525-x264_r\/","title":{"rendered":"525.x264_r"},"content":{"rendered":"\n<p>x264 is a SPEC CPU(R) benchmark written in C and described <a href=\"https:\/\/spec.org\/cpu2017\/Docs\/benchmarks\/525.x264_r.html\">here<\/a>. The workload runs on all logical cores<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-23.png\" alt=\"\" class=\"wp-image-2458\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-23.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-23-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-23-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>Topdown profile shows a somewhat balanced workload with moderate retirement rate.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-24.png\" alt=\"\" class=\"wp-image-2459\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-24.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-24-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-24-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics on 7840 show show a higher floating point ratio for an intrate benchmark. Otherwise cpu and memory are similar in stalls.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              427.195\non_cpu               0.937          # 15.00 \/ 16 cores\nutime                6338.573\nstime                68.079\nnvcsw                18688          # 22.00%\nnivcsw               66257          # 78.00%\ninblock              0              # 0.00\/sec\nonblock              7014200        # 16419.20\/sec\ncpu-clock            6407092625988  # 6407.093 seconds\ntask-clock           6407153622006  # 6407.154 seconds\npage faults          10148908       # 1583.996\/sec\ncontext switches     82472          # 12.872\/sec\ncpu migrations       1116           # 0.174\/sec\nmajor page faults    5579           # 0.871\/sec\nminor page faults    10143329       # 1583.126\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             2682142543472  # 66.296 branches per 1000 inst\nbranch misses        74076046593    # 2.76% branch miss\nconditional          1961749943694  # 48.490 conditional branches per 1000 inst\nindirect             155574880291   # 3.845 indirect branches per 1000 inst\ncpu-cycles           25435574812907 # 3.69 GHz\ninstructions         40448932603747 # 1.59 IPC\nslots                50891919544002 #\nretiring             13910817499177 # 27.3% (38.0%)\n-- ucode             45352668807    #     0.1%\n-- fastpath          13865464830370 #    27.2%\nfrontend             7678398786243  # 15.1% (21.0%)\n-- latency           5698233394614  #    11.2%\n-- bandwidth         1980165391629  #     3.9%\nbackend              14139056467200 # 27.8% (38.6%)\n-- cpu               5826165098505  #    11.4%\n-- memory            8312891368695  #    16.3%\nspeculation          922320367797   #  1.8% ( 2.5%)\n-- branch mispredict 906767541060   #     1.8%\n-- pipeline restart  15552826737    #     0.0%\nsmt-contention       14241291859119 # 28.0% ( 0.0%)\ncpu-cycles           25475585769439 # 3.70 GHz\ninstructions         40431005409561 # 1.59 IPC\ninstructions         13491461360687 # 37.180 l2 access per 1000 inst\nl2 hit from l1       427130924382   # 6.16% l2 miss\nl2 miss from l1      16902091052    #\nl2 hit from l2 pf    60456329471    #\nl3 hit from l2 pf    4799346465     #\nl3 miss from l2 pf   9222002490     #\ninstructions         13481889986127 # 189.560 float per 1000 inst\nfloat 512            817            # 0.000 AVX-512 per 1000 inst\nfloat 256            948887398      # 0.070 AVX-256 per 1000 inst\nfloat 128            2554674131243  # 189.489 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         221            # 0.000 scalar per 1000 inst\ninstructions         40452557815164 #\nopcache              4919130849142  # 121.602 opcache per 1000 inst\nopcache miss         1406572525790  # 28.6% opcache miss rate\nl1 dTLB miss         155777782851   # 3.851 L1 dTLB per 1000 inst\nl2 dTLB miss         1759067227     # 0.043 L2 dTLB per 1000 inst\ninstructions         40452640554767 #\nicache               1733988443631  # 42.865 icache per 1000 inst\nicache miss          736040598315   # 42.4% icache miss rate\nl1 iTLB miss         42629769116    # 1.054 L1 iTLB per 1000 inst\nl2 iTLB miss         0              # 0.000 L2 iTLB per 1000 inst\ntlb flush            231430         # 0.000 TLB flush per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Process tree shows time spent in x264_r_base.mev with ~3 invocations per benchmark.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>2533 processes\n\t144 x264_r_base.mev       6253.29    45.72\n\t458 specperl                52.11     7.35\n\t432 imagevalidate_5          8.10     1.67\n\t  1 ldecod_r_base.m          6.71     3.45\n\t 46 specinvoke               0.01     0.14\n\t  1 lsb_release              0.01     0.00\n\t  3 clang                    0.00     0.05\n\t 10 ps                       0.00     0.02\n\t1091 sh                       0.00     0.00\n\t145 bash                     0.00     0.00\n\t 54 specrxp                  0.00     0.00\n\t 24 cat                      0.00     0.00\n\t 21 grep                     0.00     0.00\n\t 12 uniq                     0.00     0.00\n\t 11 sort                     0.00     0.00\n\t 10 expand                   0.00     0.00\n\t  9 specmake                 0.00     0.00\n\t  7 pwd                      0.00     0.00\n\t  6 rm                       0.00     0.00\n\t  5 basename                 0.00     0.00\n\t  5 systemctl                0.00     0.00\n\t  4 specpp                   0.00     0.00\n\t  4 uname                    0.00     0.00\n\t  3 dirname                  0.00     0.00\n\t  3 dmidecode                0.00     0.00\n\t  3 lscpu                    0.00     0.00\n\t  2 df                       0.00     0.00\n\t  2 dpkg                     0.00     0.00\n\t  2 runcpu                   0.00     0.00\n\t  2 specsha512sum            0.00     0.00\n\t  2 specxz                   0.00     0.00\n\t  2 who                      0.00     0.00\n\t  1 cpupower                 0.00     0.00\n\t  1 head                     0.00     0.00\n\t  1 logname                  0.00     0.00\n\t  1 ls                       0.00     0.00\n\t  1 numactl                  0.00     0.00\n\t  1 sysctl                   0.00     0.00\n\t  1 w                        0.00     0.00\n\t  1 wc                       0.00     0.00\n\t  1 which                    0.00     0.00\n0 processes running\n53 maximum processes\n<\/code><\/pre>\n\n\n\n<p>specinvoke fires up separate copies on each logical core.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>    94979) specinvoke       cpu=9 start=21.10 finish=152.44\n      94981) sh               cpu=0 start=21.10 finish=41.79\n        94991) bash             cpu=0 start=21.10 finish=41.78\n          95013) x264_r_base.mev  cpu=0 start=21.11 finish=41.77\n      94982) sh               cpu=1 start=21.10 finish=41.62\n        94992) bash             cpu=1 start=21.10 finish=41.62\n          95017) x264_r_base.mev  cpu=1 start=21.11 finish=41.60\n      94983) sh               cpu=2 start=21.10 finish=41.95\n        94995) bash             cpu=2 start=21.10 finish=41.95\n          95015) x264_r_base.mev  cpu=2 start=21.11 finish=41.93\n      94984) sh               cpu=3 start=21.10 finish=41.68\n        94996) bash             cpu=3 start=21.10 finish=41.68\n          95022) x264_r_base.mev  cpu=3 start=21.11 finish=41.66\n      94985) sh               cpu=4 start=21.10 finish=41.86\n        94994) bash             cpu=4 start=21.10 finish=41.86\n          95014) x264_r_base.mev  cpu=4 start=21.10 finish=41.84\n      94986) sh               cpu=5 start=21.10 finish=41.95\n        94998) bash             cpu=5 start=21.10 finish=41.95\n          95016) x264_r_base.mev  cpu=5 start=21.11 finish=41.92\n      94987) sh               cpu=6 start=21.10 finish=41.79\n        95007) bash             cpu=6 start=21.10 finish=41.79\n          95027) x264_r_base.mev  cpu=6 start=21.11 finish=41.77\n      94988) sh               cpu=7 start=21.10 finish=41.94\n        94999) bash             cpu=7 start=21.10 finish=41.94\n          95021) x264_r_base.mev  cpu=7 start=21.11 finish=41.92\n      94989) sh               cpu=13 start=21.10 finish=41.79\n        95001) bash             cpu=8 start=21.10 finish=41.79\n          95019) x264_r_base.mev  cpu=8 start=21.11 finish=41.77\n      94990) sh               cpu=9 start=21.10 finish=41.48\n        95002) bash             cpu=9 start=21.10 finish=41.48\n          95018) x264_r_base.mev  cpu=9 start=21.11 finish=41.46\n      94993) sh               cpu=10 start=21.10 finish=41.95\n        95004) bash             cpu=10 start=21.10 finish=41.95\n          95020) x264_r_base.mev  cpu=10 start=21.11 finish=41.93\n      94997) sh               cpu=11 start=21.10 finish=41.68\n        95010) bash             cpu=11 start=21.10 finish=41.68\n          95024) x264_r_base.mev  cpu=11 start=21.11 finish=41.66\n      95000) sh               cpu=12 start=21.10 finish=41.86\n        95008) bash             cpu=12 start=21.10 finish=41.86\n          95025) x264_r_base.mev  cpu=12 start=21.11 finish=41.84\n      95003) sh               cpu=13 start=21.10 finish=41.95\n        95009) bash             cpu=13 start=21.10 finish=41.95\n          95023) x264_r_base.mev  cpu=13 start=21.11 finish=41.92\n      95005) sh               cpu=14 start=21.10 finish=41.79\n        95011) bash             cpu=14 start=21.10 finish=41.79\n          95026) x264_r_base.mev  cpu=14 start=21.11 finish=41.77\n      95006) sh               cpu=15 start=21.10 finish=41.94\n        95012) bash             cpu=15 start=21.10 finish=41.94\n          95028) x264_r_base.mev  cpu=15 start=21.11 finish=41.92\n      95031) sh               cpu=9 start=41.48 finish=94.57\n        95032) bash             cpu=9 start=41.48 finish=94.57\n          95033) x264_r_base.mev  cpu=9 start=41.48 finish=94.56\n      95034) sh               cpu=1 start=41.62 finish=94.64\n        95035) bash             cpu=1 start=41.62 finish=94.64\n          95036) x264_r_base.mev  cpu=1 start=41.63 finish=94.64\n      95037) sh               cpu=3 start=41.68 finish=94.84\n        95038) bash             cpu=3 start=41.68 finish=94.84\n          95040) x264_r_base.mev  cpu=3 start=41.68 finish=94.84\n      95039) sh               cpu=11 start=41.68 finish=94.83\n        95041) bash             cpu=11 start=41.68 finish=94.83\n          95042) x264_r_base.mev  cpu=11 start=41.68 finish=94.82\n      95043) sh               cpu=0 start=41.79 finish=94.90\n        95045) bash             cpu=0 start=41.79 finish=94.90\n          95047) x264_r_base.mev  cpu=0 start=41.79 finish=94.89\n      95044) sh               cpu=8 start=41.79 finish=94.83\n        95046) bash             cpu=8 start=41.79 finish=94.83\n          95048) x264_r_base.mev  cpu=8 start=41.79 finish=94.82\n      95049) sh               cpu=14 start=41.79 finish=95.04\n        95051) bash             cpu=14 start=41.79 finish=95.04\n          95053) x264_r_base.mev  cpu=14 start=41.80 finish=95.03\n      95050) sh               cpu=6 start=41.79 finish=94.89\n        95052) bash             cpu=6 start=41.79 finish=94.88\n          95054) x264_r_base.mev  cpu=6 start=41.80 finish=94.88\n      95055) sh               cpu=12 start=41.86 finish=95.03\n        95057) bash             cpu=12 start=41.86 finish=95.03\n          95059) x264_r_base.mev  cpu=12 start=41.86 finish=95.02\n      95056) sh               cpu=4 start=41.86 finish=94.94\n        95058) bash             cpu=4 start=41.86 finish=94.94\n          95060) x264_r_base.mev  cpu=4 start=41.86 finish=94.94\n      95061) sh               cpu=15 start=41.94 finish=95.35\n        95063) bash             cpu=15 start=41.94 finish=95.35\n          95069) x264_r_base.mev  cpu=15 start=41.95 finish=95.35\n      95062) sh               cpu=7 start=41.94 finish=95.34\n        95064) bash             cpu=7 start=41.95 finish=95.34\n          95070) x264_r_base.mev  cpu=7 start=41.95 finish=95.34\n      95065) sh               cpu=5 start=41.95 finish=95.07\n        95067) bash             cpu=5 start=41.95 finish=95.07\n          95072) x264_r_base.mev  cpu=5 start=41.95 finish=95.06\n      95066) sh               cpu=13 start=41.95 finish=95.25\n        95068) bash             cpu=13 start=41.95 finish=95.25\n          95071) x264_r_base.mev  cpu=13 start=41.95 finish=95.24\n      95073) sh               cpu=2 start=41.95 finish=94.88\n        95076) bash             cpu=2 start=41.95 finish=94.88\n          95077) x264_r_base.mev  cpu=2 start=41.95 finish=94.88\n      95074) sh               cpu=10 start=41.95 finish=95.05\n        95075) bash             cpu=10 start=41.95 finish=95.05\n          95078) x264_r_base.mev  cpu=10 start=41.95 finish=95.04\n      95079) sh               cpu=9 start=94.57 finish=151.25\n        95080) bash             cpu=9 start=94.57 finish=151.25\n          95081) x264_r_base.mev  cpu=9 start=94.57 finish=151.23\n      95082) sh               cpu=1 start=94.64 finish=151.31\n        95083) bash             cpu=1 start=94.64 finish=151.31\n          95084) x264_r_base.mev  cpu=1 start=94.64 finish=151.30\n      95085) sh               cpu=8 start=94.83 finish=152.16\n        95086) bash             cpu=8 start=94.83 finish=152.16\n          95089) x264_r_base.mev  cpu=8 start=94.83 finish=152.15\n      95087) sh               cpu=11 start=94.83 finish=151.87\n        95088) bash             cpu=11 start=94.83 finish=151.87\n          95090) x264_r_base.mev  cpu=11 start=94.83 finish=151.86\n      95091) sh               cpu=3 start=94.84 finish=151.87\n        95092) bash             cpu=3 start=94.85 finish=151.87\n          95093) x264_r_base.mev  cpu=3 start=94.85 finish=151.85\n      95094) sh               cpu=2 start=94.88 finish=151.48\n        95095) bash             cpu=2 start=94.88 finish=151.48\n          95098) x264_r_base.mev  cpu=2 start=94.89 finish=151.46\n      95096) sh               cpu=6 start=94.89 finish=151.55\n        95097) bash             cpu=6 start=94.89 finish=151.55\n          95099) x264_r_base.mev  cpu=6 start=94.89 finish=151.53\n      95100) sh               cpu=0 start=94.90 finish=152.16\n        95101) bash             cpu=0 start=94.90 finish=152.16\n          95102) x264_r_base.mev  cpu=0 start=94.90 finish=152.15\n      95103) sh               cpu=4 start=94.94 finish=151.98\n        95104) bash             cpu=4 start=94.94 finish=151.98\n          95105) x264_r_base.mev  cpu=4 start=94.95 finish=151.97\n      95106) sh               cpu=12 start=95.03 finish=152.00\n        95107) bash             cpu=12 start=95.03 finish=152.00\n          95108) x264_r_base.mev  cpu=12 start=95.03 finish=151.99\n      95109) sh               cpu=14 start=95.04 finish=151.85\n        95110) bash             cpu=14 start=95.04 finish=151.85\n          95111) x264_r_base.mev  cpu=14 start=95.04 finish=151.84\n      95112) sh               cpu=10 start=95.05 finish=151.62\n        95113) bash             cpu=10 start=95.05 finish=151.62\n          95114) x264_r_base.mev  cpu=10 start=95.05 finish=151.60\n      95115) sh               cpu=5 start=95.07 finish=151.71\n        95116) bash             cpu=5 start=95.07 finish=151.71\n          95117) x264_r_base.mev  cpu=5 start=95.07 finish=151.69\n      95118) sh               cpu=13 start=95.25 finish=151.83\n        95119) bash             cpu=13 start=95.25 finish=151.83\n          95120) x264_r_base.mev  cpu=13 start=95.25 finish=151.82\n      95121) sh               cpu=7 start=95.34 finish=152.43\n        95122) bash             cpu=7 start=95.34 finish=152.43\n          95123) x264_r_base.mev  cpu=7 start=95.35 finish=152.42\n      95124) sh               cpu=15 start=95.36 finish=152.44\n        95125) bash             cpu=15 start=95.36 finish=152.44\n          95126) x264_r_base.mev  cpu=15 start=95.36 finish=152.43\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>x264 is a SPEC CPU(R) benchmark written in C and described here. The workload runs on all logical cores Topdown profile shows a somewhat balanced workload with moderate retirement rate. AMD metrics on 7840 show show a higher floating point <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/cpu2017\/525-x264_r\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":2297,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-2387","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/2387","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=2387"}],"version-history":[{"count":2,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/2387\/revisions"}],"predecessor-version":[{"id":2461,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/2387\/revisions\/2461"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/2297"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=2387"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}