{"id":1248,"date":"2024-02-01T23:35:14","date_gmt":"2024-02-01T23:35:14","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=1248"},"modified":"2024-02-02T03:18:59","modified_gmt":"2024-02-02T03:18:59","slug":"mpcbench","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/mpcbench\/","title":{"rendered":"mpcbench"},"content":{"rendered":"\n<p>The GNU library for complex numbers. This is a single-threaded program.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/systemtime-9.png\" alt=\"\" class=\"wp-image-1282\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/systemtime-9.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/systemtime-9-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/systemtime-9-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>Topdown profile has a high retirement rate<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/amdtopdown-9.png\" alt=\"\" class=\"wp-image-1284\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/amdtopdown-9.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/amdtopdown-9-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/02\/amdtopdown-9-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics show little floating point and not much L2 access<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              166.758\non_cpu               0.057          # 0.92 \/ 16 cores\nutime                151.162\nstime                1.479\nnvcsw                2138           # 71.36%\nnivcsw               858            # 28.64%\ninblock              0              # 0.00\/sec\nonblock              16504          # 98.97\/sec\ncpu-clock            152672415749   # 152.672 seconds\ntask-clock           152676091847   # 152.676 seconds\npage faults          557051         # 3648.580\/sec\ncontext switches     3615           # 23.678\/sec\ncpu migrations       323            # 2.116\/sec\nmajor page faults    2              # 0.013\/sec\nminor page faults    557049         # 3648.567\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             274437965873   # 124.238 branches per 1000 inst\nbranch misses        2307028641     # 0.84% branch miss\nconditional          177481790336   # 80.346 conditional branches per 1000 inst\nindirect             27117521790    # 12.276 indirect branches per 1000 inst\ncpu-cycles           706513748986   # 0.26 GHz\ninstructions         2212572685131  # 3.13 IPC high\nslots                1415733915426  #\nretiring             795937030816   # 56.2% (56.2%) high\n-- ucode             318086094      #     0.0%\n-- fastpath          795618944722   #    56.2%\nfrontend             225753182746   # 15.9% (16.0%)\n-- latency           123871977144   #     8.7%\n-- bandwidth         101881205602   #     7.2%\nbackend              337512903478   # 23.8% (23.8%)\n-- cpu               144707653849   #    10.2%\n-- memory            192805249629   #    13.6%\nspeculation          56157233610    #  4.0% ( 4.0%)\n-- branch mispredict 53613407355    #     3.8%\n-- pipeline restart  2543826255     #     0.2%\nsmt-contention       373171679      #  0.0% ( 0.0%)\ncpu-cycles           706487025765   # 0.26 GHz\ninstructions         2216436708146  # 3.14 IPC high\ninstructions         739409511698   # 12.099 l2 access per 1000 inst\nl2 hit from l1       6321172491     # 23.47% l2 miss\nl2 miss from l1      133295780      #\nl2 hit from l2 pf    658730830      #\nl3 hit from l2 pf    1415521643     #\nl3 miss from l2 pf   550753147      #\ninstructions         739266549923   # 2.087 float per 1000 inst\nfloat 512            68             # 0.000 AVX-512 per 1000 inst\nfloat 256            580            # 0.000 AVX-256 per 1000 inst\nfloat 128            1543095062     # 2.087 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         0              # 0.000 scalar per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              164.961\non_cpu               0.057          # 0.91 \/ 16 cores\nutime                150.022\nstime                0.900\nnvcsw                2145           # 73.31%\nnivcsw               781            # 26.69%\ninblock              2056           # 12.46\/sec\nonblock              5760           # 34.92\/sec\ncpu-clock            150932957744   # 150.933 seconds\ntask-clock           150937250108   # 150.937 seconds\npage faults          552371         # 3659.607\/sec\ncontext switches     3463           # 22.943\/sec\ncpu migrations       354            # 2.345\/sec\nmajor page faults    16             # 0.106\/sec\nminor page faults    552355         # 3659.501\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             249717007520   # 123.921 branches per 1000 inst\nbranch misses        2282770198     # 0.91% branch miss\nconditional          249717023360   # 123.921 conditional branches per 1000 inst\nindirect             22076312923    # 10.955 indirect branches per 1000 inst\nslots                3411955791404  #\nretiring             2085926990436  # 61.1% (61.1%) high\n-- ucode             213396009277   #     6.3%\n-- fastpath          1872530981159  #    54.9%\nfrontend             653782899966   # 19.2% (19.2%)\n-- latency           162807831077   #     4.8%\n-- bandwidth         490975068889   #    14.4%\nbackend              414970189514   # 12.2% (12.2%) low\n-- cpu               176324196229   #     5.2%\n-- memory            238645993285   #     7.0%\nspeculation          429894175274   # 12.6% (12.6%) high\n-- branch mispredict 420757258873   #    12.3%\n-- pipeline restart  9136916401     #     0.3%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           568505974432   # 0.22 GHz\ninstructions         2013312584177  # 3.54 IPC high\nl2 access            23904751544    # 11.887 l2 access per 1000 inst\nl2 miss              7809825493     # 32.67% l2 miss\n<\/code><\/pre>\n\n\n\n<p>Process overview is simple<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>399 processes\n\t 33 mpcbench               150.14     0.70\n\t 68 clinfo                  16.54     5.66\n\t 38 vulkaninfo               1.15     1.11\n\t  6 glxinfo:gdrv0            0.15     0.02\n\t  6 glxinfo:gl0              0.15     0.02\n\t  4 vulkani:disk$0           0.12     0.11\n\t  2 glxinfo                  0.07     0.02\n\t  2 glxinfo:cs0              0.07     0.02\n\t  2 glxinfo:disk$0           0.07     0.02\n\t  2 glxinfo:sh0              0.07     0.02\n\t  2 glxinfo:shlo0            0.07     0.02\n\t  6 clang                    0.06     0.06\n\t  2 llvmpipe-0               0.06     0.06\n\t  2 llvmpipe-1               0.06     0.06\n\t  2 llvmpipe-10              0.06     0.06\n\t  2 llvmpipe-11              0.06     0.06\n\t  2 llvmpipe-12              0.06     0.06\n\t  2 llvmpipe-13              0.06     0.06\n\t  2 llvmpipe-14              0.06     0.06\n\t  2 llvmpipe-15              0.06     0.06\n\t  2 llvmpipe-2               0.06     0.06\n\t  2 llvmpipe-3               0.06     0.06\n\t  2 llvmpipe-4               0.06     0.06\n\t  2 llvmpipe-5               0.06     0.06\n\t  2 llvmpipe-6               0.06     0.06\n\t  2 llvmpipe-7               0.06     0.06\n\t  2 llvmpipe-8               0.06     0.06\n\t  2 llvmpipe-9               0.06     0.06\n\t  6 php                      0.04     0.09\n\t  3 rocminfo                 0.03     0.00\n\t  1 lspci                    0.00     0.02\n\t  1 ps                       0.00     0.01\n\t 81 sh                       0.00     0.00\n\t 13 gcc                      0.00     0.00\n\t 13 gsettings                0.00     0.00\n\t 10 sed                      0.00     0.00\n\t  8 stat                     0.00     0.00\n\t  8 systemd-detect-          0.00     0.00\n\t  6 llvm-link                0.00     0.00\n\t  6 make                     0.00     0.00\n\t  5 phoronix-test-s          0.00     0.00\n\t  3 ls                       0.00     0.00\n\t  2 cc                       0.00     0.00\n\t  2 gmain                    0.00     0.00\n\t  2 lscpu                    0.00     0.00\n\t  2 uname                    0.00     0.00\n\t  2 which                    0.00     0.00\n\t  2 xset                     0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dconf worker             0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 dmesg                    0.00     0.00\n\t  1 dmidecode                0.00     0.00\n\t  1 grep                     0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lsmod                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath                 0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 stty                     0.00     0.00\n\t  1 systemctl                0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n\t  1 xrandr                   0.00     0.00\n1 processes running\n48 maximum processes\n<\/code><\/pre>\n\n\n\n<p>Computation structure<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>      110405) mpcbench         cpu=0 start=5.63  finish=55.92\n        110406) make             cpu=6 start=5.63  finish=55.92\n          110407) make             cpu=10 start=5.64  finish=55.92\n            110408) mpcbench         cpu=4 start=5.64  finish=55.92\n              110409) mpcbench         cpu=5 start=5.65  finish=5.65 \n                110410) mpcbench         cpu=11 start=5.65  finish=5.65 \n              110411) mpcbench         cpu=1 start=5.65  finish=5.65 \n              110412) mpcbench         cpu=5 start=5.65  finish=5.65 \n                110413) mpcbench         cpu=11 start=5.65  finish=5.65 \n                110414) sed              cpu=15 start=5.65  finish=5.65 \n              110415) mpcbench         cpu=1 start=5.65  finish=5.66 \n                110416) ls               cpu=11 start=5.65  finish=5.66 \n                110417) sed              cpu=15 start=5.65  finish=5.66 \n              110418) mpcbench         cpu=5 start=5.66  finish=5.66 \n              110419) mpcbench         cpu=11 start=5.66  finish=5.66 \n                110420) mpcbench         cpu=15 start=5.66  finish=5.66 \n                110421) sed              cpu=6 start=5.66  finish=5.66<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>The GNU library for complex numbers. This is a single-threaded program. Topdown profile has a high retirement rate AMD metrics show little floating point and not much L2 access Intel metrics Process overview is simple Computation structure<\/p>\n <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/mpcbench\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a>","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-1248","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/1248","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=1248"}],"version-history":[{"count":3,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/1248\/revisions"}],"predecessor-version":[{"id":1285,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/1248\/revisions\/1285"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=1248"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}