{"id":575,"date":"2024-01-14T23:05:32","date_gmt":"2024-01-14T23:05:32","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=575"},"modified":"2024-01-15T14:43:30","modified_gmt":"2024-01-15T14:43:30","slug":"openjpeg","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/openjpeg\/","title":{"rendered":"openjpeg"},"content":{"rendered":"\n<p>OpenJPEG is a very quick-running microbenchmark, barely five seconds per invocation.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-19.png\" alt=\"\" class=\"wp-image-576\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-19.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-19-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-19-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>Topdown metrics are also tough to see given the quick runtime, but frontend time is somewhat higher, in both latency and bandwidth.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-57.png\" alt=\"\" class=\"wp-image-577\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-57.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-57-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-57-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              36.594\non_cpu     
          0.282          # 4.52 \/ 16 cores\nutime                154.590\nstime                10.736\nnvcsw                554541         # 99.45%\nnivcsw               3044           # 0.55%\ninblock              0              # 0.00\/sec\nonblock              1592080        # 43506.24\/sec\ncpu-clock            165100376148   # 165.100 seconds\ntask-clock           165311190932   # 165.311 seconds\npage faults          2412706        # 14594.934\/sec\ncontext switches     557594         # 3372.996\/sec\ncpu migrations       32889          # 198.952\/sec\nmajor page faults    3              # 0.018\/sec\nminor page faults    2412703        # 14594.916\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             122622194122   # 169.925 branches per 1000 inst\nbranch misses        13173962149    # 10.74% branch miss\nconditional          105557583043   # 146.277 conditional branches per 1000 inst\nindirect             936815492      # 1.298 indirect branches per 1000 inst\ncpu-cycles           4454978848     # 0.08 GHz\ninstructions         8339672492     # 1.87 IPC\nslots                9575987148     #\nretiring             2996012435     # 31.3% (31.3%)\n-- ucode             7993573        #     0.1%\n-- fastpath          2988018862     #    31.2%\nfrontend             4456160818     # 46.5% (46.6%)\n-- latency           3421365390     #    35.7%\n-- bandwidth         1034795428     #    10.8%\nbackend              1616235499     # 16.9% (16.9%)\n-- cpu               252357964      #     2.6%\n-- memory            1363877535     #    14.2%\nspeculation          495471302      #  5.2% ( 5.2%)\n-- branch mispredict 490561519      #     5.1%\n-- pipeline restart  4909783        #     0.1%\nsmt-contention       11872580       #  0.1% ( 0.0%)\ncpu-cycles           4336974307     # 0.08 GHz\ninstructions         8245407389     # 1.90 IPC\ninstructions         2949913646     # 33.427 l2 access per 
1000 inst\nl2 hit from l1       82997212       # 18.50% l2 miss\nl2 miss from l1      10232759       #\nl2 hit from l2 pf    7596544        #\nl3 hit from l2 pf    4033408        #\nl3 miss from l2 pf   3979812        #\ninstructions         2905810907     # 12.522 float per 1000 inst\nfloat 512            42             # 0.000 AVX-512 per 1000 inst\nfloat 256            18             # 0.000 AVX-256 per 1000 inst\nfloat 128            36385074       # 12.521 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         0              # 0.000 scalar per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics show a high amount of speculation, though this may also be an effect of the short runtime.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              38.306\non_cpu               0.362          # 5.79 \/ 16 cores\nutime                214.080\nstime                7.714\nnvcsw                605867         # 93.18%\nnivcsw               44331          # 6.82%\ninblock              0              # 0.00\/sec\nonblock              1580840        # 41269.12\/sec\ncpu-clock            221449835819   # 221.450 seconds\ntask-clock           221595265099   # 221.595 seconds\npage faults          2390649        # 10788.358\/sec\ncontext switches     650233         # 2934.327\/sec\ncpu migrations       62256          # 280.945\/sec\nmajor page faults    3              # 0.014\/sec\nminor page faults    2390646        # 10788.344\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             121389835828   # 168.631 branches per 1000 inst\nbranch misses        13049831180    # 10.75% branch miss\nconditional          121389848820   # 168.631 conditional branches per 1000 inst\nindirect             29846606459    # 41.462 indirect branches per 1000 inst\nslots                1279821990026  #\nretiring             348827985062   # 27.3% (27.3%)\n-- ucode            
 23641140077    #     1.8%\n-- fastpath          325186844985   #    25.4%\nfrontend             235823088930   # 18.4% (18.4%)\n-- latency           121713340927   #     9.5%\n-- bandwidth         114109748003   #     8.9%\nbackend              246983777931   # 19.3% (19.3%)\n-- cpu               93937745126    #     7.3%\n-- memory            153046032805   #    12.0%\nspeculation          464714479511   # 36.3% (36.3%)\n-- branch mispredict 462567629616   #    36.1%\n-- pipeline restart  2146849895     #     0.2%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           734041796831   # 1.21 GHz\ninstructions         757312439259   # 1.03 IPC\nl2 access            6052244486     # 15.790 l2 access per 1000 inst\nl2 miss              3489149201     # 57.65% l2 miss\n<\/code><\/pre>\n\n\n\n<p>Process overview suggests we mostly measure the overhead of the test suite itself, though it also looks like we quit prematurely.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>293 processes\n\t 68 clinfo                  20.50     5.68\n\t 38 vulkaninfo               0.96     1.15\n\t  6 glxinfo:gdrv0            0.16     0.09\n\t  4 vulkani:disk$0           0.11     0.13\n\t  2 glxinfo                  0.08     0.03\n\t  2 glxinfo:cs0              0.08     0.03\n\t  2 glxinfo:disk$0           0.08     0.03\n\t  2 glxinfo:sh0              0.08     0.03\n\t  2 glxinfo:shlo0            0.08     0.03\n\t  2 llvmpipe-0               0.06     0.07\n\t  2 llvmpipe-1               0.06     0.07\n\t  2 llvmpipe-10              0.06     0.07\n\t  2 llvmpipe-11              0.06     0.07\n\t  2 llvmpipe-12              0.06     0.07\n\t  2 llvmpipe-13              0.06     0.07\n\t  2 llvmpipe-14              0.06     0.07\n\t  2 llvmpipe-15              0.06     0.07\n\t  2 llvmpipe-2               0.06     0.07\n\t  2 llvmpipe-3               0.06     0.07\n\t  2 llvmpipe-4               0.06     0.07\n\t  2 llvmpipe-5               0.06     0.07\n\t  2 llvmpipe-6        
       0.06     0.07\n\t  2 llvmpipe-7               0.06     0.07\n\t  2 llvmpipe-8               0.06     0.07\n\t  2 llvmpipe-9               0.06     0.07\n\t  6 php                      0.06     0.04\n\t  3 clang                    0.05     0.01\n\t  3 rocminfo                 0.03     0.00\n\t  1 lspci                    0.00     0.02\n\t 57 sh                       0.00     0.00\n\t 11 gsettings                0.00     0.00\n\t  6 gcc                      0.00     0.00\n\t  5 phoronix-test-s          0.00     0.00\n\t  5 stat                     0.00     0.00\n\t  4 gmain                    0.00     0.00\n\t  4 systemd-detect-          0.00     0.00\n\t  3 llvm-link                0.00     0.00\n\t  2 lscpu                    0.00     0.00\n\t  2 uname                    0.00     0.00\n\t  2 which                    0.00     0.00\n\t  2 xset                     0.00     0.00\n\t  1 cc                       0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dconf worker             0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 dmesg                    0.00     0.00\n\t  1 dmidecode                0.00     0.00\n\t  1 grep                     0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lsmod                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath                 0.00     0.00\n\t  1 sed                      0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n\t  1 xrandr                   0.00     0.00\n0 processes running\n47 maximum processes\n\n\n<\/code><\/pre>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Openjpeg is a very quick running microbenchmark. Barely five seconds per invocation. 
Topdown metrics also tough to see given the quick runtime but some higher frontend times with both latency and bandwidth AMD metrics Intel metrics show a high amount <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/openjpeg\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-575","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/575","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=575"}],"version-history":[{"count":1,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/575\/revisions"}],"predecessor-version":[{"id":579,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/575\/revisions\/579"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=575"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}