{"id":290,"date":"2024-01-06T13:20:04","date_gmt":"2024-01-06T13:20:04","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=290"},"modified":"2024-01-07T14:21:24","modified_gmt":"2024-01-07T14:21:24","slug":"compress-7zip","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/compress-7zip\/","title":{"rendered":"compress-7zip"},"content":{"rendered":"\n<p>Measuring the 7zip for both compression and decompression.  Relatively speaking Intel is worse for decompression (48644 vs. 71980) than compression (63103 vs. 77088).  Also interesting that none tests of different compression tools use the same metrics and workload so not easy to compare between tools.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-22.png\" alt=\"\" class=\"wp-image-316\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-22.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-22-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-22-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics show the frontend time is higher than average. This is an integer code with little floating point.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              87.917\non_cpu               0.748          # 11.97 \/ 16 cores\nutime                1039.851\nstime                12.844\nnvcsw                50786          # 59.52%\nnivcsw               34546          # 40.48%\ninblock              4288           # 48.77\/sec\nonblock              1440           # 16.38\/sec\ncpu-clock            1052749586883  # 1052.750 seconds\ntask-clock           1052781408945  # 1052.781 seconds\npage faults          5258853        # 4995.199\/sec\ncontext switches     85583          # 81.292\/sec\ncpu migrations       17387          # 16.515\/sec\nmajor page faults    22             # 0.021\/sec\nminor page faults    5258831        # 4995.178\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             628814146314   # 129.344 branches per 1000 inst\nbranch misses        76982056235    # 12.24% branch miss\nconditional          528387042597   # 108.687 conditional branches per 1000 inst\nindirect             7604961943     # 1.564 indirect branches per 1000 inst\ncpu-cycles           4492468296331  # 3.18 GHz\ninstructions         4850219742291  # 1.08 IPC\nslots                9009034564314  #\nretiring             1558169368082  # 17.3% (21.1%)\n-- ucode             2613094653     #     0.0%\n-- fastpath          1555556273429  #    17.3%\nfrontend             2825639082052  # 31.4% (38.2%)\n-- latency           1899483769890  #    21.1%\n-- bandwidth         926155312162   #    10.3%\nbackend              2220098013052  # 24.6% (30.0%)\n-- cpu               603418490879   #     6.7%\n-- memory            1616679522173  #    17.9%\nspeculation          787553478842   #  8.7% (10.7%)\n-- branch mispredict 787156986822   #     8.7%\n-- pipeline restart  396492020      #     0.0%\nsmt-contention       1617563531983  # 18.0% ( 0.0%)\ncpu-cycles           4493449596005  # 3.23 GHz\ninstructions         4856786322509  # 1.08 IPC\ninstructions         1618496765469  # 6.661 l2 access per 1000 inst\nl2 hit from l1       8190492066     # 35.93% l2 miss\nl2 miss from l1      2069712680     #\nl2 hit from l2 pf    786340679      #\nl3 hit from l2 pf    464653014      #\nl3 miss from l2 pf   1339243948     #\ninstructions         1620260319095  # 0.036 float per 1000 inst\nfloat 512            49             # 0.000 AVX-512 per 1000 inst\nfloat 256            1026           # 0.000 AVX-256 per 1000 inst\nfloat 128            58114731       # 0.036 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         0              # 0.000 scalar per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              116.142\non_cpu               0.783          # 12.53 \/ 16 cores\nutime                1443.765\nstime                11.430\nnvcsw                45578          # 58.95%\nnivcsw               31733          # 41.05%\ninblock              3760           # 32.37\/sec\nonblock              1432           # 12.33\/sec\ncpu-clock            1455209498359  # 1455.209 seconds\ntask-clock           1455237240817  # 1455.237 seconds\npage faults          5259536        # 3614.212\/sec\ncontext switches     77717          # 53.405\/sec\ncpu migrations       13459          # 9.249\/sec\nmajor page faults    23             # 0.016\/sec\nminor page faults    5259513        # 3614.196\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             626738930866   # 129.047 branches per 1000 inst\nbranch misses        77253997039    # 12.33% branch miss\nconditional          626738959666   # 129.047 conditional branches per 1000 inst\nindirect             214140530082   # 44.092 indirect branches per 1000 inst\nslots                8027257912088  #\nretiring             2207799319543  # 27.5% (27.5%)\n-- ucode             49829353284    #     0.6%\n-- fastpath          2157969966259  #    26.9%\nfrontend             1082501969803  # 13.5% (13.5%)\n-- latency           619984206276   #     7.7%\n-- bandwidth         462517763527   #     5.8%\nbackend              1667269768131  # 20.8% (20.8%)\n-- cpu               898846801693   #    11.2%\n-- memory            768422966438   #     9.6%\nspeculation          3119864195896  # 38.9% (38.9%)\n-- branch mispredict 3115210936873  #    38.8%\n-- pipeline restart  4653259023     #     0.1%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           4194263357929  # 2.27 GHz\ninstructions         4115388761059  # 0.98 IPC\nl2 access            30885642983    # 12.844 l2 access per 1000 inst\nl2 miss              14839356087    # 48.05% l2 miss<\/code><\/pre>\n\n\n\n<p>Overall process profile, with many short-lived processes.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>884 processes\n\t531 7zz                  82961.41   790.03\n\t 64 clinfo                  10.88     3.52\n\t 38 vulkaninfo               0.57     1.34\n\t  6 glxinfo:gdrv0            0.13     0.07\n\t  4 vulkani:disk$0           0.06     0.14\n\t  6 php                      0.06     0.06\n\t  2 glxinfo                  0.06     0.03\n\t  2 glxinfo:cs0              0.06     0.03\n\t  2 glxinfo:disk$0           0.05     0.03\n\t  2 glxinfo:sh0              0.05     0.03\n\t  2 glxinfo:shlo0            0.05     0.03\n\t  6 clang                    0.04     0.04\n\t  2 llvmpipe-0               0.03     0.07\n\t  2 llvmpipe-1               0.03     0.07\n\t  2 llvmpipe-10              0.03     0.07\n\t  2 llvmpipe-11              0.03     0.07\n\t  2 llvmpipe-12              0.03     0.07\n\t  2 llvmpipe-13              0.03     0.07\n\t  2 llvmpipe-14              0.03     0.07\n\t  2 llvmpipe-15              0.03     0.07\n\t  2 llvmpipe-2               0.03     0.07\n\t  2 llvmpipe-3               0.03     0.07\n\t  2 llvmpipe-4               0.03     0.07\n\t  2 llvmpipe-5               0.03     0.07\n\t  2 llvmpipe-6               0.03     0.07\n\t  2 llvmpipe-7               0.03     0.07\n\t  2 llvmpipe-8               0.03     0.07\n\t  2 llvmpipe-9               0.03     0.07\n\t  1 lspci                    0.00     0.03\n\t 89 sh                       0.00     0.00\n\t 13 gcc                      0.00     0.00\n\t  9 gsettings                0.00     0.00\n\t  9 stty                     0.00     0.00\n\t  8 stat                     0.00     0.00\n\t  8 systemd-detect-          0.00     0.00\n\t  6 llvm-link                0.00     0.00\n\t  5 gmain                    0.00     0.00\n\t  5 phoronix-test-s          0.00     0.00\n\t  3 compress-7zip            0.00     0.00\n\t  2 cc                       0.00     0.00\n\t  2 dconf worker             0.00     0.00\n\t  2 lscpu                    0.00     0.00\n\t  2 uname                    0.00     0.00\n\t  2 which                    0.00     0.00\n\t  2 xset                     0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 dmesg                    0.00     0.00\n\t  1 dmidecode                0.00     0.00\n\t  1 grep                     0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lsmod                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 ps                       0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath                 0.00     0.00\n\t  1 sed                      0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 systemctl                0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n\t  1 xrandr                   0.00     0.00\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Measuring the 7zip for both compression and decompression. Relatively speaking Intel is worse for decompression (48644 vs. 71980) than compression (63103 vs. 77088). Also interesting that none tests of different compression tools use the same metrics and workload so not <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/compress-7zip\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-290","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/290","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=290"}],"version-history":[{"count":3,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/290\/revisions"}],"predecessor-version":[{"id":334,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/290\/revisions\/334"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=290"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}