{"id":430,"date":"2024-01-12T01:46:30","date_gmt":"2024-01-12T01:46:30","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=430"},"modified":"2024-01-12T13:47:02","modified_gmt":"2024-01-12T13:47:02","slug":"xmrig","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/xmrig\/","title":{"rendered":"xmrig"},"content":{"rendered":"\n<p>CPU\/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT, measuring CPU mining performance. There are six workloads all heavily memory-bound. There is a high amount of on cpu time.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-1.png\" alt=\"\" class=\"wp-image-435\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-1.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-1-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-1-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>Topdown profiles show a very heavy backend emphasis for all workloads.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-39.png\" alt=\"\" class=\"wp-image-436\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-39.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-39-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-39-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics show a floating point code where can be useful to see how much avx512 or similar might help.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              7633.337\non_cpu               0.983          # 15.73 \/ 16 cores\nutime                120025.052\nstime                30.869\nnvcsw                26685          # 2.42%\nnivcsw               1077914        # 97.58%\ninblock              0              # 0.00\/sec\nonblock              14712          # 1.93\/sec\ncpu-clock            120069039240706 # 120069.039 seconds\ntask-clock           120069840838680 # 120069.841 seconds\npage faults          9296514        # 77.426\/sec\ncontext switches     1142481        # 9.515\/sec\ncpu migrations       1022           # 0.009\/sec\nmajor page faults    2              # 0.000\/sec\nminor page faults    9296512        # 77.426\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             8062573611556  # 39.620 branches per 1000 inst\nbranch misses        153197099578   # 1.90% branch miss\nconditional          7772735242295  # 38.196 conditional branches per 1000 inst\nindirect             38503133079    # 0.189 indirect branches per 1000 inst\ncpu-cycles           546655302339493 # 4.46 GHz\ninstructions         203502838834852 # 0.37 IPC\nslots                1093180504537446 #\nretiring             74455663673890 #  6.8% (10.5%)\n-- ucode             208442333879   #     0.0%\n-- fastpath          74247221340011 #     6.8%\nfrontend             36376680462764 #  3.3% ( 5.1%)\n-- latency           17741449428402 #     1.6%\n-- bandwidth         18635231034362 #     1.7%\nbackend              592122050788629 # 54.2% (83.2%)\n-- cpu               53734686284465 #     4.9%\n-- memory            538387364504164 #    49.2%\nspeculation          8903626848802  #  0.8% ( 1.3%)\n-- branch mispredict 4271095835257  #     0.4%\n-- pipeline restart  4632531013545  #     0.4%\nsmt-contention       381322073349062 # 34.9% ( 0.0%)\ncpu-cycles           548328028280026 # 4.46 GHz\ninstructions         203501469221988 # 0.37 IPC\ninstructions         67852901496794 # 85.166 l2 access per 1000 inst\nl2 hit from l1       4237194060240  # 33.38% l2 miss\nl2 miss from l1      1103219865801  #\nl2 hit from l2 pf    715887438905   #\nl3 hit from l2 pf    530182072891   #\nl3 miss from l2 pf   295527868000   #\ninstructions         67828434875857 # 307.280 float per 1000 inst\nfloat 512            81             # 0.000 AVX-512 per 1000 inst\nfloat 256            642            # 0.000 AVX-256 per 1000 inst\nfloat 128            20842302291360 # 307.280 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         9644           # 0.000 scalar per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              9704.996\non_cpu               0.987          # 15.80 \/ 16 cores\nutime                153285.828\nstime                30.577\nnvcsw                32235          # 2.85%\nnivcsw               1100800        # 97.15%\ninblock              11464          # 1.18\/sec\nonblock              3552           # 0.37\/sec\ncpu-clock            153326793555754 # 153326.794 seconds\ntask-clock           153327406036385 # 153327.406 seconds\npage faults          9280828        # 60.529\/sec\ncontext switches     1181274        # 7.704\/sec\ncpu migrations       1337           # 0.009\/sec\nmajor page faults    110            # 0.001\/sec\nminor page faults    9280718        # 60.529\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             7635137269000  # 37.692 branches per 1000 inst\nbranch misses        58198391080    # 0.76% branch miss\nconditional          7635137303304  # 37.692 conditional branches per 1000 inst\nindirect             756405371179   # 3.734 indirect branches per 1000 inst\nslots                744881612864960 #\nretiring             123016188945705 # 16.5% (16.5%)\n-- ucode             12533615894677 #     1.7%\n-- fastpath          110482573051028 #    14.8%\nfrontend             64737165983284 #  8.7% ( 8.7%)\n-- latency           58449599406336 #     7.8%\n-- bandwidth         6287566576948  #     0.8%\nbackend              554520019479004 # 74.4% (74.4%)\n-- cpu               101359296967534 #    13.6%\n-- memory            453160722511470 #    60.8%\nspeculation          3914361186934  #  0.5% ( 0.5%)\n-- branch mispredict 2540078098156  #     0.3%\n-- pipeline restart  1374283088778  #     0.2%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           343117809236538 # 2.21 GHz\ninstructions         162554156441033 # 0.47 IPC\nl2 access            7992706341828  # 67.465 l2 access per 1000 inst\nl2 miss              2853267229070  # 35.70% l2 miss\n<\/code><\/pre>\n\n\n\n<p>Process tree died after first run but shows straightforward setup<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>656 processes\n\t378 xmrig                1193344.81   545.56\n\t 34 clinfo                   9.92     3.00\n\t 19 vulkaninfo               0.57     0.58\n\t  3 glxinfo:gdrv0            0.08     0.06\n\t  2 vulkani:disk$0           0.06     0.07\n\t  6 clang                    0.06     0.06\n\t  1 glxinfo                  0.04     0.02\n\t  1 glxinfo:cs0              0.04     0.02\n\t  1 glxinfo:disk$0           0.04     0.02\n\t  1 glxinfo:sh0              0.04     0.02\n\t  1 glxinfo:shlo0            0.04     0.02\n\t  1 llvmpipe-0               0.03     0.04\n\t  1 llvmpipe-1               0.03     0.04\n\t  1 llvmpipe-10              0.03     0.04\n\t  1 llvmpipe-11              0.03     0.04\n\t  1 llvmpipe-12              0.03     0.04\n\t  1 llvmpipe-13              0.03     0.04\n\t  1 llvmpipe-14              0.03     0.04\n\t  1 llvmpipe-15              0.03     0.04\n\t  1 llvmpipe-2               0.03     0.04\n\t  1 llvmpipe-3               0.03     0.04\n\t  1 llvmpipe-4               0.03     0.04\n\t  1 llvmpipe-5               0.03     0.04\n\t  1 llvmpipe-6               0.03     0.04\n\t  1 llvmpipe-7               0.03     0.04\n\t  1 llvmpipe-8               0.03     0.04\n\t  1 llvmpipe-9               0.03     0.04\n\t  1 ps                       0.00     0.01\n\t 75 sh                       0.00     0.00\n\t 13 gcc                      0.00     0.00\n\t 11 modprobe                 0.00     0.00\n\t  8 gsettings                0.00     0.00\n\t  8 systemd-detect-          0.00     0.00\n\t  7 stat                     0.00     0.00\n\t  6 llvm-link                0.00     0.00\n\t  5 gmain                    0.00     0.00\n\t  4 phoronix-test-s          0.00     0.00\n\t  2 dconf worker             0.00     0.00\n\t  2 which                    0.00     0.00\n\t  1 cc                       0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 grep                     0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lscpu                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath                 0.00     0.00\n\t  1 sed                      0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 stty                     0.00     0.00\n\t  1 systemctl                0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n\t  1 xrandr                   0.00     0.00\n\t  1 xset                     0.00     0.00\n32 processes running\n47 maximum processes\n<\/code><\/pre>\n\n\n\n<p>Core computation blocks start in parallel<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>      22116) xmrig            cpu=5 start=6.18  finish=272.99\n        22117) xmrig            cpu=15 start=6.18  finish=272.84\n          22118) xmrig            cpu=8 start=6.19  finish=272.84\n          22119) xmrig            cpu=9 start=6.19  finish=272.84\n          22120) xmrig            cpu=14 start=6.19  finish=272.84\n          22121) xmrig            cpu=12 start=6.19  finish=272.84\n          22122) xmrig            cpu=2 start=6.19  finish=272.84\n            22125) xmrig            cpu=11 start=6.58  finish=9.08 \n            22126) xmrig            cpu=4 start=6.58  finish=9.10 \n            22127) xmrig            cpu=5 start=6.58  finish=9.09 \n            22128) xmrig            cpu=14 start=6.58  finish=9.10 \n            22129) xmrig            cpu=7 start=6.58  finish=9.09 \n            22130) xmrig            cpu=0 start=6.58  finish=9.10 \n            22131) xmrig            cpu=9 start=6.58  finish=9.07 \n            22132) xmrig            cpu=10 start=6.58  finish=9.07 \n            22133) xmrig            cpu=3 start=6.58  finish=9.08 \n            22134) xmrig            cpu=12 start=6.58  finish=9.10 \n            22135) xmrig            cpu=13 start=6.58  finish=9.09 \n            22136) xmrig            cpu=6 start=6.58  finish=9.10 \n            22137) xmrig            cpu=15 start=6.58  finish=9.09 \n            22138) xmrig            cpu=8 start=6.58  finish=9.10 \n            22139) xmrig            cpu=1 start=6.58  finish=9.06 \n            22140) xmrig            cpu=2 start=6.58  finish=9.07 \n          22123) sh               cpu=3 start=6.19  finish=6.20 \n            22124) modprobe         cpu=5 start=6.20  finish=6.20 \n          22141) xmrig            cpu=0 start=9.10  finish=272.84\n          22142) xmrig            cpu=13 start=9.10  finish=272.84\n          22143) xmrig            cpu=6 start=9.10  finish=272.84\n          22144) xmrig            cpu=7 start=9.10  finish=272.84\n          22145) xmrig            cpu=3 start=9.10  finish=272.84\n          22146) xmrig            cpu=2 start=9.10  finish=272.84\n          22147) xmrig            cpu=10 start=9.11  finish=272.84\n          22148) xmrig            cpu=15 start=9.11  finish=272.84\n          22149) xmrig            cpu=9 start=9.11  finish=272.84\n          22150) xmrig            cpu=12 start=9.11  finish=272.84\n          22151) xmrig            cpu=11 start=9.11  finish=272.84\n          22152) xmrig            cpu=5 start=9.11  finish=272.84\n          22153) xmrig            cpu=14 start=9.11  finish=272.84\n          22154) xmrig            cpu=1 start=9.11  finish=272.84\n          22155) xmrig            cpu=4 start=9.11  finish=272.84\n          22156) xmrig            cpu=8 start=9.11  finish=272.84\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>CPU\/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT, measuring CPU mining performance. There are six workloads all heavily memory-bound. There is a high amount of on cpu time. Topdown profiles show a very heavy backend emphasis for all workloads. AMD <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/xmrig\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-430","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/430","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=430"}],"version-history":[{"count":2,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/430\/revisions"}],"predecessor-version":[{"id":437,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/430\/revisions\/437"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=430"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}