CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT, measuring CPU mining performance. There are six workloads all heavily memory-bound. There is a high amount of on cpu time.

Topdown profiles show a very heavy backend emphasis for all workloads.

AMD metrics show a floating point code where can be useful to see how much avx512 or similar might help.
elapsed 7633.337
on_cpu 0.983 # 15.73 / 16 cores
utime 120025.052
stime 30.869
nvcsw 26685 # 2.42%
nivcsw 1077914 # 97.58%
inblock 0 # 0.00/sec
onblock 14712 # 1.93/sec
cpu-clock 120069039240706 # 120069.039 seconds
task-clock 120069840838680 # 120069.841 seconds
page faults 9296514 # 77.426/sec
context switches 1142481 # 9.515/sec
cpu migrations 1022 # 0.009/sec
major page faults 2 # 0.000/sec
minor page faults 9296512 # 77.426/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 8062573611556 # 39.620 branches per 1000 inst
branch misses 153197099578 # 1.90% branch miss
conditional 7772735242295 # 38.196 conditional branches per 1000 inst
indirect 38503133079 # 0.189 indirect branches per 1000 inst
cpu-cycles 546655302339493 # 4.46 GHz
instructions 203502838834852 # 0.37 IPC
slots 1093180504537446 #
retiring 74455663673890 # 6.8% (10.5%)
-- ucode 208442333879 # 0.0%
-- fastpath 74247221340011 # 6.8%
frontend 36376680462764 # 3.3% ( 5.1%)
-- latency 17741449428402 # 1.6%
-- bandwidth 18635231034362 # 1.7%
backend 592122050788629 # 54.2% (83.2%)
-- cpu 53734686284465 # 4.9%
-- memory 538387364504164 # 49.2%
speculation 8903626848802 # 0.8% ( 1.3%)
-- branch mispredict 4271095835257 # 0.4%
-- pipeline restart 4632531013545 # 0.4%
smt-contention 381322073349062 # 34.9% ( 0.0%)
cpu-cycles 548328028280026 # 4.46 GHz
instructions 203501469221988 # 0.37 IPC
instructions 67852901496794 # 85.166 l2 access per 1000 inst
l2 hit from l1 4237194060240 # 33.38% l2 miss
l2 miss from l1 1103219865801 #
l2 hit from l2 pf 715887438905 #
l3 hit from l2 pf 530182072891 #
l3 miss from l2 pf 295527868000 #
instructions 67828434875857 # 307.280 float per 1000 inst
float 512 81 # 0.000 AVX-512 per 1000 inst
float 256 642 # 0.000 AVX-256 per 1000 inst
float 128 20842302291360 # 307.280 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 9644 # 0.000 scalar per 1000 inst
Intel metrics
elapsed 9704.996
on_cpu 0.987 # 15.80 / 16 cores
utime 153285.828
stime 30.577
nvcsw 32235 # 2.85%
nivcsw 1100800 # 97.15%
inblock 11464 # 1.18/sec
onblock 3552 # 0.37/sec
cpu-clock 153326793555754 # 153326.794 seconds
task-clock 153327406036385 # 153327.406 seconds
page faults 9280828 # 60.529/sec
context switches 1181274 # 7.704/sec
cpu migrations 1337 # 0.009/sec
major page faults 110 # 0.001/sec
minor page faults 9280718 # 60.529/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 7635137269000 # 37.692 branches per 1000 inst
branch misses 58198391080 # 0.76% branch miss
conditional 7635137303304 # 37.692 conditional branches per 1000 inst
indirect 756405371179 # 3.734 indirect branches per 1000 inst
slots 744881612864960 #
retiring 123016188945705 # 16.5% (16.5%)
-- ucode 12533615894677 # 1.7%
-- fastpath 110482573051028 # 14.8%
frontend 64737165983284 # 8.7% ( 8.7%)
-- latency 58449599406336 # 7.8%
-- bandwidth 6287566576948 # 0.8%
backend 554520019479004 # 74.4% (74.4%)
-- cpu 101359296967534 # 13.6%
-- memory 453160722511470 # 60.8%
speculation 3914361186934 # 0.5% ( 0.5%)
-- branch mispredict 2540078098156 # 0.3%
-- pipeline restart 1374283088778 # 0.2%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 343117809236538 # 2.21 GHz
instructions 162554156441033 # 0.47 IPC
l2 access 7992706341828 # 67.465 l2 access per 1000 inst
l2 miss 2853267229070 # 35.70% l2 miss
Process tree died after first run but shows straightforward setup
656 processes
378 xmrig 1193344.81 545.56
34 clinfo 9.92 3.00
19 vulkaninfo 0.57 0.58
3 glxinfo:gdrv0 0.08 0.06
2 vulkani:disk$0 0.06 0.07
6 clang 0.06 0.06
1 glxinfo 0.04 0.02
1 glxinfo:cs0 0.04 0.02
1 glxinfo:disk$0 0.04 0.02
1 glxinfo:sh0 0.04 0.02
1 glxinfo:shlo0 0.04 0.02
1 llvmpipe-0 0.03 0.04
1 llvmpipe-1 0.03 0.04
1 llvmpipe-10 0.03 0.04
1 llvmpipe-11 0.03 0.04
1 llvmpipe-12 0.03 0.04
1 llvmpipe-13 0.03 0.04
1 llvmpipe-14 0.03 0.04
1 llvmpipe-15 0.03 0.04
1 llvmpipe-2 0.03 0.04
1 llvmpipe-3 0.03 0.04
1 llvmpipe-4 0.03 0.04
1 llvmpipe-5 0.03 0.04
1 llvmpipe-6 0.03 0.04
1 llvmpipe-7 0.03 0.04
1 llvmpipe-8 0.03 0.04
1 llvmpipe-9 0.03 0.04
1 ps 0.00 0.01
75 sh 0.00 0.00
13 gcc 0.00 0.00
11 modprobe 0.00 0.00
8 gsettings 0.00 0.00
8 systemd-detect- 0.00 0.00
7 stat 0.00 0.00
6 llvm-link 0.00 0.00
5 gmain 0.00 0.00
4 phoronix-test-s 0.00 0.00
2 dconf worker 0.00 0.00
2 which 0.00 0.00
1 cc 0.00 0.00
1 date 0.00 0.00
1 dirname 0.00 0.00
1 grep 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lscpu 0.00 0.00
1 mktemp 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sed 0.00 0.00
1 sort 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
1 xset 0.00 0.00
32 processes running
47 maximum processes
Core computation blocks start in parallel
22116) xmrig cpu=5 start=6.18 finish=272.99
22117) xmrig cpu=15 start=6.18 finish=272.84
22118) xmrig cpu=8 start=6.19 finish=272.84
22119) xmrig cpu=9 start=6.19 finish=272.84
22120) xmrig cpu=14 start=6.19 finish=272.84
22121) xmrig cpu=12 start=6.19 finish=272.84
22122) xmrig cpu=2 start=6.19 finish=272.84
22125) xmrig cpu=11 start=6.58 finish=9.08
22126) xmrig cpu=4 start=6.58 finish=9.10
22127) xmrig cpu=5 start=6.58 finish=9.09
22128) xmrig cpu=14 start=6.58 finish=9.10
22129) xmrig cpu=7 start=6.58 finish=9.09
22130) xmrig cpu=0 start=6.58 finish=9.10
22131) xmrig cpu=9 start=6.58 finish=9.07
22132) xmrig cpu=10 start=6.58 finish=9.07
22133) xmrig cpu=3 start=6.58 finish=9.08
22134) xmrig cpu=12 start=6.58 finish=9.10
22135) xmrig cpu=13 start=6.58 finish=9.09
22136) xmrig cpu=6 start=6.58 finish=9.10
22137) xmrig cpu=15 start=6.58 finish=9.09
22138) xmrig cpu=8 start=6.58 finish=9.10
22139) xmrig cpu=1 start=6.58 finish=9.06
22140) xmrig cpu=2 start=6.58 finish=9.07
22123) sh cpu=3 start=6.19 finish=6.20
22124) modprobe cpu=5 start=6.20 finish=6.20
22141) xmrig cpu=0 start=9.10 finish=272.84
22142) xmrig cpu=13 start=9.10 finish=272.84
22143) xmrig cpu=6 start=9.10 finish=272.84
22144) xmrig cpu=7 start=9.10 finish=272.84
22145) xmrig cpu=3 start=9.10 finish=272.84
22146) xmrig cpu=2 start=9.10 finish=272.84
22147) xmrig cpu=10 start=9.11 finish=272.84
22148) xmrig cpu=15 start=9.11 finish=272.84
22149) xmrig cpu=9 start=9.11 finish=272.84
22150) xmrig cpu=12 start=9.11 finish=272.84
22151) xmrig cpu=11 start=9.11 finish=272.84
22152) xmrig cpu=5 start=9.11 finish=272.84
22153) xmrig cpu=14 start=9.11 finish=272.84
22154) xmrig cpu=1 start=9.11 finish=272.84
22155) xmrig cpu=4 start=9.11 finish=272.84
22156) xmrig cpu=8 start=9.11 finish=272.84
