A test of eight different mutex operations. These look to be single-threaded.

Topdown profile shows a high retirement rate overall and some backend stalls depending on the operator.

AMD metrics confirm ~0.8 cores of activity and a high retirement rate. There is no floating point or l2 access. The opcache miss rate is very low and the icache access is also low.
elapsed 768.246
on_cpu 0.051 # 0.81 / 16 cores
utime 624.890
stime 0.979
nvcsw 2235 # 46.24%
nivcsw 2598 # 53.76%
inblock 0 # 0.00/sec
onblock 13848 # 18.03/sec
cpu-clock 625977324320 # 625.977 seconds
task-clock 625986101516 # 625.986 seconds
page faults 164393 # 262.614/sec
context switches 8443 # 13.488/sec
cpu migrations 304 # 0.486/sec
major page faults 2 # 0.003/sec
minor page faults 164391 # 262.611/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 1592406098562 # 184.609 branches per 1000 inst
branch misses 241102970 # 0.02% branch miss
conditional 651535715124 # 75.533 conditional branches per 1000 inst
indirect 46735009277 # 5.418 indirect branches per 1000 inst
cpu-cycles 2897158648882 # 0.24 GHz
instructions 8622339610405 # 2.98 IPC
slots 5800859735712 #
retiring 3253046943596 # 56.1% (56.1%) high
-- ucode 24462969586 # 0.4%
-- fastpath 3228583974010 # 55.7%
frontend 873341239219 # 15.1% (15.1%)
-- latency 433757472954 # 7.5%
-- bandwidth 439583766265 # 7.6%
backend 1577324717331 # 27.2% (27.2%)
-- cpu 150783304857 # 2.6%
-- memory 1426541412474 # 24.6%
speculation 96796569505 # 1.7% ( 1.7%)
-- branch mispredict 12546909414 # 0.2%
-- pipeline restart 84249660091 # 1.5%
smt-contention 349845427 # 0.0% ( 0.0%)
cpu-cycles 2895380779245 # 0.24 GHz
instructions 8625321930524 # 2.98 IPC
instructions 2877430346634 # 0.054 l2 access per 1000 inst
l2 hit from l1 135717532 # 17.98% l2 miss
l2 miss from l1 17906204 #
l2 hit from l2 pf 8493747 #
l3 hit from l2 pf 4915787 #
l3 miss from l2 pf 4856894 #
instructions 2875652905685 # 0.018 float per 1000 inst
float 512 92 # 0.000 AVX-512 per 1000 inst
float 256 596 # 0.000 AVX-256 per 1000 inst
float 128 50502963 # 0.018 AVX-128 per 1000 inst
float MMX 0 # 0.000 MMX per 1000 inst
float scalar 0 # 0.000 scalar per 1000 inst
instructions 8625751799175 #
opcache 1754552237838 # 203.409 opcache per 1000 inst
opcache miss 3140782157 # 0.2% opcache miss rate
l1 dTLB miss 30325200 # 0.004 L1 dTLB per 1000 inst
l2 dTLB miss 5222148 # 0.001 L2 dTLB per 1000 inst
instructions 8976029807411 #
icache 4197245078 # 0.468 icache per 1000 inst
icache miss 220014730 # 5.2% icache miss rate
l1 iTLB miss 9439540 # 0.001 L1 iTLB per 1000 inst
l2 iTLB miss 0 # 0.000 L2 iTLB per 1000 inst
tlb flush 20437 # 0.000 TLB flush per 1000 inst
Intel metrics show backend stalls as store-bound.
elapsed 1009.781
on_cpu 0.054 # 0.86 / 16 cores
utime 867.884
stime 0.576
nvcsw 2071 # 33.11%
nivcsw 4184 # 66.89%
inblock 8 # 0.01/sec
onblock 2496 # 2.47/sec
cpu-clock 868580465423 # 868.580 seconds
task-clock 868590350103 # 868.590 seconds
page faults 156145 # 179.768/sec
context switches 11080 # 12.756/sec
cpu migrations 440 # 0.507/sec
major page faults 0 # 0.000/sec
minor page faults 156145 # 179.768/sec
alignment faults 0 # 0.000/sec
emulation faults 0 # 0.000/sec
branches 1592194973756 # 184.595 branches per 1000 inst
branch misses 22779649 # 0.00% branch miss
conditional 1592194987388 # 184.595 conditional branches per 1000 inst
indirect 46747423365 # 5.420 indirect branches per 1000 inst
slots 27528185511464 #
retiring 13766928686759 # 50.0% (50.0%)
-- ucode 2564852441369 # 9.3%
-- fastpath 11202076245390 # 40.7%
frontend 1454939362848 # 5.3% ( 5.3%)
-- latency 628410579732 # 2.3%
-- bandwidth 826528783116 # 3.0%
backend 12177694657167 # 44.2% (44.2%)
-- cpu 2595455796043 # 9.4%
-- memory 9582238861124 # 34.8%
speculation 5619303554 # 0.0% ( 0.0%) low
-- branch mispredict 3012286657 # 0.0%
-- pipeline restart 2607016897 # 0.0%
smt-contention 0 # 0.0% ( 0.0%)
cpu-cycles 3362261398526 # 0.20 GHz
instructions 8780623521865 # 2.61 IPC
l2 access 348782437 # 0.040 l2 access per 1000 inst
l2 miss 111403109 # 31.94% l2 miss
cpu-cycles 3289745122103 # 38.2% memory latency
load stalls 649290423321 # 19.7% l1 bound
l1 miss 1067015600 # 0.0% l2 bound
l2 miss 459914057 # 0.0% l3 bound
l3 miss 254176849 # 0.0% dram bound
store_stalls 608692073488 # 18.5% store bound
Process overview
410 processes
24 BenchmarkMutex 622.88 0.00
68 clinfo 17.12 5.66
38 vulkaninfo 1.12 1.52
4 vulkani:disk$0 0.11 0.16
6 php 0.10 0.24
6 glxinfo:gdrv0 0.07 0.10
6 glxinfo:gl0 0.07 0.10
2 llvmpipe-0 0.06 0.08
2 llvmpipe-1 0.06 0.08
2 llvmpipe-10 0.06 0.08
2 llvmpipe-11 0.06 0.08
2 llvmpipe-12 0.06 0.08
2 llvmpipe-13 0.06 0.08
2 llvmpipe-14 0.06 0.08
2 llvmpipe-15 0.06 0.08
2 llvmpipe-2 0.06 0.08
2 llvmpipe-3 0.06 0.08
2 llvmpipe-4 0.06 0.08
2 llvmpipe-5 0.06 0.08
2 llvmpipe-6 0.06 0.08
2 llvmpipe-7 0.06 0.08
2 llvmpipe-8 0.06 0.08
2 llvmpipe-9 0.06 0.08
2 glxinfo 0.05 0.04
2 glxinfo:cs0 0.05 0.04
2 glxinfo:disk$0 0.05 0.04
2 glxinfo:sh0 0.05 0.04
2 glxinfo:shlo0 0.05 0.04
6 clang 0.04 0.08
1 lspci 0.00 0.02
1 ps 0.00 0.01
96 sh 0.00 0.00
24 mutex 0.00 0.00
13 gcc 0.00 0.00
12 gsettings 0.00 0.00
8 stat 0.00 0.00
8 systemd-detect- 0.00 0.00
6 llvm-link 0.00 0.00
5 phoronix-test-s 0.00 0.00
3 gmain 0.00 0.00
3 rocminfo 0.00 0.00
2 cc 0.00 0.00
2 lscpu 0.00 0.00
2 uname 0.00 0.00
2 which 0.00 0.00
2 xset 0.00 0.00
1 date 0.00 0.00
1 dconf worker 0.00 0.00
1 dirname 0.00 0.00
1 dmesg 0.00 0.00
1 dmidecode 0.00 0.00
1 grep 0.00 0.00
1 ifconfig 0.00 0.00
1 ip 0.00 0.00
1 lsmod 0.00 0.00
1 mktemp 0.00 0.00
1 qdbus 0.00 0.00
1 readlink 0.00 0.00
1 realpath 0.00 0.00
1 sed 0.00 0.00
1 sort 0.00 0.00
1 stty 0.00 0.00
1 systemctl 0.00 0.00
1 template.sh 0.00 0.00
1 wc 0.00 0.00
1 xrandr 0.00 0.00
0 processes running
47 maximum processes
Computation blocks
1044464) mutex cpu=6 start=6.49 finish=22.02
1044465) BenchmarkMutex cpu=1 start=6.50 finish=22.02
1044468) mutex cpu=6 start=26.03 finish=41.57
1044469) BenchmarkMutex cpu=7 start=26.03 finish=41.57
1044470) mutex cpu=7 start=45.57 finish=61.15
1044471) BenchmarkMutex cpu=8 start=45.58 finish=61.15
1044472) sh cpu=14 start=61.15 finish=61.15
1044473) sh cpu=7 start=61.15 finish=61.15
1044474) mutex cpu=0 start=71.35 finish=170.56
1044475) BenchmarkMutex cpu=1 start=71.35 finish=170.56
1044476) mutex cpu=0 start=174.57 finish=273.25
1044477) BenchmarkMutex cpu=10 start=174.57 finish=273.25
1044612) mutex cpu=0 start=277.25 finish=374.00
1044613) BenchmarkMutex cpu=1 start=277.26 finish=374.00
1044616) sh cpu=1 start=374.00 finish=374.00
1044617) sh cpu=2 start=374.00 finish=374.00
1044618) mutex cpu=6 start=384.42 finish=397.09
1044619) BenchmarkMutex cpu=7 start=384.42 finish=397.09
1044620) mutex cpu=15 start=401.10 finish=413.73
1044621) BenchmarkMutex cpu=0 start=401.10 finish=413.73
1044624) mutex cpu=6 start=417.73 finish=430.36
1044625) BenchmarkMutex cpu=15 start=417.73 finish=430.36
1044626) sh cpu=15 start=430.36 finish=430.36
1044627) sh cpu=3 start=430.36 finish=430.36
1044628) mutex cpu=1 start=440.56 finish=455.14
1044629) BenchmarkMutex cpu=10 start=440.57 finish=455.14
1044630) mutex cpu=9 start=459.14 finish=473.68
1044631) BenchmarkMutex cpu=2 start=459.14 finish=473.68
1044632) mutex cpu=1 start=477.68 finish=492.46
1044633) BenchmarkMutex cpu=2 start=477.69 finish=492.46
1044634) sh cpu=2 start=492.46 finish=492.46
1044635) sh cpu=3 start=492.46 finish=492.46
1044636) mutex cpu=1 start=503.44 finish=511.86
1044637) BenchmarkMutex cpu=2 start=503.45 finish=511.85
1044640) mutex cpu=9 start=515.86 finish=524.24
1044641) BenchmarkMutex cpu=2 start=515.86 finish=524.24
1044642) mutex cpu=9 start=528.24 finish=536.62
1044643) BenchmarkMutex cpu=10 start=528.24 finish=536.62
1044644) sh cpu=1 start=536.62 finish=536.62
1044645 1044646) mutex cpu=1 start=547.36 finish=568.53
1044647) BenchmarkMutex cpu=2 start=547.36 finish=568.53
1044650) mutex cpu=1 start=572.53 finish=593.61
1044651) BenchmarkMutex cpu=2 start=572.53 finish=593.61
1044652) mutex cpu=9 start=597.62 finish=618.78
1044653) BenchmarkMutex cpu=2 start=597.62 finish=618.78
1044654) sh cpu=9 start=618.79 finish=618.79
1044655) sh cpu=3 start=618.79 finish=618.79
1044656) mutex cpu=1 start=629.03 finish=656.63
1044657) BenchmarkMutex cpu=2 start=629.03 finish=656.63
1044658) mutex cpu=1 start=660.64 finish=688.22
1044659) BenchmarkMutex cpu=2 start=660.64 finish=688.21
1044663) mutex cpu=9 start=692.22 finish=719.79
1044664) BenchmarkMutex cpu=10 start=692.22 finish=719.79
1044665) sh cpu=9 start=719.79 finish=719.79
1044666) sh cpu=10 start=719.79 finish=719.79
1044667) mutex cpu=1 start=730.00 finish=739.59
1044668) BenchmarkMutex cpu=10 start=730.00 finish=739.59
1044669) mutex cpu=1 start=743.60 finish=753.15
1044670) BenchmarkMutex cpu=2 start=743.60 finish=753.15
1044671) mutex cpu=2 start=757.16 finish=766.73
1044672) BenchmarkMutex cpu=3 start=757.16 finish=766.73
