A distributed memory object caching system. There are five tests. These run on AMD system but appeared to hang on my Intel system. Overall a high level of interrupts.

Topdown profile shows a high level of frontend stalls and lower backend stalls.

AMD metrics show little floating point. There is a high opcache miss rate and relatively high icache miss rate.

elapsed              1084.896
on_cpu               0.389          # 6.22 / 16 cores
utime                1805.389
stime                4939.507
nvcsw                222708743      # 99.61%
nivcsw               879355         # 0.39%
inblock              0              # 0.00/sec
onblock              13520          # 12.46/sec
cpu-clock            14028143035597 # 14028.143 seconds
task-clock           14037521121094 # 14037.521 seconds
page faults          414637         # 29.538/sec
context switches     452028530      # 32201.450/sec
cpu migrations       13908050       # 990.777/sec
major page faults    4              # 0.000/sec
minor page faults    414633         # 29.537/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             7951525581786  # 201.057 branches per 1000 inst
branch misses        699053211919   # 8.79% branch miss
conditional          4336195723529  # 109.642 conditional branches per 1000 inst
indirect             190468268236   # 4.816 indirect branches per 1000 inst
cpu-cycles           55276954057609 # 3.19 GHz
instructions         40357397448074 # 0.73 IPC
slots                109074132046182 #
retiring             14946314860049 # 13.7% (15.0%)
-- ucode             84759708463    #     0.1%
-- fastpath          14861555151586 #    13.6%
frontend             72590179804234 # 66.6% (72.8%) high
-- latency           61583813788416 #    56.5%
-- bandwidth         11006366015818 #    10.1%
backend              11812472983875 # 10.8% (11.8%) low
-- cpu               1924685512382  #     1.8%
-- memory            9887787471493  #     9.1%
speculation          405394937992   #  0.4% ( 0.4%) low
-- branch mispredict 404417600603   #     0.4%
-- pipeline restart  977337389      #     0.0%
smt-contention       9314599861603  #  8.5% ( 0.0%)
cpu-cycles           55180514514939 # 3.18 GHz
instructions         40266935720025 # 0.73 IPC
instructions         13262256645581 # 96.029 l2 access per 1000 inst
l2 hit from l1       1230269784634  # 3.68% l2 miss
l2 miss from l1      35338152294    #
l2 hit from l2 pf    31699410997    #
l3 hit from l2 pf    11257846706    #
l3 miss from l2 pf   331933190      #
instructions         13269290387328 # 12.628 float per 1000 inst
float 512            110            # 0.000 AVX-512 per 1000 inst
float 256            378            # 0.000 AVX-256 per 1000 inst
float 128            167564373522   # 12.628 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst
instructions         40133117880873 #
opcache              10528047887039 # 262.328 opcache per 1000 inst
opcache miss         6905745934303  # 65.6% opcache miss rate
l1 dTLB miss         190180493018   # 4.739 L1 dTLB per 1000 inst
l2 dTLB miss         24651135861    # 0.614 L2 dTLB per 1000 inst
instructions         39830954865181 #
icache               10343658357264 # 259.689 icache per 1000 inst
icache miss          2793707140548  # 27.0% icache miss rate
l1 iTLB miss         83597661651    # 2.099 L1 iTLB per 1000 inst
l2 iTLB miss         0              # 0.000 L2 iTLB per 1000 inst
tlb flush            407944934      # 0.010 TLB flush per 1000 inst

Process summary shows memtier_benchmark as the primary driver process.

971 processes
	255 memtier_benchma      32375.71 83025.09
	240 mc-worker            19305.92 97636.80
	 30 memcached             1206.62  6102.30
	 15 mc-assocmaint         1206.62  6102.30
	 15 mc-itemcrawler        1206.62  6102.30
	 15 mc-log                1206.62  6102.30
	 15 mc-lrumaint           1206.62  6102.30
	 15 mc-slabmaint          1206.62  6102.30
	 68 clinfo                  16.22     6.33
	 38 vulkaninfo               1.33     1.15
	  4 vulkani:disk$0           0.14     0.12
	  6 php                      0.08     0.24
	  2 llvmpipe-0               0.07     0.06
	  2 llvmpipe-1               0.07     0.06
	  2 llvmpipe-10              0.07     0.06
	  2 llvmpipe-11              0.07     0.06
	  2 llvmpipe-12              0.07     0.06
	  2 llvmpipe-13              0.07     0.06
	  2 llvmpipe-14              0.07     0.06
	  2 llvmpipe-15              0.07     0.06
	  2 llvmpipe-2               0.07     0.06
	  2 llvmpipe-3               0.07     0.06
	  2 llvmpipe-4               0.07     0.06
	  2 llvmpipe-5               0.07     0.06
	  2 llvmpipe-6               0.07     0.06
	  2 llvmpipe-7               0.07     0.06
	  2 llvmpipe-8               0.07     0.06
	  2 llvmpipe-9               0.07     0.06
	  6 glxinfo:gdrv0            0.06     0.12
	  6 glxinfo:gl0              0.06     0.12
	  6 clang                    0.06     0.06
	  2 glxinfo                  0.04     0.05
	  2 glxinfo:cs0              0.04     0.05
	  2 glxinfo:disk$0           0.04     0.05
	  2 glxinfo:sh0              0.04     0.04
	  2 glxinfo:shlo0            0.04     0.04
	  1 lspci                    0.01     0.01
	  3 rocminfo                 0.00     0.03
	  1 ps                       0.00     0.01
	 90 sh                       0.00     0.00
	 15 sleep                    0.00     0.00
	 13 gcc                      0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  7 gsettings                0.00     0.00
	  6 llvm-link                0.00     0.00
	  5 gmain                    0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  4 dconf worker             0.00     0.00
	  2 cc                       0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 date                     0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sed                      0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
49 maximum processes

Process summary shows a collection of worker threads started on each core and then a set of benchmark threads started six seconds later.

      988848) memcached        cpu=12 start=5.73  finish=71.75
        988849) memcached        cpu=15 start=5.73  finish=72.03
          988851) mc-log           cpu=2 start=5.74  finish=72.03
          988852) mc-worker        cpu=8 start=5.76  finish=72.03
          988853) mc-worker        cpu=13 start=5.76  finish=72.03
          988854) mc-worker        cpu=0 start=5.76  finish=72.03
          988855) mc-worker        cpu=3 start=5.76  finish=72.03
          988856) mc-worker        cpu=9 start=5.76  finish=72.03
          988857) mc-worker        cpu=14 start=5.76  finish=72.03
          988858) mc-worker        cpu=10 start=5.76  finish=72.03
          988859) mc-worker        cpu=4 start=5.76  finish=72.03
          988860) mc-worker        cpu=5 start=5.76  finish=72.03
          988861) mc-worker        cpu=12 start=5.76  finish=72.03
          988862) mc-worker        cpu=10 start=5.76  finish=72.03
          988863) mc-worker        cpu=7 start=5.76  finish=72.03
          988864) mc-worker        cpu=11 start=5.76  finish=72.03
          988865) mc-worker        cpu=6 start=5.76  finish=72.03
          988866) mc-worker        cpu=7 start=5.76  finish=72.03
          988867) mc-worker        cpu=1 start=5.76  finish=72.03
          988868) mc-assocmaint    cpu=1 start=5.76  finish=72.03
          988869) mc-itemcrawler   cpu=15 start=5.76  finish=72.03
          988870) mc-lrumaint      cpu=3 start=5.76  finish=72.03
          988871) mc-slabmaint     cpu=5 start=5.76  finish=72.03
        988850) sleep            cpu=15 start=5.73  finish=11.73
        988872) memtier_benchma  cpu=12 start=11.74 finish=71.75
          988873) memtier_benchma  cpu=11 start=11.74 finish=71.74
          988874) memtier_benchma  cpu=15 start=11.74 finish=71.74
          988875) memtier_benchma  cpu=1 start=11.74 finish=71.74
          988876) memtier_benchma  cpu=15 start=11.74 finish=71.74
          988877) memtier_benchma  cpu=5 start=11.74 finish=71.74
          988878) memtier_benchma  cpu=1 start=11.74 finish=71.74
          988879) memtier_benchma  cpu=13 start=11.74 finish=71.74
          988880) memtier_benchma  cpu=9 start=11.74 finish=71.74
          988881) memtier_benchma  cpu=12 start=11.74 finish=71.74
          988882) memtier_benchma  cpu=8 start=11.74 finish=71.74
          988883) memtier_benchma  cpu=13 start=11.74 finish=71.74
          988884) memtier_benchma  cpu=14 start=11.74 finish=71.74
          988885) memtier_benchma  cpu=5 start=11.74 finish=71.74
          988886) memtier_benchma  cpu=0 start=11.74 finish=71.74
          988887) memtier_benchma  cpu=6 start=11.74 finish=71.74
          988888) memtier_benchma  cpu=10 start=11.74 finish=71.74