uvg266 is a video encoder for VVC/H.266 based on Kvazaar. There are 10 workloads. Overall slightly lower IPC but most metrics are similar.

Topdown overview

AMD metrics

elapsed              1226.459
on_cpu               0.789          # 12.63 / 16 cores
utime                15210.235
stime                276.940
nvcsw                20082024       # 61.17%
nivcsw               12746891       # 38.83%
inblock              0              # 0.00/sec
onblock              22496          # 18.34/sec
cpu-clock            15487261555793 # 15487.262 seconds
task-clock           15490033937969 # 15490.034 seconds
page faults          14118358       # 911.448/sec
context switches     32834801       # 2119.737/sec
cpu migrations       777121         # 50.169/sec
major page faults    2              # 0.000/sec
minor page faults    14118356       # 911.448/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             9488374461340  # 84.697 branches per 1000 inst
branch misses        176580290813   # 1.86% branch miss
conditional          7372985440727  # 65.814 conditional branches per 1000 inst
indirect             476354889281   # 4.252 indirect branches per 1000 inst
cpu-cycles           59222760227158 # 3.02 GHz
instructions         112113224173244 # 1.89 IPC
slots                118341567323460 #
retiring             39571594319069 # 33.4% (50.0%)
-- ucode             99400535036    #     0.1%
-- fastpath          39472193784033 #    33.4%
frontend             18606899854651 # 15.7% (23.5%)
-- latency           11404823371602 #     9.6%
-- bandwidth         7202076483049  #     6.1%
backend              18508891525338 # 15.6% (23.4%)
-- cpu               6387284955523  #     5.4%
-- memory            12121606569815 #    10.2%
speculation          2443403174228  #  2.1% ( 3.1%)
-- branch mispredict 2351990600409  #     2.0%
-- pipeline restart  91412573819    #     0.1%
smt-contention       39209393842876 # 33.1% ( 0.0%)
cpu-cycles           59233020683832 # 3.02 GHz
instructions         112104812727986 # 1.89 IPC
instructions         37336699054165 # 39.285 l2 access per 1000 inst
l2 hit from l1       1124260604785  # 5.14% l2 miss
l2 miss from l1      41199565892    #
l2 hit from l2 pf    308374602830   #
l3 hit from l2 pf    25687086559    #
l3 miss from l2 pf   8446252065     #
instructions         37327978214966 # 123.040 float per 1000 inst
float 512            106            # 0.000 AVX-512 per 1000 inst
float 256            356            # 0.000 AVX-256 per 1000 inst
float 128            4592836790618  # 123.040 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst

Intel metrics

elapsed              2006.258
on_cpu               0.773          # 12.36 / 16 cores
utime                24527.891
stime                278.416
nvcsw                26498601       # 77.59%
nivcsw               7653750        # 22.41%
inblock              18235080       # 9089.10/sec
onblock              17968          # 8.96/sec
cpu-clock            24807513144133 # 24807.513 seconds
task-clock           24812450667548 # 24812.451 seconds
page faults          19015950       # 766.387/sec
context switches     34162087       # 1376.812/sec
cpu migrations       1480974        # 59.687/sec
major page faults    73             # 0.003/sec
minor page faults    19015877       # 766.384/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             10988043510016 # 83.506 branches per 1000 inst
branch misses        202479273078   # 1.84% branch miss
conditional          10988043555712 # 83.506 conditional branches per 1000 inst
indirect             3409696603840  # 25.913 indirect branches per 1000 inst
slots                119227881800384 #
retiring             72354003596444 # 60.7% (60.7%)
-- ucode             6177833054651  #     5.2%
-- fastpath          66176170541793 #    55.5%
frontend             28456843609843 # 23.9% (23.9%)
-- latency           12151341821410 #    10.2%
-- bandwidth         16305501788433 #    13.7%
backend              8926358411337  #  7.5% ( 7.5%)
-- cpu               5078008768688  #     4.3%
-- memory            3848349642649  #     3.2%
speculation          9809590154881  #  8.2% ( 8.2%)
-- branch mispredict 9387525599800  #     7.9%
-- pipeline restart  422064555081   #     0.4%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           77735101284795 # 2.39 GHz
instructions         153645864074988 # 1.98 IPC
l2 access            1975746574306  # 26.935 l2 access per 1000 inst
l2 miss              257360554948   # 13.03% l2 miss

Process overview

930 processes
	570 uvg266               273549.88  4020.09
	 68 clinfo                  16.54     6.32
	 38 vulkaninfo               0.76     1.50
	  6 php                      0.16     0.23
	  6 glxinfo:gdrv0            0.16     0.04
	  4 vulkani:disk$0           0.08     0.16
	  2 glxinfo                  0.08     0.02
	  2 glxinfo:cs0              0.08     0.02
	  2 glxinfo:disk$0           0.08     0.02
	  2 glxinfo:sh0              0.08     0.02
	  2 glxinfo:shlo0            0.08     0.02
	  6 clang                    0.06     0.06
	  2 llvmpipe-0               0.04     0.08
	  2 llvmpipe-1               0.04     0.08
	  2 llvmpipe-10              0.04     0.08
	  2 llvmpipe-11              0.04     0.08
	  2 llvmpipe-12              0.04     0.08
	  2 llvmpipe-13              0.04     0.08
	  2 llvmpipe-14              0.04     0.08
	  2 llvmpipe-15              0.04     0.08
	  2 llvmpipe-2               0.04     0.08
	  2 llvmpipe-3               0.04     0.08
	  2 llvmpipe-4               0.04     0.08
	  2 llvmpipe-5               0.04     0.08
	  2 llvmpipe-6               0.04     0.08
	  2 llvmpipe-7               0.04     0.08
	  2 llvmpipe-8               0.04     0.08
	  2 llvmpipe-9               0.04     0.08
	  3 rocminfo                 0.03     0.00
	  1 lspci                    0.01     0.01
	  1 ps                       0.00     0.01
	100 sh                       0.00     0.00
	 13 gcc                      0.00     0.00
	 12 gsettings                0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 llvm-link                0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  3 gmain                    0.00     0.00
	  2 cc                       0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 date                     0.00     0.00
	  1 dconf worker             0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sed                      0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
47 maximum processes

Core computation structure with one thread per thread.

      238433) uvg266           cpu=2 start=5.83  finish=111.07
        238434) uvg266           cpu=8 start=5.83  finish=111.05
          238435) uvg266           cpu=5 start=5.83  finish=111.05
          238436) uvg266           cpu=2 start=5.83  finish=111.05
          238437) uvg266           cpu=9 start=5.83  finish=111.05
          238438) uvg266           cpu=1 start=5.83  finish=111.05
          238439) uvg266           cpu=6 start=5.83  finish=111.05
          238440) uvg266           cpu=14 start=5.83  finish=111.05
          238441) uvg266           cpu=13 start=5.83  finish=111.05
          238442) uvg266           cpu=10 start=5.83  finish=111.05
          238443) uvg266           cpu=4 start=5.83  finish=111.05
          238444) uvg266           cpu=8 start=5.83  finish=111.05
          238445) uvg266           cpu=15 start=5.83  finish=111.05
          238446) uvg266           cpu=3 start=5.83  finish=111.05
          238447) uvg266           cpu=11 start=5.83  finish=111.05
          238448) uvg266           cpu=7 start=5.83  finish=111.05
          238449) uvg266           cpu=0 start=5.83  finish=111.05
          238450) uvg266           cpu=12 start=5.83  finish=111.05
          238451) uvg266           cpu=0 start=5.86  finish=108.68