Test to encrypt a 2.7GB file with GnuPG. Looks like a single-threaded program that runs in about a minute.

Topdown profile shows speculation stalls as particularly high and backend stalls as low.

AMD metrics confirm a single-threaded program. Little floating point and little L2 access. Not many indirect branches but still very high branch mis-prediction. Frontend stalls are more latency than bandwidth

elapsed              169.011
on_cpu               0.057          # 0.92 / 16 cores
utime                151.248
stime                3.799
nvcsw                2031           # 72.85%
nivcsw               757            # 27.15%
inblock              0              # 0.00/sec
onblock              4207744        # 24896.30/sec
cpu-clock            155083495851   # 155.083 seconds
task-clock           155087202998   # 155.087 seconds
page faults          148931         # 960.305/sec
context switches     3455           # 22.278/sec
cpu migrations       256            # 1.651/sec
major page faults    2              # 0.013/sec
minor page faults    148929         # 960.292/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             204801389499   # 146.104 branches per 1000 inst
branch misses        14376986247    # 7.02% branch miss
conditional          191213671302   # 136.411 conditional branches per 1000 inst
indirect             142014949      # 0.101 indirect branches per 1000 inst
cpu-cycles           715665699465   # 0.27 GHz
instructions         1400096299132  # 1.96 IPC
slots                1434267036432  #
retiring             445645996834   # 31.1% (31.1%)
-- ucode             47579507       #     0.0%
-- fastpath          445598417327   #    31.1%
frontend             473772218359   # 33.0% (33.0%)
-- latency           248750339964   #    17.3%
-- bandwidth         225021878395   #    15.7%
backend              214028036606   # 14.9% (14.9%) low
-- cpu               63033651425    #     4.4%
-- memory            150994385181   #    10.5%
speculation          300713825870   # 21.0% (21.0%) high
-- branch mispredict 300208618021   #    20.9%
-- pipeline restart  505207849      #     0.0%
smt-contention       106654498      #  0.0% ( 0.0%)
cpu-cycles           718195890990   # 0.27 GHz
instructions         1400152976411  # 1.95 IPC
instructions         467215406957   # 17.124 l2 access per 1000 inst
l2 hit from l1       4745977338     # 1.02% l2 miss
l2 miss from l1      31050704       #
l2 hit from l2 pf    3203773986     #
l3 hit from l2 pf    17341357       #
l3 miss from l2 pf   33443160       #
instructions         467198274463   # 23.010 float per 1000 inst
float 512            43             # 0.000 AVX-512 per 1000 inst
float 256            654            # 0.000 AVX-256 per 1000 inst
float 128            10750115468    # 23.010 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst

Intel metrics

elapsed              205.884
on_cpu               0.058          # 0.93 / 16 cores
utime                189.745
stime                2.491
nvcsw                1952           # 70.17%
nivcsw               830            # 29.83%
inblock              0              # 0.00/sec
onblock              4196488        # 20382.77/sec
cpu-clock            192257964300   # 192.258 seconds
task-clock           192261256286   # 192.261 seconds
page faults          138112         # 718.356/sec
context switches     3632           # 18.891/sec
cpu migrations       281            # 1.462/sec
major page faults    0              # 0.000/sec
minor page faults    138112         # 718.356/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             203656765790   # 145.850 branches per 1000 inst
branch misses        14459480379    # 7.10% branch miss
conditional          203656778046   # 145.850 conditional branches per 1000 inst
indirect             146325163      # 0.105 indirect branches per 1000 inst
slots                4363710725378  #
retiring             1314816821795  # 30.1% (30.1%)
-- ucode             86447683819    #     2.0%
-- fastpath          1228369137976  #    28.1%
frontend             819185548728   # 18.8% (18.8%)
-- latency           377630877144   #     8.7%
-- bandwidth         441554671584   #    10.1%
backend              510111406692   # 11.7% (11.7%) low
-- cpu               416279869681   #     9.5%
-- memory            93831537011    #     2.2%
speculation          1725289697089  # 39.5% (39.5%) high
-- branch mispredict 1713750484505  #    39.3%
-- pipeline restart  11539212584    #     0.3%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           727399626683   # 0.22 GHz
instructions         1396330348949  # 1.92 IPC
l2 access            11452117161    # 8.202 l2 access per 1000 inst
l2 miss              432509657      # 3.78% l2 miss

Process overview shows the gpg process and rest is test system overhead.

363 processes
	  3 gpg                    149.76     1.32
	 68 clinfo                  17.20     5.72
	 38 vulkaninfo               0.95     1.33
	  6 glxinfo:gdrv0            0.15     0.04
	  6 glxinfo:gl0              0.15     0.04
	  4 vulkani:disk$0           0.10     0.14
	  6 clang                    0.08     0.04
	  2 glxinfo                  0.07     0.02
	  2 glxinfo:cs0              0.07     0.02
	  2 glxinfo:disk$0           0.07     0.02
	  2 glxinfo:sh0              0.07     0.02
	  2 glxinfo:shlo0            0.07     0.02
	  6 php                      0.05     0.08
	  2 llvmpipe-0               0.05     0.07
	  2 llvmpipe-1               0.05     0.07
	  2 llvmpipe-10              0.05     0.07
	  2 llvmpipe-11              0.05     0.07
	  2 llvmpipe-12              0.05     0.07
	  2 llvmpipe-13              0.05     0.07
	  2 llvmpipe-14              0.05     0.07
	  2 llvmpipe-15              0.05     0.07
	  2 llvmpipe-2               0.05     0.07
	  2 llvmpipe-3               0.05     0.07
	  2 llvmpipe-4               0.05     0.07
	  2 llvmpipe-5               0.05     0.07
	  2 llvmpipe-6               0.05     0.07
	  2 llvmpipe-7               0.05     0.07
	  2 llvmpipe-8               0.05     0.07
	  2 llvmpipe-9               0.05     0.07
	  3 rocminfo                 0.03     0.00
	  1 dd                       0.00     1.72
	  1 rm                       0.00     0.25
	  1 lspci                    0.00     0.02
	 84 sh                       0.00     0.00
	 13 gcc                      0.00     0.00
	  9 gsettings                0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 gnupg                    0.00     0.00
	  6 llvm-link                0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  4 gmain                    0.00     0.00
	  3 dconf worker             0.00     0.00
	  2 bash                     0.00     0.00
	  2 cc                       0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 date                     0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 ps                       0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sed                      0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
0 processes running
47 maximum processes

The core computation sections

      1173365) gnupg            cpu=4 start=7.23  finish=57.56
        1173366) gnupg            cpu=5 start=7.24  finish=7.24 
        1173367) gpg              cpu=14 start=7.24  finish=57.56
      1173372) gnupg            cpu=13 start=61.57 finish=111.85
        1173373) gnupg            cpu=14 start=61.57 finish=61.57
        1173374) gpg              cpu=8 start=61.57 finish=111.85
      1173376) gnupg            cpu=4 start=115.86 finish=166.37
        1173377) gnupg            cpu=5 start=115.86 finish=115.86
        1173378) gpg              cpu=14 start=115.86 finish=166.37