The GNU library for complex numbers. This is a single-threaded program.

Topdown profile has a high retirement rate

AMD metrics show little floating point and not much L2 access

elapsed              166.758
on_cpu               0.057          # 0.92 / 16 cores
utime                151.162
stime                1.479
nvcsw                2138           # 71.36%
nivcsw               858            # 28.64%
inblock              0              # 0.00/sec
onblock              16504          # 98.97/sec
cpu-clock            152672415749   # 152.672 seconds
task-clock           152676091847   # 152.676 seconds
page faults          557051         # 3648.580/sec
context switches     3615           # 23.678/sec
cpu migrations       323            # 2.116/sec
major page faults    2              # 0.013/sec
minor page faults    557049         # 3648.567/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             274437965873   # 124.238 branches per 1000 inst
branch misses        2307028641     # 0.84% branch miss
conditional          177481790336   # 80.346 conditional branches per 1000 inst
indirect             27117521790    # 12.276 indirect branches per 1000 inst
cpu-cycles           706513748986   # 0.26 GHz
instructions         2212572685131  # 3.13 IPC high
slots                1415733915426  #
retiring             795937030816   # 56.2% (56.2%) high
-- ucode             318086094      #     0.0%
-- fastpath          795618944722   #    56.2%
frontend             225753182746   # 15.9% (16.0%)
-- latency           123871977144   #     8.7%
-- bandwidth         101881205602   #     7.2%
backend              337512903478   # 23.8% (23.8%)
-- cpu               144707653849   #    10.2%
-- memory            192805249629   #    13.6%
speculation          56157233610    #  4.0% ( 4.0%)
-- branch mispredict 53613407355    #     3.8%
-- pipeline restart  2543826255     #     0.2%
smt-contention       373171679      #  0.0% ( 0.0%)
cpu-cycles           706487025765   # 0.26 GHz
instructions         2216436708146  # 3.14 IPC high
instructions         739409511698   # 12.099 l2 access per 1000 inst
l2 hit from l1       6321172491     # 23.47% l2 miss
l2 miss from l1      133295780      #
l2 hit from l2 pf    658730830      #
l3 hit from l2 pf    1415521643     #
l3 miss from l2 pf   550753147      #
instructions         739266549923   # 2.087 float per 1000 inst
float 512            68             # 0.000 AVX-512 per 1000 inst
float 256            580            # 0.000 AVX-256 per 1000 inst
float 128            1543095062     # 2.087 AVX-128 per 1000 inst
float MMX            0              # 0.000 MMX per 1000 inst
float scalar         0              # 0.000 scalar per 1000 inst

Intel metrics

elapsed              164.961
on_cpu               0.057          # 0.91 / 16 cores
utime                150.022
stime                0.900
nvcsw                2145           # 73.31%
nivcsw               781            # 26.69%
inblock              2056           # 12.46/sec
onblock              5760           # 34.92/sec
cpu-clock            150932957744   # 150.933 seconds
task-clock           150937250108   # 150.937 seconds
page faults          552371         # 3659.607/sec
context switches     3463           # 22.943/sec
cpu migrations       354            # 2.345/sec
major page faults    16             # 0.106/sec
minor page faults    552355         # 3659.501/sec
alignment faults     0              # 0.000/sec
emulation faults     0              # 0.000/sec
branches             249717007520   # 123.921 branches per 1000 inst
branch misses        2282770198     # 0.91% branch miss
conditional          249717023360   # 123.921 conditional branches per 1000 inst
indirect             22076312923    # 10.955 indirect branches per 1000 inst
slots                3411955791404  #
retiring             2085926990436  # 61.1% (61.1%) high
-- ucode             213396009277   #     6.3%
-- fastpath          1872530981159  #    54.9%
frontend             653782899966   # 19.2% (19.2%)
-- latency           162807831077   #     4.8%
-- bandwidth         490975068889   #    14.4%
backend              414970189514   # 12.2% (12.2%) low
-- cpu               176324196229   #     5.2%
-- memory            238645993285   #     7.0%
speculation          429894175274   # 12.6% (12.6%) high
-- branch mispredict 420757258873   #    12.3%
-- pipeline restart  9136916401     #     0.3%
smt-contention       0              #  0.0% ( 0.0%)
cpu-cycles           568505974432   # 0.22 GHz
instructions         2013312584177  # 3.54 IPC high
l2 access            23904751544    # 11.887 l2 access per 1000 inst
l2 miss              7809825493     # 32.67% l2 miss

Process overview is simple

399 processes
	 33 mpcbench               150.14     0.70
	 68 clinfo                  16.54     5.66
	 38 vulkaninfo               1.15     1.11
	  6 glxinfo:gdrv0            0.15     0.02
	  6 glxinfo:gl0              0.15     0.02
	  4 vulkani:disk$0           0.12     0.11
	  2 glxinfo                  0.07     0.02
	  2 glxinfo:cs0              0.07     0.02
	  2 glxinfo:disk$0           0.07     0.02
	  2 glxinfo:sh0              0.07     0.02
	  2 glxinfo:shlo0            0.07     0.02
	  6 clang                    0.06     0.06
	  2 llvmpipe-0               0.06     0.06
	  2 llvmpipe-1               0.06     0.06
	  2 llvmpipe-10              0.06     0.06
	  2 llvmpipe-11              0.06     0.06
	  2 llvmpipe-12              0.06     0.06
	  2 llvmpipe-13              0.06     0.06
	  2 llvmpipe-14              0.06     0.06
	  2 llvmpipe-15              0.06     0.06
	  2 llvmpipe-2               0.06     0.06
	  2 llvmpipe-3               0.06     0.06
	  2 llvmpipe-4               0.06     0.06
	  2 llvmpipe-5               0.06     0.06
	  2 llvmpipe-6               0.06     0.06
	  2 llvmpipe-7               0.06     0.06
	  2 llvmpipe-8               0.06     0.06
	  2 llvmpipe-9               0.06     0.06
	  6 php                      0.04     0.09
	  3 rocminfo                 0.03     0.00
	  1 lspci                    0.00     0.02
	  1 ps                       0.00     0.01
	 81 sh                       0.00     0.00
	 13 gcc                      0.00     0.00
	 13 gsettings                0.00     0.00
	 10 sed                      0.00     0.00
	  8 stat                     0.00     0.00
	  8 systemd-detect-          0.00     0.00
	  6 llvm-link                0.00     0.00
	  6 make                     0.00     0.00
	  5 phoronix-test-s          0.00     0.00
	  3 ls                       0.00     0.00
	  2 cc                       0.00     0.00
	  2 gmain                    0.00     0.00
	  2 lscpu                    0.00     0.00
	  2 uname                    0.00     0.00
	  2 which                    0.00     0.00
	  2 xset                     0.00     0.00
	  1 date                     0.00     0.00
	  1 dconf worker             0.00     0.00
	  1 dirname                  0.00     0.00
	  1 dmesg                    0.00     0.00
	  1 dmidecode                0.00     0.00
	  1 grep                     0.00     0.00
	  1 ifconfig                 0.00     0.00
	  1 ip                       0.00     0.00
	  1 lsmod                    0.00     0.00
	  1 mktemp                   0.00     0.00
	  1 qdbus                    0.00     0.00
	  1 readlink                 0.00     0.00
	  1 realpath                 0.00     0.00
	  1 sort                     0.00     0.00
	  1 stty                     0.00     0.00
	  1 systemctl                0.00     0.00
	  1 template.sh              0.00     0.00
	  1 wc                       0.00     0.00
	  1 xrandr                   0.00     0.00
1 processes running
48 maximum processes

Computation structure

      110405) mpcbench         cpu=0 start=5.63  finish=55.92
        110406) make             cpu=6 start=5.63  finish=55.92
          110407) make             cpu=10 start=5.64  finish=55.92
            110408) mpcbench         cpu=4 start=5.64  finish=55.92
              110409) mpcbench         cpu=5 start=5.65  finish=5.65 
                110410) mpcbench         cpu=11 start=5.65  finish=5.65 
              110411) mpcbench         cpu=1 start=5.65  finish=5.65 
              110412) mpcbench         cpu=5 start=5.65  finish=5.65 
                110413) mpcbench         cpu=11 start=5.65  finish=5.65 
                110414) sed              cpu=15 start=5.65  finish=5.65 
              110415) mpcbench         cpu=1 start=5.65  finish=5.66 
                110416) ls               cpu=11 start=5.65  finish=5.66 
                110417) sed              cpu=15 start=5.65  finish=5.66 
              110418) mpcbench         cpu=5 start=5.66  finish=5.66 
              110419) mpcbench         cpu=11 start=5.66  finish=5.66 
                110420) mpcbench         cpu=15 start=5.66  finish=5.66 
                110421) sed              cpu=6 start=5.66  finish=5.66