{"id":866,"date":"2024-01-24T12:38:10","date_gmt":"2024-01-24T12:38:10","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=866"},"modified":"2024-01-25T02:46:29","modified_gmt":"2024-01-25T02:46:29","slug":"indigobench","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/indigobench\/","title":{"rendered":"indigobench"},"content":{"rendered":"\n<p>Indigo Renderer benchmark with two rendering workloads. Both can run on either the GPU or the CPU; only the CPU tests are shown below. It behaves like a regular benchmark, keeping all cores busy and running consistently across them.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-60.png\" alt=\"\" class=\"wp-image-884\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-60.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-60-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-60-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>The topdown profile is consistently backend bound: roughly 1\/3 of the backend stalls are CPU bound and the other 2\/3 are memory bound.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-98.png\" alt=\"\" class=\"wp-image-886\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-98.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-98-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-98-768x576.png 768w\" sizes=\"auto, 
(max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>The AMD metrics show a large share of floating point operations (about 556 per 1000 instructions) and comparatively few branches (about 84 per 1000 instructions).<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              404.353\non_cpu               0.894          # 14.31 \/ 16 cores\nutime                5764.252\nstime                22.314\nnvcsw                666006         # 89.12%\nnivcsw               81316          # 10.88%\ninblock              81032          # 200.40\/sec\nonblock              12792          # 31.64\/sec\ncpu-clock            5786852053047  # 5786.852 seconds\ntask-clock           5787091569274  # 5787.092 seconds\npage faults          3664949        # 633.297\/sec\ncontext switches     749144         # 129.451\/sec\ncpu migrations       9179           # 1.586\/sec\nmajor page faults    466            # 0.081\/sec\nminor page faults    3664483        # 633.217\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             1927522446306  # 84.099 branches per 1000 inst\nbranch misses        76177485507    # 3.95% branch miss\nconditional          1385827915516  # 60.465 conditional branches per 1000 inst\nindirect             91982710671    # 4.013 indirect branches per 1000 inst\ncpu-cycles           22951749219719 # 3.54 GHz\ninstructions         22849653371136 # 1.00 IPC\nslots                45910223556726 #\nretiring             8176828084673  # 17.8% (24.7%)\n-- ucode             52899791055    #     0.1%\n-- fastpath          8123928293618  #    17.7%\nfrontend             4448403748523  #  9.7% (13.4%)\n-- latency           2857039901472  #     6.2%\n-- bandwidth         1591363847051  #     3.5%\nbackend              18865029799474 # 41.1% (57.0%)\n-- cpu               6527164587666  #    14.2%\n-- memory            12337865211808 #    26.9%\nspeculation          1590415849440  #  3.5% ( 4.8%)\n-- branch mispredict 1510073060500  #     3.3%\n-- pipeline restart  
80342788940    #     0.2%\nsmt-contention       12829406620801 # 27.9% ( 0.0%)\ncpu-cycles           22929148832658 # 3.55 GHz\ninstructions         22850737354587 # 1.00 IPC\ninstructions         7618191301881  # 58.648 l2 access per 1000 inst\nl2 hit from l1       418471421240   # 8.57% l2 miss\nl2 miss from l1      22857102562    #\nl2 hit from l2 pf    12904572285    #\nl3 hit from l2 pf    10082387582    #\nl3 miss from l2 pf   5334695570     #\ninstructions         7614549991552  # 555.900 float per 1000 inst\nfloat 512            53             # 0.000 AVX-512 per 1000 inst\nfloat 256            656            # 0.000 AVX-256 per 1000 inst\nfloat 128            4232926639144  # 555.900 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         0              # 0.000 scalar per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              402.006\non_cpu               0.885          # 14.16 \/ 16 cores\nutime                5657.048\nstime                34.367\nnvcsw                1692013        # 95.68%\nnivcsw               76460          # 4.32%\ninblock              344            # 0.86\/sec\nonblock              1536           # 3.82\/sec\ncpu-clock            5690616617544  # 5690.617 seconds\ntask-clock           5690921144997  # 5690.921 seconds\npage faults          3643107        # 640.161\/sec\ncontext switches     1770284        # 311.072\/sec\ncpu migrations       34308          # 6.029\/sec\nmajor page faults    9              # 0.002\/sec\nminor page faults    3643098        # 640.160\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             1496514444286  # 84.153 branches per 1000 inst\nbranch misses        58536232706    # 3.91% branch miss\nconditional          1496514476734  # 84.153 conditional branches per 1000 inst\nindirect             425038614791   # 23.901 
indirect branches per 1000 inst\nslots                27842666415128 #\nretiring             10034344071443 # 36.0% (36.0%)\n-- ucode             620691813018   #     2.2%\n-- fastpath          9413652258425  #    33.8%\nfrontend             6611392839780  # 23.7% (23.7%)\n-- latency           4549594771092  #    16.3%\n-- bandwidth         2061798068688  #     7.4%\nbackend              7080573550157  # 25.4% (25.4%)\n-- cpu               3038130250122  #    10.9%\n-- memory            4042443300035  #    14.5%\nspeculation          4083962313292  # 14.7% (14.7%)\n-- branch mispredict 3743951192849  #    13.4%\n-- pipeline restart  340011120443   #     1.2%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           16283526204732 # 2.53 GHz\ninstructions         17908395877986 # 1.10 IPC\nl2 access            404574679649   # 41.095 l2 access per 1000 inst\nl2 miss              73652577745    # 18.20% l2 miss\n<\/code><\/pre>\n\n\n\n<p>A large portion of the time is spent in the benchmark driver itself.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>946 processes\n\t558 indigo_benchmar      438000.46  1775.86\n\t  6 QXcbEventQueue        5763.06    22.45\n\t  6 indigo_:disk$0        5763.05    22.45\n\t  6 indigo_b:shlo0        5763.05    22.45\n\t  6 indigo_ben:cs0        5763.05    22.45\n\t  6 indigo_ben:sh0        5763.05    22.45\n\t 68 clinfo                  19.18     6.32\n\t 38 vulkaninfo               1.11     1.14\n\t  6 Qt bearer threa          0.21     0.00\n\t  6 glxinfo:gdrv0            0.15     0.12\n\t  4 vulkani:disk$0           0.11     0.12\n\t  6 php                      0.07     0.14\n\t  2 glxinfo                  0.07     0.04\n\t  2 glxinfo:cs0              0.07     0.04\n\t  2 glxinfo:disk$0           0.07     0.04\n\t  2 glxinfo:sh0              0.07     0.04\n\t  2 glxinfo:shlo0            0.07     0.04\n\t  2 llvmpipe-0               0.06     0.06\n\t  2 llvmpipe-1               0.06     0.06\n\t  2 llvmpipe-10              0.06     
0.06\n\t  2 llvmpipe-11              0.06     0.06\n\t  2 llvmpipe-12              0.06     0.06\n\t  2 llvmpipe-13              0.06     0.06\n\t  2 llvmpipe-14              0.06     0.06\n\t  2 llvmpipe-15              0.06     0.06\n\t  2 llvmpipe-2               0.06     0.06\n\t  2 llvmpipe-3               0.06     0.06\n\t  2 llvmpipe-4               0.06     0.06\n\t  2 llvmpipe-5               0.06     0.06\n\t  2 llvmpipe-6               0.06     0.06\n\t  2 llvmpipe-7               0.06     0.06\n\t  2 llvmpipe-8               0.06     0.06\n\t  2 llvmpipe-9               0.06     0.06\n\t  6 clang                    0.05     0.07\n\t  3 rocminfo                 0.03     0.00\n\t  1 lspci                    0.00     0.03\n\t  1 ps                       0.00     0.01\n\t 85 sh                       0.00     0.00\n\t 13 gcc                      0.00     0.00\n\t 10 gsettings                0.00     0.00\n\t  9 systemd-detect-          0.00     0.00\n\t  8 stat                     0.00     0.00\n\t  6 indigobench              0.00     0.00\n\t  6 llvm-link                0.00     0.00\n\t  5 phoronix-test-s          0.00     0.00\n\t  4 gmain                    0.00     0.00\n\t  2 cc                       0.00     0.00\n\t  2 dconf worker             0.00     0.00\n\t  2 lscpu                    0.00     0.00\n\t  2 uname                    0.00     0.00\n\t  2 which                    0.00     0.00\n\t  2 xset                     0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 dmesg                    0.00     0.00\n\t  1 dmidecode                0.00     0.00\n\t  1 grep                     0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lsmod                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath  
               0.00     0.00\n\t  1 sed                      0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 stty                     0.00     0.00\n\t  1 systemctl                0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n\t  1 xrandr                   0.00     0.00\n0 processes running\n100 maximum processes<\/code><\/pre>\n\n\n\n<p>The process tree shows indigobench starting many indigo_benchmar processes (names are truncated to 15 characters in the listing).<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>      67197) indigobench      cpu=10 start=5.58  finish=67.34\n        67198) indigo_benchmar  cpu=3 start=5.58  finish=67.25\n          67199) QXcbEventQueue   cpu=10 start=5.59  finish=67.25\n          67200) indigo_benchmar  cpu=0 start=5.61  finish=67.25\n          67201) indigo_benchmar  cpu=1 start=5.61  finish=67.25\n          67202) indigo_benchmar  cpu=2 start=5.61  finish=67.25\n          67203) indigo_benchmar  cpu=3 start=5.61  finish=67.25\n          67204) indigo_benchmar  cpu=4 start=5.61  finish=67.25\n          67205) indigo_benchmar  cpu=5 start=5.61  finish=67.25\n          67206) indigo_benchmar  cpu=6 start=5.61  finish=67.25\n          67207) indigo_benchmar  cpu=7 start=5.61  finish=67.25\n          67208) indigo_benchmar  cpu=8 start=5.61  finish=67.25\n          67209) indigo_benchmar  cpu=9 start=5.61  finish=67.25\n          67210) indigo_benchmar  cpu=10 start=5.61  finish=67.25\n          67211) indigo_benchmar  cpu=11 start=5.61  finish=67.25\n          67212) indigo_benchmar  cpu=12 start=5.61  finish=67.25\n          67213) indigo_benchmar  cpu=13 start=5.61  finish=67.25\n          67214) indigo_benchmar  cpu=14 start=5.61  finish=67.25\n          67215) indigo_benchmar  cpu=15 start=5.61  finish=67.25\n          67218) indigo_benchmar  cpu=2 start=5.67  finish=67.25\n          67219) indigo_benchmar  cpu=9 start=5.67  finish=5.67 \n          67220) Qt bearer threa  cpu=0 start=5.70  finish=67.25\n          67221) 
indigo_benchmar  cpu=10 start=6.17  finish=67.25\n          67222) indigo_benchmar  cpu=2 start=6.17  finish=67.25\n          67223) indigo_benchmar  cpu=10 start=6.17  finish=67.25\n          67224) indigo_benchmar  cpu=10 start=6.17  finish=67.25\n          67225) indigo_benchmar  cpu=11 start=6.17  finish=67.25\n          67226) indigo_benchmar  cpu=10 start=6.18  finish=67.25\n          67227) indigo_benchmar  cpu=10 start=6.18  finish=67.25\n          67228) indigo_benchmar  cpu=2 start=6.18  finish=67.25\n          67229) indigo_benchmar  cpu=8 start=6.18  finish=67.25\n          67230) indigo_benchmar  cpu=2 start=6.18  finish=67.25\n          67231) indigo_benchmar  cpu=10 start=6.18  finish=67.25\n          67232) indigo_benchmar  cpu=2 start=6.18  finish=67.25\n          67233) indigo_benchmar  cpu=6 start=6.18  finish=67.25\n          67234) indigo_benchmar  cpu=2 start=6.18  finish=67.25\n          67235) indigo_benchmar  cpu=4 start=6.18  finish=67.25\n          67236) indigo_benchmar  cpu=9 start=6.18  finish=67.25\n          67237) indigo_benchmar  cpu=11 start=6.18  finish=67.25\n          67238) indigo_benchmar  cpu=10 start=6.18  finish=67.25\n            67239) indigo_benchmar  cpu=10 start=6.18  finish=67.25\n            67240) indigo_benchmar  cpu=10 start=6.18  finish=67.25\n            67241) indigo_benchmar  cpu=2 start=6.18  finish=67.25\n            67242) indigo_benchmar  cpu=10 start=6.18  finish=67.25\n            67243) indigo_benchmar  cpu=2 start=6.18  finish=67.24\n            67244) indigo_benchmar  cpu=10 start=6.18  finish=67.24\n            67245) indigo_benchmar  cpu=11 start=6.18  finish=67.24\n            67246) indigo_benchmar  cpu=11 start=6.18  finish=67.24\n            67247) indigo_benchmar  cpu=10 start=6.18  finish=67.24\n            67248) indigo_benchmar  cpu=10 start=6.18  finish=67.24\n            67249) indigo_benchmar  cpu=10 start=6.18  finish=67.24\n            67250) indigo_benchmar  cpu=2 start=6.18  
finish=67.24\n            67251) indigo_benchmar  cpu=10 start=6.18  finish=67.24\n            67252) indigo_benchmar  cpu=2 start=6.18  finish=67.24\n            67253) indigo_benchmar  cpu=7 start=6.18  finish=67.24\n            67254) indigo_benchmar  cpu=11 start=6.18  finish=67.24\n            67255) indigo_benchmar  cpu=4 start=6.18  finish=6.23 \n            67256) indigo_benchmar  cpu=13 start=6.18  finish=6.23 \n            67257) indigo_benchmar  cpu=1 start=6.19  finish=6.23 \n            67258) indigo_benchmar  cpu=0 start=6.19  finish=6.23 \n            67259) indigo_benchmar  cpu=15 start=6.19  finish=6.23 \n            67260) indigo_benchmar  cpu=12 start=6.19  finish=6.23 \n            67261) indigo_benchmar  cpu=3 start=6.19  finish=6.23 \n            67262) indigo_benchmar  cpu=5 start=6.19  finish=6.23 \n            67263) indigo_benchmar  cpu=7 start=6.19  finish=6.23 \n            67264) indigo_benchmar  cpu=11 start=6.19  finish=6.23 \n            67265) indigo_benchmar  cpu=2 start=6.19  finish=6.23 \n            67266) indigo_benchmar  cpu=15 start=6.19  finish=6.23 \n            67267) indigo_benchmar  cpu=10 start=6.19  finish=6.23 \n            67268) indigo_benchmar  cpu=6 start=6.19  finish=6.23 \n            67269) indigo_benchmar  cpu=9 start=6.19  finish=6.23 \n            67270) indigo_benchmar  cpu=14 start=6.19  finish=6.23 \n            67275) indigo_benchmar  cpu=14 start=7.14  finish=67.24\n            67276) indigo_benchmar  cpu=10 start=7.14  finish=67.24\n            67277) indigo_benchmar  cpu=11 start=7.14  finish=67.24\n            67278) indigo_benchmar  cpu=12 start=7.14  finish=67.24\n            67279) indigo_benchmar  cpu=1 start=7.14  finish=67.24\n            67280) indigo_benchmar  cpu=1 start=7.14  finish=67.24\n            67281) indigo_benchmar  cpu=0 start=7.14  finish=67.24\n            67282) indigo_benchmar  cpu=10 start=7.14  finish=67.24\n            67283) indigo_benchmar  cpu=5 start=7.14  
finish=67.24\n            67284) indigo_benchmar  cpu=11 start=7.14  finish=67.24\n            67285) indigo_benchmar  cpu=2 start=7.14  finish=67.24\n            67286) indigo_benchmar  cpu=11 start=7.14  finish=67.24\n            67287) indigo_benchmar  cpu=10 start=7.14  finish=67.24\n            67288) indigo_benchmar  cpu=6 start=7.14  finish=67.24\n            67289) indigo_benchmar  cpu=13 start=7.14  finish=67.24\n            67290) indigo_benchmar  cpu=0 start=7.14  finish=67.24\n          67271) indigo_ben:cs0   cpu=2 start=6.23  finish=67.24\n          67272) indigo_:disk$0   cpu=12 start=6.23  finish=67.24\n          67273) indigo_ben:sh0   cpu=2 start=6.23  finish=67.24\n          67274) indigo_b:shlo0   cpu=2 start=6.23  finish=67.24\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Indigo Renderer benchmark. There are two rendering workloads. These can be done on both GPU and CPU but below are the CPU tests only. Looks like a regular benchmark as far as keeping all cores busy and running consistently on <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/indigobench\/\"><span class=\"more-msg\">Continue reading 
&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-866","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/866","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=866"}],"version-history":[{"count":2,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/866\/revisions"}],"predecessor-version":[{"id":887,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/866\/revisions\/887"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=866"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}