{"id":739,"date":"2024-01-20T12:54:18","date_gmt":"2024-01-20T12:54:18","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=739"},"modified":"2024-01-20T19:10:20","modified_gmt":"2024-01-20T19:10:20","slug":"svt-vp9","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/svt-vp9\/","title":{"rendered":"svt-vp9"},"content":{"rendered":"\n<p>A video encoder for the vp9 format. Runs with six test cases. Launches threads p to the 30 but also variable.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-45.png\" alt=\"\" class=\"wp-image-767\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-45.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-45-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/systemtime-45-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>Topdown profile shows medium level retirement rate limited by some backend stalls and with frontend stalls less of a factor. Also some variation between test cases.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-83.png\" alt=\"\" class=\"wp-image-769\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-83.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-83-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/01\/amdtopdown-83-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics shows some floating point and moderate number of branches.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              266.392\non_cpu               0.418          # 6.69 \/ 16 cores\nutime                1708.096\nstime                73.706\nnvcsw                2218418        # 79.83%\nnivcsw               560492         # 20.17%\ninblock              600            # 2.25\/sec\nonblock              14184          # 53.24\/sec\ncpu-clock            1782258318300  # 1782.258 seconds\ntask-clock           1782953304677  # 1782.953 seconds\npage faults          14950365       # 8385.169\/sec\ncontext switches     2780028        # 1559.226\/sec\ncpu migrations       387964         # 217.596\/sec\nmajor page faults    23             # 0.013\/sec\nminor page faults    14950342       # 8385.156\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             656720657493   # 76.023 branches per 1000 inst\nbranch misses        10997348830    # 1.67% branch miss\nconditional          502666679558   # 58.190 conditional branches per 1000 inst\nindirect             21010290401    # 2.432 indirect branches per 1000 inst\ncpu-cycles           7046430654174  # 1.70 GHz\ninstructions         8624634778235  # 1.22 IPC\nslots                14123897786988 #\nretiring             3071043407476  # 21.7% (26.7%)\n-- ucode             85466210742    #     0.6%\n-- fastpath          2985577196734  #    21.1%\nfrontend             2189782239321  # 15.5% (19.1%)\n-- latency           1147789014888  #     8.1%\n-- bandwidth         1041993224433  #     7.4%\nbackend              6045479931307  # 42.8% (52.6%)\n-- cpu               2288298994869  #    16.2%\n-- memory            3757180936438  #    26.6%\nspeculation          176219975819   #  1.2% ( 1.5%)\n-- branch mispredict 170597713920   #     1.2%\n-- pipeline restart  5622261899     #     0.0%\nsmt-contention       2641149276223  # 18.7% ( 0.0%)\ncpu-cycles           7028720585099  # 1.70 GHz\ninstructions         8631733824331  # 1.23 IPC\ninstructions         2880802386642  # 61.126 l2 access per 1000 inst\nl2 hit from l1       139518253857   # 12.79% l2 miss\nl2 miss from l1      11080156914    #\nl2 hit from l2 pf    25138535837    #\nl3 hit from l2 pf    4529358713     #\nl3 miss from l2 pf   6905820477     #\ninstructions         2876027823141  # 127.789 float per 1000 inst\nfloat 512            77             # 0.000 AVX-512 per 1000 inst\nfloat 256            574            # 0.000 AVX-256 per 1000 inst\nfloat 128            367523319628   # 127.789 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         2              # 0.000 scalar per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              955.109\non_cpu               0.483          # 7.72 \/ 16 cores\nutime                7090.140\nstime                286.201\nnvcsw                10235560       # 77.18%\nnivcsw               3027090        # 22.82%\ninblock              81444768       # 85272.71\/sec\nonblock              5920           # 6.20\/sec\ncpu-clock            7374837680273  # 7374.838 seconds\ntask-clock           7377544161006  # 7377.544 seconds\npage faults          56516061       # 7660.552\/sec\ncontext switches     13267164       # 1798.317\/sec\ncpu migrations       2698093        # 365.717\/sec\nmajor page faults    76             # 0.010\/sec\nminor page faults    56515985       # 7660.542\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             2374199693841  # 76.574 branches per 1000 inst\nbranch misses        30873470201    # 1.30% branch miss\nconditional          2374199980081  # 76.574 conditional branches per 1000 inst\nindirect             714692719974   # 23.051 indirect branches per 1000 inst\nslots                48054039135266 #\nretiring             23387060560527 # 48.7% (48.7%)\n-- ucode             3022764780478  #     6.3%\n-- fastpath          20364295780049 #    42.4%\nfrontend             8537158800292  # 17.8% (17.8%)\n-- latency           5088733620331  #    10.6%\n-- bandwidth         3448425179961  #     7.2%\nbackend              13737332127616 # 28.6% (28.6%)\n-- cpu               3478462484718  #     7.2%\n-- memory            10258869642898 #    21.3%\nspeculation          2988699083208  #  6.2% ( 6.2%)\n-- branch mispredict 2758680527908  #     5.7%\n-- pipeline restart  230018555300   #     0.5%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           23749625878854 # 1.55 GHz\ninstructions         35707894166039 # 1.50 IPC\nl2 access            866445792868   # 51.238 l2 access per 1000 inst\nl2 miss              223753253127   # 25.82% l2 miss\n<\/code><\/pre>\n\n\n\n<p>Process overview concentrates on the SvtVp9EncApp<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>2620 processes\n\t2250 SvtVp9EncApp         107501.07  4524.13\n\t 68 clinfo                  16.27     6.65\n\t 38 vulkaninfo               1.13     0.96\n\t  6 glxinfo:gdrv0            0.19     0.09\n\t  4 vulkani:disk$0           0.12     0.10\n\t  6 php                      0.09     0.16\n\t  2 glxinfo                  0.09     0.03\n\t  2 glxinfo:cs0              0.09     0.03\n\t  2 glxinfo:disk$0           0.09     0.03\n\t  2 glxinfo:sh0              0.09     0.03\n\t  2 glxinfo:shlo0            0.09     0.03\n\t  2 llvmpipe-0               0.06     0.05\n\t  2 llvmpipe-1               0.06     0.05\n\t  2 llvmpipe-10              0.06     0.05\n\t  2 llvmpipe-11              0.06     0.05\n\t  2 llvmpipe-12              0.06     0.05\n\t  2 llvmpipe-13              0.06     0.05\n\t  2 llvmpipe-14              0.06     0.05\n\t  2 llvmpipe-15              0.06     0.05\n\t  2 llvmpipe-2               0.06     0.05\n\t  2 llvmpipe-3               0.06     0.05\n\t  2 llvmpipe-4               0.06     0.05\n\t  2 llvmpipe-5               0.06     0.05\n\t  2 llvmpipe-6               0.06     0.05\n\t  2 llvmpipe-7               0.06     0.05\n\t  2 llvmpipe-8               0.06     0.05\n\t  2 llvmpipe-9               0.06     0.05\n\t  6 clang                    0.05     0.07\n\t  3 rocminfo                 0.00     0.03\n\t  1 lspci                    0.00     0.02\n\t  1 ps                       0.00     0.01\n\t 92 sh                       0.00     0.00\n\t 18 svt-vp9                  0.00     0.00\n\t 13 gcc                      0.00     0.00\n\t 11 gsettings                0.00     0.00\n\t  8 stat                     0.00     0.00\n\t  8 systemd-detect-          0.00     0.00\n\t  6 llvm-link                0.00     0.00\n\t  5 phoronix-test-s          0.00     0.00\n\t  3 gmain                    0.00     0.00\n\t  2 cc                       0.00     0.00\n\t  2 dconf worker             0.00     0.00\n\t  2 lscpu                    0.00     0.00\n\t  2 uname                    0.00     0.00\n\t  2 which                    0.00     0.00\n\t  2 xset                     0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 dmesg                    0.00     0.00\n\t  1 dmidecode                0.00     0.00\n\t  1 grep                     0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lsmod                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath                 0.00     0.00\n\t  1 sed                      0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 stty                     0.00     0.00\n\t  1 systemctl                0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n\t  1 xrandr                   0.00     0.00\n0 processes running\n73 maximum processes\n<\/code><\/pre>\n\n\n\n<p>Most of these get launched from the thread app<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>      2895193) svt-vp9          cpu=10 start=5.69  finish=17.95\n        2895194) SvtVp9EncApp     cpu=11 start=5.69  finish=17.87\n          2895195) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895196) SvtVp9EncApp     cpu=14 start=6.78  finish=17.57\n          2895197) SvtVp9EncApp     cpu=0 start=6.78  finish=6.78 \n          2895198) SvtVp9EncApp     cpu=9 start=6.78  finish=17.57\n          2895199) SvtVp9EncApp     cpu=2 start=6.78  finish=6.78 \n          2895200) SvtVp9EncApp     cpu=4 start=6.78  finish=17.57\n          2895201) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895202) SvtVp9EncApp     cpu=6 start=6.78  finish=17.57\n          2895203) SvtVp9EncApp     cpu=13 start=6.78  finish=6.78 \n          2895204) SvtVp9EncApp     cpu=4 start=6.78  finish=17.57\n          2895205) SvtVp9EncApp     cpu=15 start=6.78  finish=6.78 \n          2895206) SvtVp9EncApp     cpu=0 start=6.78  finish=17.57\n          2895207) SvtVp9EncApp     cpu=9 start=6.78  finish=6.78 \n          2895208) SvtVp9EncApp     cpu=2 start=6.78  finish=17.57\n          2895209) SvtVp9EncApp     cpu=11 start=6.78  finish=6.78 \n          2895210) SvtVp9EncApp     cpu=4 start=6.78  finish=17.57\n          2895211) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895212) SvtVp9EncApp     cpu=2 start=6.78  finish=17.57\n          2895213) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895214) SvtVp9EncApp     cpu=0 start=6.78  finish=17.57\n          2895215) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895216) SvtVp9EncApp     cpu=9 start=6.78  finish=17.57\n          2895217) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895218) SvtVp9EncApp     cpu=3 start=6.78  finish=17.57\n          2895219) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895220) SvtVp9EncApp     cpu=14 start=6.78  finish=17.57\n          2895221) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895222) SvtVp9EncApp     cpu=6 start=6.78  finish=17.56\n          2895223) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895224) SvtVp9EncApp     cpu=13 start=6.78  finish=17.56\n          2895225) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895226) SvtVp9EncApp     cpu=6 start=6.78  finish=17.56\n          2895227) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895228) SvtVp9EncApp     cpu=4 start=6.78  finish=17.56\n          2895229) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895230) SvtVp9EncApp     cpu=13 start=6.78  finish=17.56\n          2895231) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895232) SvtVp9EncApp     cpu=8 start=6.78  finish=17.56\n          2895233) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895234) SvtVp9EncApp     cpu=7 start=6.78  finish=17.56\n          2895235) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895236) SvtVp9EncApp     cpu=13 start=6.78  finish=17.56\n          2895237) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895238) SvtVp9EncApp     cpu=12 start=6.78  finish=17.56\n          2895239) SvtVp9EncApp     cpu=2 start=6.78  finish=6.78 \n          2895240) SvtVp9EncApp     cpu=4 start=6.78  finish=17.56\n          2895241) SvtVp9EncApp     cpu=11 start=6.78  finish=6.78 \n          2895242) SvtVp9EncApp     cpu=3 start=6.78  finish=17.56\n          2895243) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895244) SvtVp9EncApp     cpu=0 start=6.78  finish=17.56\n          2895245) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895246) SvtVp9EncApp     cpu=15 start=6.78  finish=17.56\n          2895247) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895248) SvtVp9EncApp     cpu=2 start=6.78  finish=17.56\n          2895249) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895250) SvtVp9EncApp     cpu=9 start=6.78  finish=17.56\n          2895251) SvtVp9EncApp     cpu=15 start=6.78  finish=6.78 \n          2895252) SvtVp9EncApp     cpu=4 start=6.78  finish=17.56\n          2895253) SvtVp9EncApp     cpu=2 start=6.78  finish=6.78 \n          2895254) SvtVp9EncApp     cpu=10 start=6.78  finish=17.56\n          2895255) SvtVp9EncApp     cpu=2 start=6.78  finish=6.78 \n          2895256) SvtVp9EncApp     cpu=0 start=6.78  finish=17.56\n          2895257) SvtVp9EncApp     cpu=15 start=6.78  finish=6.78 \n          2895258) SvtVp9EncApp     cpu=14 start=6.78  finish=17.56\n          2895259) SvtVp9EncApp     cpu=11 start=6.78  finish=6.78 \n          2895260) SvtVp9EncApp     cpu=6 start=6.78  finish=17.56\n          2895261) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895262) SvtVp9EncApp     cpu=9 start=6.78  finish=17.56\n          2895263) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895264) SvtVp9EncApp     cpu=14 start=6.78  finish=17.56\n          2895265) SvtVp9EncApp     cpu=11 start=6.78  finish=6.78 \n          2895266) SvtVp9EncApp     cpu=12 start=6.78  finish=17.56\n          2895267) SvtVp9EncApp     cpu=14 start=6.78  finish=6.78 \n          2895268) SvtVp9EncApp     cpu=3 start=6.78  finish=17.56\n          2895269) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895270) SvtVp9EncApp     cpu=2 start=6.78  finish=17.56\n          2895271) SvtVp9EncApp     cpu=0 start=6.78  finish=6.78 \n          2895272) SvtVp9EncApp     cpu=10 start=6.78  finish=17.56\n          2895273) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895274) SvtVp9EncApp     cpu=4 start=6.78  finish=17.56\n          2895275) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895276) SvtVp9EncApp     cpu=14 start=6.78  finish=17.56\n          2895277) SvtVp9EncApp     cpu=5 start=6.78  finish=6.78 \n          2895278) SvtVp9EncApp     cpu=10 start=6.78  finish=17.56\n          2895279) SvtVp9EncApp     cpu=0 start=6.78  finish=6.78 \n          2895280) SvtVp9EncApp     cpu=1 start=6.78  finish=17.56\n          2895281) SvtVp9EncApp     cpu=14 start=6.79  finish=6.79 \n          2895282) SvtVp9EncApp     cpu=6 start=6.79  finish=17.56\n          2895283) SvtVp9EncApp     cpu=11 start=6.79  finish=6.79 \n          2895284) SvtVp9EncApp     cpu=14 start=6.79  finish=17.56\n          2895285) SvtVp9EncApp     cpu=0 start=6.79  finish=6.79 \n          2895286) SvtVp9EncApp     cpu=4 start=6.79  finish=17.56\n          2895287) SvtVp9EncApp     cpu=2 start=6.79  finish=6.79 \n          2895288) SvtVp9EncApp     cpu=2 start=6.79  finish=17.56\n          2895289) SvtVp9EncApp     cpu=5 start=6.79  finish=6.79 \n          2895290) SvtVp9EncApp     cpu=13 start=6.79  finish=17.56\n          2895291) SvtVp9EncApp     cpu=5 start=6.79  finish=6.79 \n          2895292) SvtVp9EncApp     cpu=7 start=6.79  finish=17.56\n          2895293) SvtVp9EncApp     cpu=15 start=6.79  finish=6.79 \n          2895294) SvtVp9EncApp     cpu=9 start=6.79  finish=17.56\n          2895295) SvtVp9EncApp     cpu=14 start=6.79  finish=6.79 \n          2895296) SvtVp9EncApp     cpu=15 start=6.79  finish=17.56\n          2895297) SvtVp9EncApp     cpu=14 start=6.79  finish=6.79 \n          2895298) SvtVp9EncApp     cpu=10 start=6.79  finish=17.56\n          2895299) SvtVp9EncApp     cpu=11 start=6.79  finish=6.79 \n          2895300) SvtVp9EncApp     cpu=12 start=6.79  finish=17.56\n          2895301) SvtVp9EncApp     cpu=11 start=6.79  finish=6.79 \n          2895302) SvtVp9EncApp     cpu=0 start=6.79  finish=17.56\n          2895303) SvtVp9EncApp     cpu=5 start=6.79  finish=6.79 \n          2895304) SvtVp9EncApp     cpu=6 start=6.79  finish=17.56\n          2895305) SvtVp9EncApp     cpu=5 start=6.79  finish=6.79 \n          2895306) SvtVp9EncApp     cpu=3 start=6.79  finish=17.56\n          2895307) SvtVp9EncApp     cpu=15 start=6.79  finish=6.79 \n          2895308) SvtVp9EncApp     cpu=8 start=6.79  finish=17.56\n          2895309) SvtVp9EncApp     cpu=15 start=6.79  finish=6.79 \n          2895310) SvtVp9EncApp     cpu=12 start=6.79  finish=17.56\n          2895311) SvtVp9EncApp     cpu=14 start=6.79  finish=6.79 \n          2895312) SvtVp9EncApp     cpu=14 start=6.79  finish=17.56\n          2895313) SvtVp9EncApp     cpu=14 start=6.79  finish=6.79 \n          2895314) SvtVp9EncApp     cpu=3 start=6.79  finish=17.56\n          2895315) SvtVp9EncApp     cpu=5 start=6.79  finish=6.79 \n          2895316) SvtVp9EncApp     cpu=2 start=6.79  finish=17.56\n          2895317) SvtVp9EncApp     cpu=5 start=6.79  finish=6.79 \n          2895318) SvtVp9EncApp     cpu=14 start=6.79  finish=17.56\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>A video encoder for the vp9 format. Runs with six test cases. Launches threads p to the 30 but also variable. Topdown profile shows medium level retirement rate limited by some backend stalls and with frontend stalls less of a <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/svt-vp9\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-739","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/739","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=739"}],"version-history":[{"count":2,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/739\/revisions"}],"predecessor-version":[{"id":770,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/739\/revisions\/770"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=739"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}