{"id":1851,"date":"2024-02-29T18:59:15","date_gmt":"2024-02-29T18:59:15","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=1851"},"modified":"2024-03-01T02:20:04","modified_gmt":"2024-03-01T02:20:04","slug":"rsvg","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/rsvg\/","title":{"rendered":"rsvg"},"content":{"rendered":"\n<p>Testing the librsvg vector graphics library. There is one test that runs in less than 30 seconds. It looks mostly single-threaded.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/systemtime-4.png\" alt=\"\" class=\"wp-image-1876\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/systemtime-4.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/systemtime-4-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/systemtime-4-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>Topdown profile shows retirement stalls as the highest portion.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/amdtopdown-4.png\" alt=\"\" class=\"wp-image-1878\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/amdtopdown-4.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/amdtopdown-4-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/03\/amdtopdown-4-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics show oncore of only 0.7 and low backend stalls. There is not much floating point.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              91.690\non_cpu               0.043          # 0.69 \/ 16 cores\nutime                45.313\nstime                18.342\nnvcsw                76062          # 95.55%\nnivcsw               3545           # 4.45%\ninblock              8              # 0.09\/sec\nonblock              103192         # 1125.45\/sec\ncpu-clock            90815954701    # 90.816 seconds\ntask-clock           90912432328    # 90.912 seconds\npage faults          4042969        # 44471.024\/sec\ncontext switches     77945          # 857.363\/sec\ncpu migrations       5046           # 55.504\/sec\nmajor page faults    2              # 0.022\/sec\nminor page faults    4042967        # 44471.002\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             147471797199   # 176.955 branches per 1000 inst\nbranch misses        3089473014     # 2.09% branch miss\nconditional          110382888857   # 132.451 conditional branches per 1000 inst\nindirect             5037841192     # 6.045 indirect branches per 1000 inst\ncpu-cycles           205545500455   # 0.14 GHz\ninstructions         642870821370   # 3.13 IPC high\nslots                619621597422   #\nretiring             273417324155   # 44.1% (44.3%)\n-- ucode             793369832      #     0.1%\n-- fastpath          272623954323   #    44.0%\nfrontend             219722230918   # 35.5% (35.6%)\n-- latency           168134443470   #    27.1%\n-- bandwidth         51587787448    #     8.3%\nbackend              96600027920    # 15.6% (15.7%) low\n-- cpu               23569596994    #     3.8%\n-- memory            73030430926    #    11.8%\nspeculation          27318925102    #  4.4% ( 4.4%)\n-- branch mispredict 27122343208    #     4.4%\n-- pipeline restart  196581894      #     0.0%\nsmt-contention       2561245890     #  0.4% ( 0.0%)\ncpu-cycles           205272619877   # 0.14 GHz\ninstructions         637323484540   # 3.10 IPC high\ninstructions         275283291422   # 14.327 l2 access per 1000 inst\nl2 hit from l1       3380047154     # 21.79% l2 miss\nl2 miss from l1      582451810      #\nl2 hit from l2 pf    287054714      #\nl3 hit from l2 pf    187892718      #\nl3 miss from l2 pf   89099902       #\ninstructions         279028735556   # 29.884 float per 1000 inst\nfloat 512            664            # 0.000 AVX-512 per 1000 inst\nfloat 256            596            # 0.000 AVX-256 per 1000 inst\nfloat 128            8338532082     # 29.884 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         0              # 0.000 scalar per 1000 inst\ninstructions         830841454647   #\nopcache              135720537947   # 163.353 opcache per 1000 inst\nopcache miss         20306663678    # 15.0% opcache miss rate\nl1 dTLB miss         975531218      # 1.174 L1 dTLB per 1000 inst\nl2 dTLB miss         61755707       # 0.074 L2 dTLB per 1000 inst\ninstructions         831010059760   #\nicache               44486178211    # 53.533 icache per 1000 inst\nicache miss          5914794798     # 13.3% icache miss rate\nl1 iTLB miss         73929852       # 0.089 L1 iTLB per 1000 inst\nl2 iTLB miss         0              # 0.000 L2 iTLB per 1000 inst\ntlb flush            893267         # 0.001 TLB flush per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              102.997\non_cpu               0.044          # 0.70 \/ 16 cores\nutime                56.876\nstime                15.552\nnvcsw                74213          # 92.13%\nnivcsw               6340           # 7.87%\ninblock              8624           # 83.73\/sec\nonblock              91928          # 892.53\/sec\ncpu-clock            94859011406    # 94.859 seconds\ntask-clock           95098965683    # 95.099 seconds\npage faults          3987414        # 41929.100\/sec\ncontext switches     78931          # 829.988\/sec\ncpu migrations       7729           # 81.273\/sec\nmajor page faults    66             # 0.694\/sec\nminor page faults    3987348        # 41928.406\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             143294059807   # 173.502 branches per 1000 inst\nbranch misses        1333887632     # 0.93% branch miss\nconditional          143295083391   # 173.503 conditional branches per 1000 inst\nindirect             6271520462     # 7.594 indirect branches per 1000 inst\nslots                1481294216192  #\nretiring             763595480156   # 51.5% (51.5%)\n-- ucode             44478293800    #     3.0%\n-- fastpath          719117186356   #    48.5%\nfrontend             336402997348   # 22.7% (22.7%)\n-- latency           170261793634   #    11.5%\n-- bandwidth         166141203714   #    11.2%\nbackend              213270529849   # 14.4% (14.4%) low\n-- cpu               116212942772   #     7.8%\n-- memory            97057587077    #     6.6%\nspeculation          171291661914   # 11.6% (11.6%) high\n-- branch mispredict 165903031700   #    11.2%\n-- pipeline restart  5388630214     #     0.4%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           258683319982   # 0.15 GHz\ninstructions         841791808851   # 3.25 IPC high\nl2 access            9594028901     # 11.888 l2 access per 1000 inst\nl2 miss              3246302962     # 33.84% l2 miss\ncpu-cycles           248821553158   # 13.1% memory latency\nload stalls          30878249372    #  1.7% l1 bound\nl1 miss              26556503992    #  5.9% l2 bound\nl2 miss              11955857818    #  2.1% l3 bound\nl3 miss              6672889025     #  2.7% dram bound\nstore_stalls         1636418541     #  0.7% store bound\n<\/code><\/pre>\n\n\n\n<p>Process overview shows many short-lived processes running.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>31989 processes\n\t25845 rsvg-convert           432.25   174.98\n\t1942 gmain                   52.40    14.06\n\t1938 gdbus                   52.37    14.05\n\t 68 clinfo                  15.55     6.97\n\t 38 vulkaninfo               1.52     0.95\n\t  4 vulkani:disk$0           0.16     0.10\n\t  2 llvmpipe-0               0.08     0.05\n\t  2 llvmpipe-1               0.08     0.05\n\t  2 llvmpipe-10              0.08     0.05\n\t  2 llvmpipe-11              0.08     0.05\n\t  2 llvmpipe-12              0.08     0.05\n\t  2 llvmpipe-13              0.08     0.05\n\t  2 llvmpipe-14              0.08     0.05\n\t  2 llvmpipe-15              0.08     0.05\n\t  2 llvmpipe-2               0.08     0.05\n\t  2 llvmpipe-3               0.08     0.05\n\t  2 llvmpipe-4               0.08     0.05\n\t  2 llvmpipe-5               0.08     0.05\n\t  2 llvmpipe-6               0.08     0.05\n\t  2 llvmpipe-7               0.08     0.05\n\t  2 llvmpipe-8               0.08     0.05\n\t  2 llvmpipe-9               0.08     0.05\n\t  3 rsvg                     0.07     0.45\n\t  6 php                      0.06     0.07\n\t  6 clang                    0.06     0.06\n\t  3 rocminfo                 0.03     0.00\n\t1935 &#91;pango] FcInit           0.00     5.30\n\t  1 lspci                    0.00     0.02\n\t  1 ps                       0.00     0.01\n\t 82 sh                       0.00     0.00\n\t 12 gcc                      0.00     0.00\n\t  9 gsettings                0.00     0.00\n\t  8 stat                     0.00     0.00\n\t  8 systemd-detect-          0.00     0.00\n\t  6 llvm-link                0.00     0.00\n\t  5 phoronix-test-s          0.00     0.00\n\t  4 glxinfo                  0.00     0.00\n\t  4 grep                     0.00     0.00\n\t  3 dconf worker             0.00     0.00\n\t  2 lscpu                    0.00     0.00\n\t  2 setterm                  0.00     0.00\n\t  2 uname                    0.00     0.00\n\t  2 which                    0.00     0.00\n\t  1 cc                       0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 dmesg                    0.00     0.00\n\t  1 dmidecode                0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lsmod                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath                 0.00     0.00\n\t  1 sed                      0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 stty                     0.00     0.00\n\t  1 systemctl                0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n0 processes running\n47 maximum processes\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Testing the librsvg vector graphics library. There is one test that runs in less than 30 seconds. It looks mostly single-threaded. Topdown profile shows retirement stalls as the highest portion. AMD metrics show oncore of only 0.7 and low backend <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/rsvg\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-1851","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/1851","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=1851"}],"version-history":[{"count":2,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/1851\/revisions"}],"predecessor-version":[{"id":1879,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/1851\/revisions\/1879"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=1851"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}