{"id":2282,"date":"2024-06-01T12:19:57","date_gmt":"2024-06-01T12:19:57","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?page_id=2282"},"modified":"2024-06-01T12:31:52","modified_gmt":"2024-06-01T12:31:52","slug":"node-octane","status":"publish","type":"page","link":"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/node-octane\/","title":{"rendered":"node-octane"},"content":{"rendered":"\n<p>Javascript Octane benchmark with one workload. It is quick running, single threaded.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-3.png\" alt=\"\" class=\"wp-image-2286\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-3.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-3-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/systemtime-3-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>Topdown profile looks a bit scattered on retirement and stalls.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"960\" src=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-4.png\" alt=\"\" class=\"wp-image-2288\" srcset=\"https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-4.png 1280w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-4-1024x768.png 1024w, https:\/\/mvermeulen.org\/perf\/wp-content\/uploads\/sites\/7\/2024\/06\/amdtopdown-4-768x576.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p>AMD metrics show not much floating point, a high IPC and reasonable retirement rate.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              104.178\non_cpu               0.067          # 1.07 \/ 16 cores\nutime                106.784\nstime                4.829\nnvcsw                195263         # 99.61%\nnivcsw               764            # 0.39%\ninblock              8              # 0.08\/sec\nonblock              12536          # 120.33\/sec\ncpu-clock            111094196613   # 111.094 seconds\ntask-clock           111231388136   # 111.231 seconds\npage faults          1122156        # 10088.483\/sec\ncontext switches     196368         # 1765.401\/sec\ncpu migrations       326            # 2.931\/sec\nmajor page faults    4              # 0.036\/sec\nminor page faults    1122152        # 10088.447\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             338956254589   # 216.164 branches per 1000 inst\nbranch misses        1771149942     # 0.52% branch miss\nconditional          276845569978   # 176.554 conditional branches per 1000 inst\nindirect             13220468194    # 8.431 indirect branches per 1000 inst\ncpu-cycles           484061004939   # 0.29 GHz\ninstructions         1559784642354  # 3.22 IPC high\nslots                971703456978   #\nretiring             496249861182   # 51.1% (51.3%)\n-- ucode             1218928126     #     0.1%\n-- fastpath          495030933056   #    50.9%\nfrontend             177971339405   # 18.3% (18.4%)\n-- latency           106594150902   #    11.0%\n-- bandwidth         71377188503    #     7.3%\nbackend              250138346546   # 25.7% (25.9%)\n-- cpu               63203304721    #     6.5%\n-- memory            186935041825   #    19.2%\nspeculation          42066752102    #  4.3% ( 4.4%)\n-- branch mispredict 40368937288    #     4.2%\n-- pipeline restart  1697814814     #     0.2%\nsmt-contention       5273139832     #  0.5% ( 0.0%)\ncpu-cycles           487529093501   # 0.29 GHz\ninstructions         1569196479319  # 3.22 IPC high\ninstructions         523496464659   # 15.041 l2 access per 1000 inst\nl2 hit from l1       6108036355     # 22.63% l2 miss\nl2 miss from l1      667325636      #\nl2 hit from l2 pf    651701718      #\nl3 hit from l2 pf    460659885      #\nl3 miss from l2 pf   653723722      #\ninstructions         523874216240   # 26.092 float per 1000 inst\nfloat 512            60             # 0.000 AVX-512 per 1000 inst\nfloat 256            644            # 0.000 AVX-256 per 1000 inst\nfloat 128            13669029298    # 26.092 AVX-128 per 1000 inst\nfloat MMX            0              # 0.000 MMX per 1000 inst\nfloat scalar         0              # 0.000 scalar per 1000 inst\ninstructions         1566071016253  #\nopcache              246248600449   # 157.240 opcache per 1000 inst\nopcache miss         14375350694    #  5.8% opcache miss rate\nl1 dTLB miss         1935602692     # 1.236 L1 dTLB per 1000 inst\nl2 dTLB miss         158505658      # 0.101 L2 dTLB per 1000 inst\ninstructions         1566138815632  #\nicache               23805191255    # 15.200 icache per 1000 inst\nicache miss          6022286080     # 25.3% icache miss rate\nl1 iTLB miss         965276691      # 0.616 L1 iTLB per 1000 inst\nl2 iTLB miss         0              # 0.000 L2 iTLB per 1000 inst\ntlb flush            191985         # 0.000 TLB flush per 1000 inst\n<\/code><\/pre>\n\n\n\n<p>Intel metrics<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>elapsed              104.844\non_cpu               0.068          # 1.08 \/ 16 cores\nutime                109.421\nstime                4.072\nnvcsw                176369         # 99.48%\nnivcsw               930            # 0.52%\ninblock              79480          # 758.08\/sec\nonblock              1296           # 12.36\/sec\ncpu-clock            112122082393   # 112.122 seconds\ntask-clock           112228433866   # 112.228 seconds\npage faults          999778         # 8908.420\/sec\ncontext switches     177654         # 1582.968\/sec\ncpu migrations       902            # 8.037\/sec\nmajor page faults    396            # 3.529\/sec\nminor page faults    999382         # 8904.891\/sec\nalignment faults     0              # 0.000\/sec\nemulation faults     0              # 0.000\/sec\nbranches             311098773367   # 215.942 branches per 1000 inst\nbranch misses        1513153923     # 0.49% branch miss\nconditional          311098783799   # 215.942 conditional branches per 1000 inst\nindirect             14539163236    # 10.092 indirect branches per 1000 inst\nslots                2388429927608  #\nretiring             1402967208441  # 58.7% (58.7%) high\n-- ucode             45194709311    #     1.9%\n-- fastpath          1357772499130  #    56.8%\nfrontend             472431510660   # 19.8% (19.8%)\n-- latency           165358888557   #     6.9%\n-- bandwidth         307072622103   #    12.9%\nbackend              339247010658   # 14.2% (14.2%) low\n-- cpu               197357121559   #     8.3%\n-- memory            141889889099   #     5.9%\nspeculation          195543874473   #  8.2% ( 8.2%)\n-- branch mispredict 185516547666   #     7.8%\n-- pipeline restart  10027326807    #     0.4%\nsmt-contention       0              #  0.0% ( 0.0%)\ncpu-cycles           416392267737   # 0.25 GHz\ninstructions         1474876949895  # 3.54 IPC high\nl2 access            19775574302    # 13.988 l2 access per 1000 inst\nl2 miss              7092133274     # 35.86% l2 miss\ncpu-cycles           400784340244   # 13.4% memory latency\nload stalls          43115337904    #  0.0% l1 bound\nl1 miss              43919464973    #  6.4% l2 bound\nl2 miss              18099082550    #  2.0% l3 bound\nl3 miss              9920802371     #  2.5% dram bound\nstore_stalls         10390568159    #  2.6% store bound\n<\/code><\/pre>\n\n\n\n<p>Process summary shows node as the primary process,.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>357 processes\n\t 21 node                   742.28    26.18\n\t 68 clinfo                  15.19     7.00\n\t 38 vulkaninfo               1.33     1.14\n\t  4 vulkani:disk$0           0.14     0.12\n\t  2 llvmpipe-0               0.07     0.06\n\t  2 llvmpipe-1               0.07     0.06\n\t  2 llvmpipe-10              0.07     0.06\n\t  2 llvmpipe-11              0.07     0.06\n\t  2 llvmpipe-12              0.07     0.06\n\t  2 llvmpipe-13              0.07     0.06\n\t  2 llvmpipe-14              0.07     0.06\n\t  2 llvmpipe-15              0.07     0.06\n\t  2 llvmpipe-2               0.07     0.06\n\t  2 llvmpipe-3               0.07     0.06\n\t  2 llvmpipe-4               0.07     0.06\n\t  2 llvmpipe-5               0.07     0.06\n\t  2 llvmpipe-6               0.07     0.06\n\t  2 llvmpipe-7               0.07     0.06\n\t  2 llvmpipe-8               0.07     0.06\n\t  2 llvmpipe-9               0.07     0.06\n\t  6 php                      0.06     0.07\n\t  6 clang                    0.06     0.06\n\t  3 rocminfo                 0.03     0.00\n\t  1 lspci                    0.01     0.02\n\t  3 nodejs                   0.01     0.00\n\t 83 sh                       0.00     0.00\n\t 12 gcc                      0.00     0.00\n\t 10 gsettings                0.00     0.00\n\t  8 stat                     0.00     0.00\n\t  8 systemd-detect-          0.00     0.00\n\t  6 llvm-link                0.00     0.00\n\t  5 glxinfo                  0.00     0.00\n\t  5 gmain                    0.00     0.00\n\t  5 phoronix-test-s          0.00     0.00\n\t  3 node-octane              0.00     0.00\n\t  2 grep                     0.00     0.00\n\t  2 lscpu                    0.00     0.00\n\t  2 setterm                  0.00     0.00\n\t  2 uname                    0.00     0.00\n\t  2 which                    0.00     0.00\n\t  1 cc                       0.00     0.00\n\t  1 date                     0.00     0.00\n\t  1 dconf worker             0.00     0.00\n\t  1 dirname                  0.00     0.00\n\t  1 dmesg                    0.00     0.00\n\t  1 dmidecode                0.00     0.00\n\t  1 ifconfig                 0.00     0.00\n\t  1 ip                       0.00     0.00\n\t  1 lsmod                    0.00     0.00\n\t  1 mktemp                   0.00     0.00\n\t  1 ps                       0.00     0.00\n\t  1 qdbus                    0.00     0.00\n\t  1 readlink                 0.00     0.00\n\t  1 realpath                 0.00     0.00\n\t  1 sed                      0.00     0.00\n\t  1 sort                     0.00     0.00\n\t  1 stty                     0.00     0.00\n\t  1 systemctl                0.00     0.00\n\t  1 template.sh              0.00     0.00\n\t  1 wc                       0.00     0.00\n0 processes running\n47 maximum processes\n<\/code><\/pre>\n\n\n\n<p>Computation block<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>      520810) node-octane      cpu=2 start=5.47  finish=35.14\n        520811) node             cpu=4 start=5.47  finish=35.12\n          520812) node             cpu=11 start=5.49  finish=35.12\n          520813) node             cpu=3 start=5.49  finish=35.12\n          520814) node             cpu=8 start=5.49  finish=35.12\n          520815) node             cpu=6 start=5.49  finish=35.12\n          520816) node             cpu=5 start=5.49  finish=35.12\n          520817) node             cpu=15 start=5.50  finish=35.12\n        520819) nodejs           cpu=5 start=35.13 finish=35.14\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Javascript Octane benchmark with one workload. It is quick running, single threaded. Topdown profile looks a bit scattered on retirement and stalls. AMD metrics show not much floating point, a high IPC and reasonable retirement rate. Intel metrics Process summary <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/workloads\/phoronix\/node-octane\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":58,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-2282","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/2282","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=2282"}],"version-history":[{"count":2,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/2282\/revisions"}],"predecessor-version":[{"id":2289,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/2282\/revisions\/2289"}],"up":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/pages\/58"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=2282"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}