{"id":92,"date":"2023-12-17T15:14:45","date_gmt":"2023-12-17T15:14:45","guid":{"rendered":"https:\/\/mvermeulen.org\/perf\/?p=92"},"modified":"2023-12-17T19:31:27","modified_gmt":"2023-12-17T19:31:27","slug":"new-ryzen-7840-machine","status":"publish","type":"post","link":"https:\/\/mvermeulen.org\/perf\/2023\/12\/17\/new-ryzen-7840-machine\/","title":{"rendered":"New Ryzen 7840 machine"},"content":{"rendered":"\n<p>I have set up a new AMD performance machine for experiments.  The processors is a <a href=\"https:\/\/www.amd.com\/en\/products\/apu\/amd-ryzen-7-7840u\">Ryzen 7840<\/a> (Phoenix) in a <a href=\"https:\/\/www.bee-link.com\/beelink-gaming-pc-ser7840-19943849-clone-1\">Beelink SER7<\/a> mini-PC.<\/p>\n\n\n\n<p>Following are some of the major parameters.  This comparison is with Intel i5-13500H which will be my Intel comparison microprocessor.<\/p>\n\n\n\n\n<table id=\"tablepress-1\" class=\"tablepress tablepress-id-1\">\n<thead>\n<tr class=\"row-1\">\n\t<th class=\"column-1\">Item<\/th><th class=\"column-2\">Ryzen 7840HS<\/th><th class=\"column-3\">i5-13500H<\/th><th class=\"column-4\">Notes<\/th>\n<\/tr>\n<\/thead>\n<tbody class=\"row-striping row-hover\">\n<tr class=\"row-2\">\n\t<td class=\"column-1\">Architecture<\/td><td class=\"column-2\">Zen4<\/td><td class=\"column-3\">Raptor Lake<\/td><td class=\"column-4\"><\/td>\n<\/tr>\n<tr class=\"row-3\">\n\t<td class=\"column-1\">Cores<\/td><td class=\"column-2\">8<\/td><td class=\"column-3\">12<br \/>\n4 performance (raptor cove)<br \/>\n8 efficiency (gracemont)<\/td><td class=\"column-4\"><\/td>\n<\/tr>\n<tr class=\"row-4\">\n\t<td class=\"column-1\">Threads<\/td><td class=\"column-2\">16<\/td><td class=\"column-3\">16<\/td><td class=\"column-4\"><\/td>\n<\/tr>\n<tr class=\"row-5\">\n\t<td class=\"column-1\">Base Clock<\/td><td class=\"column-2\">3.8 GHz<\/td><td class=\"column-3\">2.6 GHz, 1.9 GHz<\/td><td class=\"column-4\"><\/td>\n<\/tr>\n<tr class=\"row-6\">\n\t<td class=\"column-1\">Boost Clock<\/td><td class=\"column-2\">5.1 GHz<\/td><td class=\"column-3\">4.7 GHz, 3.5 GHz<\/td><td class=\"column-4\"><\/td>\n<\/tr>\n<tr class=\"row-7\">\n\t<td class=\"column-1\">TDP<\/td><td class=\"column-2\">35-45W<\/td><td class=\"column-3\">45W-95W<\/td><td class=\"column-4\">Set by vendor<\/td>\n<\/tr>\n<tr class=\"row-8\">\n\t<td class=\"column-1\">Memory<\/td><td class=\"column-2\">32 GB (2 x 16 GiB)<br \/>\n<br \/>\nDDR5 - 5600<br \/>\n<br \/>\n2 Memory Channels<\/td><td class=\"column-3\">16 GB<br \/>\n<br \/>\nDDR4 - 3200<br \/>\n<br \/>\n2 Memory Channels<\/td><td class=\"column-4\">Check BIOS for actual speed<\/td>\n<\/tr>\n<tr class=\"row-9\">\n\t<td class=\"column-1\">Stream<\/td><td class=\"column-2\">Copy: 71400 MB\/s<br \/>\nScale: 70300 MB\/s<br \/>\nAdd: 73600 MB\/s<br \/>\nTriad: 73000 MB\/s<\/td><td class=\"column-3\">Copy: 39200 MB\/s<br \/>\nScale: 39100 MB\/s<br \/>\nAdd: 40100 MB\/s<br \/>\nTriad: 40000 MB\/s<\/td><td class=\"column-4\">Measured<\/td>\n<\/tr>\n<tr class=\"row-10\">\n\t<td class=\"column-1\">Cache<\/td><td class=\"column-2\">L1 - 32kB, 8 way, 4 clocks<br \/>\n<br \/>\nL2 - 1 MB, 8-way, 14 clocks<br \/>\n<br \/>\nL3 - 16MB, 24 way, 47 clocks<\/td><td class=\"column-3\">L1 - 48 kB, 12-way\/8-way, 3\/5 clocks<br \/>\n<br \/>\nL2 - 1 MB, 10-way\/16-way, 15-20 clocks<br \/>\n<br \/>\nL3 - 18 MB, 10-way, 65-20 clocks<\/td><td class=\"column-4\">Agner Fog architecture document and likwid-topology<\/td>\n<\/tr>\n<tr class=\"row-11\">\n\t<td class=\"column-1\">lmbench<\/td><td class=\"column-2\">L1 - 0.8 ns<br \/>\nL2 - 3 ns<br \/>\nL3 - 8 ns<\/td><td class=\"column-3\">L1 - 1.3 ns, 1.0 ns<br \/>\nL2 - 4.4 ns, 8ns<br \/>\nL3 - 12 ns, 19ns<\/td><td class=\"column-4\">Measured in Nanoseconds<\/td>\n<\/tr>\n<tr class=\"row-12\">\n\t<td class=\"column-1\">Graphics<\/td><td class=\"column-2\">Radeon 780M<br \/>\n<br \/>\n12 cores<br \/>\n<br \/>\n2700 MHz<\/td><td class=\"column-3\">Intel Iris Xe<\/td><td class=\"column-4\"><\/td>\n<\/tr>\n<tr class=\"row-13\">\n\t<td class=\"column-1\">Phoronix stream<\/td><td class=\"column-2\">Average: 40604 MB\/s<\/td><td class=\"column-3\">Average: 35422 MB\/s<\/td><td class=\"column-4\">1.15x ratio smaller than optimized compiler results above<\/td>\n<\/tr>\n<tr class=\"row-14\">\n\t<td class=\"column-1\">Phoronix coremark<\/td><td class=\"column-2\">Average 464076 Iterations\/second<\/td><td class=\"column-3\">Average 388569 Iterations\/second<\/td><td class=\"column-4\">1.19 ratio<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<!-- #tablepress-1 from cache -->\n\n\n\n<p>Following is the topology shown by likwid-topology<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>--------------------------------------------------------------------------------\nCPU name:\tAMD Ryzen 7 7840HS w\/ Radeon 780M Graphics     \nCPU type:\tnil\nCPU stepping:\t1\n********************************************************************************\nHardware Thread Topology\n********************************************************************************\nSockets:\t\t1\nCores per socket:\t8\nThreads per core:\t2\n--------------------------------------------------------------------------------\nHWThread\tThread\t\tCore\t\tSocket\t\tAvailable\n0\t\t0\t\t0\t\t0\t\t*\n1\t\t0\t\t1\t\t0\t\t*\n2\t\t0\t\t2\t\t0\t\t*\n3\t\t0\t\t3\t\t0\t\t*\n4\t\t0\t\t4\t\t0\t\t*\n5\t\t0\t\t5\t\t0\t\t*\n6\t\t0\t\t6\t\t0\t\t*\n7\t\t0\t\t7\t\t0\t\t*\n8\t\t1\t\t0\t\t0\t\t*\n9\t\t1\t\t1\t\t0\t\t*\n10\t\t1\t\t2\t\t0\t\t*\n11\t\t1\t\t3\t\t0\t\t*\n12\t\t1\t\t4\t\t0\t\t*\n13\t\t1\t\t5\t\t0\t\t*\n14\t\t1\t\t6\t\t0\t\t*\n15\t\t1\t\t7\t\t0\t\t*\n--------------------------------------------------------------------------------\nSocket 0:\t\t( 0 8 1 9 2 10 3 11 4 12 5 13 6 14 7 15 )\n--------------------------------------------------------------------------------\n********************************************************************************\nCache Topology\n********************************************************************************\nLevel:\t\t\t1\nSize:\t\t\t32 kB\nCache groups:\t\t( 0 8 ) ( 1 9 ) ( 2 10 ) ( 3 11 ) ( 4 12 ) ( 5 13 ) ( 6 14 ) ( 7 15 )\n--------------------------------------------------------------------------------\nLevel:\t\t\t2\nSize:\t\t\t1 MB\nCache groups:\t\t( 0 8 ) ( 1 9 ) ( 2 10 ) ( 3 11 ) ( 4 12 ) ( 5 13 ) ( 6 14 ) ( 7 15 )\n--------------------------------------------------------------------------------\nLevel:\t\t\t3\nSize:\t\t\t16 MB\nCache groups:\t\t( 0 8 1 9 2 10 3 11 4 12 5 13 6 14 7 15 )\n--------------------------------------------------------------------------------\n*******************************************************************************\nNUMA Topology\n********************************************************************************\nNUMA domains:\t\t1\n--------------------------------------------------------------------------------\nDomain:\t\t\t0\nProcessors:\t\t( 0 8 1 9 2 10 3 11 4 12 5 13 6 14 7 15 )\nDistances:\t\t10\nFree memory:\t\t4102.21 MB\nTotal memory:\t\t27760.4 MB\n--------------------------------------------------------------------------------<\/code><\/pre>\n\n\n\n<p>Following are the outputs from stream<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>-------------------------------------------------------------\nSTREAM version $Revision: 5.10 $\n-------------------------------------------------------------\nThis system uses 8 bytes per array element.\n-------------------------------------------------------------\nArray size = 100000000 (elements), Offset = 0 (elements)\nMemory per array = 762.9 MiB (= 0.7 GiB).\nTotal memory required = 2288.8 MiB (= 2.2 GiB).\nEach kernel will be executed 100 times.\n The *best* time for each kernel (excluding the first iteration)\n will be used to compute the reported bandwidth.\n-------------------------------------------------------------\nNumber of Threads requested = 2\nNumber of Threads counted = 2\n-------------------------------------------------------------\nYour clock granularity\/precision appears to be 1 microseconds.\nEach test below will take on the order of 26015 microseconds.\n   (= 26015 clock ticks)\nIncrease the size of the arrays if this shows that\nyou are not getting at least 20 clock ticks per test.\n-------------------------------------------------------------\nWARNING -- The above is only a rough guideline.\nFor best results, please be sure you know the\nprecision of your system timer.\n-------------------------------------------------------------\nFunction    Best Rate MB\/s  Avg time     Min time     Max time\nCopy:           71476.8     0.023065     0.022385     0.025265\nScale:          70360.2     0.023345     0.022740     0.025791\nAdd:            73617.1     0.033653     0.032601     0.036008\nTriad:          73017.4     0.033967     0.032869     0.036821\n-------------------------------------------------------------\nSolution Validates: avg error less than 1.000000e-13 on all three arrays\n-------------------------------------------------------------\n<\/code><\/pre>\n\n\n\n<p>Following are the outputs from lmbench<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>\"stride=1024\n0.00098 0.782\n0.00195 0.782\n0.00293 0.782\n0.00391 0.790\n0.00586 0.784\n0.00781 0.787\n0.01172 0.789\n0.01562 0.785\n0.02344 0.782\n0.03125 0.785\n0.04688 1.480\n0.06250 2.746\n0.09375 1.243\n0.12500 2.746\n0.18750 2.745\n0.25000 2.744\n0.37500 3.043\n0.50000 3.041\n0.75000 3.084\n1.00000 6.447\n1.50000 7.525\n2.00000 7.670\n3.00000 6.277\n4.00000 7.367\n6.00000 8.717\n8.00000 7.840\n12.00000 8.516\n16.00000 11.334\n24.00000 17.369\n32.00000 19.499\n48.00000 20.862\n64.00000 20.587\n96.00000 20.102\n128.00000 20.390\n192.00000 20.821\n256.00000 20.434\n384.00000 21.656\n512.00000 20.342\n768.00000 20.997\n1024.00000 20.326\n1536.00000 21.571\n2048.00000 19.647\n3072.00000 21.259\n4096.00000 23.743\n6144.00000 20.942\n8192.00000 21.334\n12288.00000 21.164\n16384.00000 20.588\n<\/code><\/pre>\n\n\n\n<p>Following are selected entries from output from lshw<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>Motherboard: SER\nBIOS: American Megatrends: SER7PRO_P5C8V30 - 09\/15\/2023\nMemory: CT16G56C46S5.M8G1\nNVMe - WD_BLACK SN850X 4000GB<\/code><\/pre>\n\n\n\n<p>Following are several review and benchmark threads from <a href=\"https:\/\/www.phoronix.com\">phoronix.com<\/a> with 7840HS systems:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Ryzen 7840U performance benchmarks (processor above is 7840HS) &#8211; <a href=\"https:\/\/www.phoronix.com\/review\/amd-ryzen7-7840u\">https:\/\/www.phoronix.com\/review\/amd-ryzen7-7840u<\/a><\/li>\n\n\n\n<li>Ryzen 7840U benchmark comparison between Windows and Linux &#8211; <a href=\"https:\/\/www.phoronix.com\/review\/amd-7840u-windows-linux\">https:\/\/www.phoronix.com\/review\/amd-7840u-windows-linux<\/a><\/li>\n\n\n\n<li>Linux kernel 6.4 has a fix for the PMF (platform management framework) driver &#8211; <a href=\"https:\/\/www.phoronix.com\/news\/AMD-PMF-7040-Ryzen-Laptop-Crash\">https:\/\/www.phoronix.com\/news\/AMD-PMF-7040-Ryzen-Laptop-Crash<\/a><\/li>\n\n\n\n<li>AVX-512 performance &#8211; <a href=\"https:\/\/www.phoronix.com\/review\/amd-ryzen7040-avx512\">https:\/\/www.phoronix.com\/review\/amd-ryzen7040-avx512<\/a>. Benchmarks with AVX-512 on and off.<\/li>\n\n\n\n<li>Linux laptop &#8211; Framework 13 &#8211; <a href=\"https:\/\/www.phoronix.com\/review\/framework-13-amd\">https:\/\/www.phoronix.com\/review\/framework-13-amd<\/a>. This is a modular and upgradable laptop.<\/li>\n<\/ul>\n\n\n\n<p>These articles and benchmarks provide areas for deeper analysis to understand how the workloads are exercising the processor.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I have set up a new AMD performance machine for experiments. The processors is a Ryzen 7840 (Phoenix) in a Beelink SER7 mini-PC. Following are some of the major parameters. This comparison is with Intel i5-13500H which will be my <span class=\"excerpt-dots\">&hellip;<\/span> <a class=\"more-link\" href=\"https:\/\/mvermeulen.org\/perf\/2023\/12\/17\/new-ryzen-7840-machine\/\"><span class=\"more-msg\">Continue reading &rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9],"tags":[12],"class_list":["post-92","post","type-post","status-publish","format-standard","hentry","category-hardware","tag-7840hs"],"_links":{"self":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/posts\/92","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/comments?post=92"}],"version-history":[{"count":6,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/posts\/92\/revisions"}],"predecessor-version":[{"id":108,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/posts\/92\/revisions\/108"}],"wp:attachment":[{"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/media?parent=92"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/categories?post=92"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mvermeulen.org\/perf\/wp-json\/wp\/v2\/tags?post=92"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}