New Ryzen 7840 machine
I have set up a new AMD performance machine for experiments. The processors is a Ryzen 7840 (Phoenix) in a Beelink SER7 mini-PC.
Following are some of the major parameters. This comparison is with Intel i5-13500H which will be my Intel comparison microprocessor.
| Item | Ryzen 7840HS | i5-13500H | Notes |
|---|---|---|---|
| Architecture | Zen4 | Raptor Lake | |
| Cores | 8 | 12 4 performance (raptor cove) 8 efficiency (gracemont) | |
| Threads | 16 | 16 | |
| Base Clock | 3.8 GHz | 2.6 GHz, 1.9 GHz | |
| Boost Clock | 5.1 GHz | 4.7 GHz, 3.5 GHz | |
| TDP | 35-45W | 45W-95W | Set by vendor |
| Memory | 32 GB (2 x 16 GiB) DDR5 - 5600 2 Memory Channels | 16 GB DDR4 - 3200 2 Memory Channels | Check BIOS for actual speed |
| Stream | Copy: 71400 MB/s Scale: 70300 MB/s Add: 73600 MB/s Triad: 73000 MB/s | Copy: 39200 MB/s Scale: 39100 MB/s Add: 40100 MB/s Triad: 40000 MB/s | Measured |
| Cache | L1 - 32kB, 8 way, 4 clocks L2 - 1 MB, 8-way, 14 clocks L3 - 16MB, 24 way, 47 clocks | L1 - 48 kB, 12-way/8-way, 3/5 clocks L2 - 1 MB, 10-way/16-way, 15-20 clocks L3 - 18 MB, 10-way, 65-20 clocks | Agner Fog architecture document and likwid-topology |
| lmbench | L1 - 0.8 ns L2 - 3 ns L3 - 8 ns | L1 - 1.3 ns, 1.0 ns L2 - 4.4 ns, 8ns L3 - 12 ns, 19ns | Measured in Nanoseconds |
| Graphics | Radeon 780M 12 cores 2700 MHz | Intel Iris Xe | |
| Phoronix stream | Average: 40604 MB/s | Average: 35422 MB/s | 1.15x ratio smaller than optimized compiler results above |
| Phoronix coremark | Average 464076 Iterations/second | Average 388569 Iterations/second | 1.19 ratio |
Following is the topology shown by likwid-topology
--------------------------------------------------------------------------------
CPU name: AMD Ryzen 7 7840HS w/ Radeon 780M Graphics
CPU type: nil
CPU stepping: 1
********************************************************************************
Hardware Thread Topology
********************************************************************************
Sockets: 1
Cores per socket: 8
Threads per core: 2
--------------------------------------------------------------------------------
HWThread Thread Core Socket Available
0 0 0 0 *
1 0 1 0 *
2 0 2 0 *
3 0 3 0 *
4 0 4 0 *
5 0 5 0 *
6 0 6 0 *
7 0 7 0 *
8 1 0 0 *
9 1 1 0 *
10 1 2 0 *
11 1 3 0 *
12 1 4 0 *
13 1 5 0 *
14 1 6 0 *
15 1 7 0 *
--------------------------------------------------------------------------------
Socket 0: ( 0 8 1 9 2 10 3 11 4 12 5 13 6 14 7 15 )
--------------------------------------------------------------------------------
********************************************************************************
Cache Topology
********************************************************************************
Level: 1
Size: 32 kB
Cache groups: ( 0 8 ) ( 1 9 ) ( 2 10 ) ( 3 11 ) ( 4 12 ) ( 5 13 ) ( 6 14 ) ( 7 15 )
--------------------------------------------------------------------------------
Level: 2
Size: 1 MB
Cache groups: ( 0 8 ) ( 1 9 ) ( 2 10 ) ( 3 11 ) ( 4 12 ) ( 5 13 ) ( 6 14 ) ( 7 15 )
--------------------------------------------------------------------------------
Level: 3
Size: 16 MB
Cache groups: ( 0 8 1 9 2 10 3 11 4 12 5 13 6 14 7 15 )
--------------------------------------------------------------------------------
*******************************************************************************
NUMA Topology
********************************************************************************
NUMA domains: 1
--------------------------------------------------------------------------------
Domain: 0
Processors: ( 0 8 1 9 2 10 3 11 4 12 5 13 6 14 7 15 )
Distances: 10
Free memory: 4102.21 MB
Total memory: 27760.4 MB
--------------------------------------------------------------------------------
Following are the outputs from stream
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 100000000 (elements), Offset = 0 (elements)
Memory per array = 762.9 MiB (= 0.7 GiB).
Total memory required = 2288.8 MiB (= 2.2 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 2
Number of Threads counted = 2
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 26015 microseconds.
(= 26015 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 71476.8 0.023065 0.022385 0.025265
Scale: 70360.2 0.023345 0.022740 0.025791
Add: 73617.1 0.033653 0.032601 0.036008
Triad: 73017.4 0.033967 0.032869 0.036821
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Following are the outputs from lmbench
"stride=1024
0.00098 0.782
0.00195 0.782
0.00293 0.782
0.00391 0.790
0.00586 0.784
0.00781 0.787
0.01172 0.789
0.01562 0.785
0.02344 0.782
0.03125 0.785
0.04688 1.480
0.06250 2.746
0.09375 1.243
0.12500 2.746
0.18750 2.745
0.25000 2.744
0.37500 3.043
0.50000 3.041
0.75000 3.084
1.00000 6.447
1.50000 7.525
2.00000 7.670
3.00000 6.277
4.00000 7.367
6.00000 8.717
8.00000 7.840
12.00000 8.516
16.00000 11.334
24.00000 17.369
32.00000 19.499
48.00000 20.862
64.00000 20.587
96.00000 20.102
128.00000 20.390
192.00000 20.821
256.00000 20.434
384.00000 21.656
512.00000 20.342
768.00000 20.997
1024.00000 20.326
1536.00000 21.571
2048.00000 19.647
3072.00000 21.259
4096.00000 23.743
6144.00000 20.942
8192.00000 21.334
12288.00000 21.164
16384.00000 20.588
Following are selected entries from output from lshw
Motherboard: SER
BIOS: American Megatrends: SER7PRO_P5C8V30 - 09/15/2023
Memory: CT16G56C46S5.M8G1
NVMe - WD_BLACK SN850X 4000GB
Following are several review and benchmark threads from phoronix.com with 7840HS systems:
- Ryzen 7840U performance benchmarks (processor above is 7840HS) – https://www.phoronix.com/review/amd-ryzen7-7840u
- Ryzen 7840U benchmark comparison between Windows and Linux – https://www.phoronix.com/review/amd-7840u-windows-linux
- Linux kernel 6.4 has a fix for the PMF (platform management framework) driver – https://www.phoronix.com/news/AMD-PMF-7040-Ryzen-Laptop-Crash
- AVX-512 performance – https://www.phoronix.com/review/amd-ryzen7040-avx512. Benchmarks with AVX-512 on and off.
- Linux laptop – Framework 13 – https://www.phoronix.com/review/framework-13-amd. This is a modular and upgradable laptop.
These articles and benchmarks provide areas for deeper analysis to understand how the workloads are exercising the processor.

Comments
New Ryzen 7840 machine — No Comments
HTML tags allowed in your comment: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>