Hello, I have a question about how the PassMark benchmark works with some of these very high core count AMD processors.
First of all, I want to preface by saying that I've searched the forums and read some of the older threads on this issue, which were helpful, but I still have some further clarification questions. I also see how often you guys get berated by fanboys and I have no interest in doing that here. My question is really AMD vs AMD anyway, so we can leave Intel out of the discussion. This question is also not meant to criticize or complain. I'm simply looking for some information to clear up my understanding about how some of this works.
As many others do, I use the PassMark benchmark to at least get a rough starting point of where to go with a particular system build. I know it will not be a 100% match to my specific use case, but unless I can take advantage of things like the new Xeon accelerators, etc., I should be able to get in the ballpark so I know which hardware to start with, or narrow it down to a smaller number of options, since I can't buy them all.
In the process of analyzing hardware options for several new servers, I got into looking at the difference between the dual EPYC 9654, the single EPYC 9654, and the Ryzen 9 7950X. I might not go with the absolute top end EPYC, but this comparison is useful to express my question. And yes, I'm comparing EPYC to Ryzen, as there's actually a SuperMicro server chassis coming out soon that uses blades with regular Ryzen chips, which seemed interesting.
I know that some of the EPYCs have a very low number of samples reported, and that adding cores has diminishing returns, and other bottlenecks can arise due to inter-socket communication, NUMA nodes, memory latency/throughput, etc. I'm also aware that in the case of EPYC vs Ryzen, the clock speed is not the same, and the architecture is very similar, both should be Zen 4, but obviously not identical. However, despite that, I'm still surprised by the respective benchmarks between these chips. Again -- I'm intrigued by this and asking for clarification, not criticizing or claiming that the PassMark result is wrong.
My use case is lots of identical little VMs or LXC containers on a Proxmox host. This use case should therefore scale very nicely with cores, and with the older hardware I have already, it does in fact scale nicely. I would assume the PassMark benchmark scales nicely too, as from what I read, the benchmark spins up a bunch of individual threads that don't communicate with each other. Though I did see one comment from a couple years ago that PassMark scaled well to 64 cores, which is obviously a problem for a dual EPYC 9654. But that could be an outdated limitation.
Dual EPYC 9654 (192 cores, 384 threads, 3.55GHz all core turbo) -- 149,727
Single EPYC 9654 (96 cores, 192 threads, 3.55GHz all core turbo) -- 126,045
Ryzen 9 7950X (16 cores, 32 threads, 5.1GHz all core turbo) -- 63,625
So what I see is that the 16 core Ryzen gets to 42% of the performance a dual EPYC 9654 with 192 cores. And the dual EPYC only gets 19% better performance than the single EPYC. From what I can fathom, this is either true due to the basic nature of scaling across cores and the architecture of the chips themselves, or it's not true and is an artifact of the way PassMark itself scales across cores. But I thought PassMark scaled well across cores, which is why I'm confused. That, or I'm just interpreting these scores wrong and I need help understanding what I need to be doing differently.
To break down the Ryzen comparison better, it should have roughly the same IPC as the Epyc, as they're both Zen 4 based. Maybe not, as I'm sure cache sizes are different, etc. Anyway, I'm going to go ahead and assume it's the same ballpark. EPYC has 30% lower all core turbo speeds than the Ryzen, so let's reduce the Ryzen result by that amount. That puts the modified Ryzen score at about 44,500. Multiply by 6x (96c/16c=6) = 267,000. Multiply by 2 again for the dual EPYC and we get 534,000. I get that this is oversimplified, but it's such an immense difference (I expected 3.6x what I actually got), that this is very relevant for helping me decide which direction to go. Is core scaling really that bad? If so, then two Ryzen blades gives me nearly as much as a dual EPYC 9654, and that's an awesome way to go! But I don't want to trick myself into that. I've seen other benchmarks where the performance at least seemed to scale pretty linearly with cores on those EPYCs, so I'm just trying to figure out what I might be missing. How should I be interpreting these results?
First of all, I want to preface by saying that I've searched the forums and read some of the older threads on this issue, which were helpful, but I still have some further clarification questions. I also see how often you guys get berated by fanboys and I have no interest in doing that here. My question is really AMD vs AMD anyway, so we can leave Intel out of the discussion. This question is also not meant to criticize or complain. I'm simply looking for some information to clear up my understanding about how some of this works.
As many others do, I use the PassMark benchmark to at least get a rough starting point of where to go with a particular system build. I know it will not be a 100% match to my specific use case, but unless I can take advantage of things like the new Xeon accelerators, etc., I should be able to get in the ballpark so I know which hardware to start with, or narrow it down to a smaller number of options, since I can't buy them all.
In the process of analyzing hardware options for several new servers, I got into looking at the difference between the dual EPYC 9654, the single EPYC 9654, and the Ryzen 9 7950X. I might not go with the absolute top end EPYC, but this comparison is useful to express my question. And yes, I'm comparing EPYC to Ryzen, as there's actually a SuperMicro server chassis coming out soon that uses blades with regular Ryzen chips, which seemed interesting.
I know that some of the EPYCs have a very low number of samples reported, and that adding cores has diminishing returns, and other bottlenecks can arise due to inter-socket communication, NUMA nodes, memory latency/throughput, etc. I'm also aware that in the case of EPYC vs Ryzen, the clock speed is not the same, and the architecture is very similar, both should be Zen 4, but obviously not identical. However, despite that, I'm still surprised by the respective benchmarks between these chips. Again -- I'm intrigued by this and asking for clarification, not criticizing or claiming that the PassMark result is wrong.
My use case is lots of identical little VMs or LXC containers on a Proxmox host. This use case should therefore scale very nicely with cores, and with the older hardware I have already, it does in fact scale nicely. I would assume the PassMark benchmark scales nicely too, as from what I read, the benchmark spins up a bunch of individual threads that don't communicate with each other. Though I did see one comment from a couple years ago that PassMark scaled well to 64 cores, which is obviously a problem for a dual EPYC 9654. But that could be an outdated limitation.
Dual EPYC 9654 (192 cores, 384 threads, 3.55GHz all core turbo) -- 149,727
Single EPYC 9654 (96 cores, 192 threads, 3.55GHz all core turbo) -- 126,045
Ryzen 9 7950X (16 cores, 32 threads, 5.1GHz all core turbo) -- 63,625
So what I see is that the 16 core Ryzen gets to 42% of the performance a dual EPYC 9654 with 192 cores. And the dual EPYC only gets 19% better performance than the single EPYC. From what I can fathom, this is either true due to the basic nature of scaling across cores and the architecture of the chips themselves, or it's not true and is an artifact of the way PassMark itself scales across cores. But I thought PassMark scaled well across cores, which is why I'm confused. That, or I'm just interpreting these scores wrong and I need help understanding what I need to be doing differently.
To break down the Ryzen comparison better, it should have roughly the same IPC as the Epyc, as they're both Zen 4 based. Maybe not, as I'm sure cache sizes are different, etc. Anyway, I'm going to go ahead and assume it's the same ballpark. EPYC has 30% lower all core turbo speeds than the Ryzen, so let's reduce the Ryzen result by that amount. That puts the modified Ryzen score at about 44,500. Multiply by 6x (96c/16c=6) = 267,000. Multiply by 2 again for the dual EPYC and we get 534,000. I get that this is oversimplified, but it's such an immense difference (I expected 3.6x what I actually got), that this is very relevant for helping me decide which direction to go. Is core scaling really that bad? If so, then two Ryzen blades gives me nearly as much as a dual EPYC 9654, and that's an awesome way to go! But I don't want to trick myself into that. I've seen other benchmarks where the performance at least seemed to scale pretty linearly with cores on those EPYCs, so I'm just trying to figure out what I might be missing. How should I be interpreting these results?
Comment