Hello, this post will be split into 2 parts, but can be perceived as being highly related.
We have a need to test a wide range of DDR5-6400 RDIMM modules and are having trouble determining a good setup to do so (native 6400, non-overclocked).
Part 1
I have been experimenting with a Supermicro X14SBI-F board paired with a Xeon 6505P CPU. We have MemTest Site 11.4 running over PXE.
This setup is very cost effective and I would love for it to work but theres a couple things hindering me from going all-in on it:
1. For some reason, ECC Polling does not work on this board. I have updated the BIOS and BMC firmware to the latest versions, WHEA is Enabled. I am unable to set ECC Polling to Enabled in MemTest. It simply gives me the message of "ECC is not enabled/unsupported on this system". I've gone through every BIOS menu and cant find any explicit options for enabling/disabling ECC all together so as far as I can tell, ECC ought to be working. (unless this particular hardware is unsupported by MemTest itself, please advise). If it is of any use as additional information, we also operate X13SRA-TF boards (DDR5-4800 RDIMM) that do allow for toggling ECC Polling.
Side Note: On the aforementioned X13 boards, I have been running sticks that reported correctable ECC errors (via ProxMox on a production system) for like 120+ hours now and still have not seen them appear in MemTest. I don't think they were a fluke because the error addresses were consistent across multiple instances of the error.
2. I am not completely sold yet on the viability of Supermicro motherboards for general memory testing. They are the most cost effective option on paper, but if they can't actually facilitate memory testing via MemTest, they're kind of off the table. (see part 2 of this topic for more details).
Part 2
For some background, we are a memory module company. We test thousands of memory modules a day. We have been operating DDR5 RDIMM testing machines for over a year at this point and have never once had a DDR5 RDIMM modules give an actual MemTest redline error while running as a single module (ive witnessed them when running multiple modules but could never be reproduced when ran as singles).
Before jumping to conclusions about this being poor DDR5 RDIMM support with MemTest, I suspect it could have to do with Supermicro boards being picky about memory in general. It is fairly common for modules to simply not POST at all rather than enter MemTest and receive a testing error. It seems to be a hard pass-or-fail just attempting to POST on these Supermicro boards (both X13 and X14).
I wanted to leave this kind of open ended to the guys over at passmark. What are you using during development for testing DDR5 RDIMMs and what would you say is the least finicky motherboard/CPU vendors for DDR5 RDIMM testing that you've found? We are not opposed to AMD so long as they will produce reliable and consistent results, we just need something that can be trusted to give adequate testing results. Have you experienced oddities with DDR5 RDIMM on Supermicro boards similar to what ive described (impossible to find a stick that actually posts and errors in MemTest)?
I will end this off by apologizing for the sporadic nature of the post. I'm at my whits end trying to understand this and trying to actually find a bad DDR5 RDIMM module (via MemTest).
If you have a DDR5 RDIMM module that reliably redlines in MemTest, I will literally buy it off of you at this point.
I am highly technical and am more than willing to provide whatever additional details are necessary to assist. Let me know.
Thank you for reading and taking the time to consider.
We have a need to test a wide range of DDR5-6400 RDIMM modules and are having trouble determining a good setup to do so (native 6400, non-overclocked).
Part 1
I have been experimenting with a Supermicro X14SBI-F board paired with a Xeon 6505P CPU. We have MemTest Site 11.4 running over PXE.
This setup is very cost effective and I would love for it to work but theres a couple things hindering me from going all-in on it:
1. For some reason, ECC Polling does not work on this board. I have updated the BIOS and BMC firmware to the latest versions, WHEA is Enabled. I am unable to set ECC Polling to Enabled in MemTest. It simply gives me the message of "ECC is not enabled/unsupported on this system". I've gone through every BIOS menu and cant find any explicit options for enabling/disabling ECC all together so as far as I can tell, ECC ought to be working. (unless this particular hardware is unsupported by MemTest itself, please advise). If it is of any use as additional information, we also operate X13SRA-TF boards (DDR5-4800 RDIMM) that do allow for toggling ECC Polling.
Side Note: On the aforementioned X13 boards, I have been running sticks that reported correctable ECC errors (via ProxMox on a production system) for like 120+ hours now and still have not seen them appear in MemTest. I don't think they were a fluke because the error addresses were consistent across multiple instances of the error.
2. I am not completely sold yet on the viability of Supermicro motherboards for general memory testing. They are the most cost effective option on paper, but if they can't actually facilitate memory testing via MemTest, they're kind of off the table. (see part 2 of this topic for more details).
Part 2
For some background, we are a memory module company. We test thousands of memory modules a day. We have been operating DDR5 RDIMM testing machines for over a year at this point and have never once had a DDR5 RDIMM modules give an actual MemTest redline error while running as a single module (ive witnessed them when running multiple modules but could never be reproduced when ran as singles).
Before jumping to conclusions about this being poor DDR5 RDIMM support with MemTest, I suspect it could have to do with Supermicro boards being picky about memory in general. It is fairly common for modules to simply not POST at all rather than enter MemTest and receive a testing error. It seems to be a hard pass-or-fail just attempting to POST on these Supermicro boards (both X13 and X14).
I wanted to leave this kind of open ended to the guys over at passmark. What are you using during development for testing DDR5 RDIMMs and what would you say is the least finicky motherboard/CPU vendors for DDR5 RDIMM testing that you've found? We are not opposed to AMD so long as they will produce reliable and consistent results, we just need something that can be trusted to give adequate testing results. Have you experienced oddities with DDR5 RDIMM on Supermicro boards similar to what ive described (impossible to find a stick that actually posts and errors in MemTest)?
I will end this off by apologizing for the sporadic nature of the post. I'm at my whits end trying to understand this and trying to actually find a bad DDR5 RDIMM module (via MemTest).
If you have a DDR5 RDIMM module that reliably redlines in MemTest, I will literally buy it off of you at this point.
I am highly technical and am more than willing to provide whatever additional details are necessary to assist. Let me know.
Thank you for reading and taking the time to consider.


Comment