Hello,
we recently bought two computers with DDR3 RAM (G-Skill Jan 2016 in the label of the DIMMs), and in one of them we experienced a corruption in database contents. Checking, I noticed that 7 bits were flipped along a blob of 14MB.
I tested using the userspace google rowhammer_test and it hit a bitflip in Iteration 10 (after around 10 seconds). Repeatedly running the test it effectively failed always below iteration 300.
I tested the RAM with memtest 6.3.0 (that should include a rowhammer test), and it found nothing. I tested the RAM with memtest 4.3.7 (without rowhammer) and it found nothing.
Nevertheless, using the memtest86+ 5.01 with rowhammer test (https://github.com/CMU-SAFARI/rowhammer) it found always addresses with bitflips. In both computers.
So, to sum up:
1) we have some evidence that there is a RAM problem: database corruption with bitflips
2) the google rowhammer_test linux program finds bitflips very quickly in one computer, and not so quick in the other computer, yet finds.
3) the memtest86+ 5.01 with rowhammer tests finds the bitflips in every run, in both computers.
4) memtest86 6.3.0 doesn't find any problem in those computers, having run the set of tests two or three times in each.
5) Replacing the RAM with good modules, memtest86+ 5.01 and the google rowhammer_test do not find any problem.
That makes me not trust much memtest86 6.3.0, definitely. The evidence happens in two computers, spanning 4 DIMMs. Do you have any idea of what can be going on? Is there a bug in memtest86 6.3.0?
we recently bought two computers with DDR3 RAM (G-Skill Jan 2016 in the label of the DIMMs), and in one of them we experienced a corruption in database contents. Checking, I noticed that 7 bits were flipped along a blob of 14MB.
I tested using the userspace google rowhammer_test and it hit a bitflip in Iteration 10 (after around 10 seconds). Repeatedly running the test it effectively failed always below iteration 300.
I tested the RAM with memtest 6.3.0 (that should include a rowhammer test), and it found nothing. I tested the RAM with memtest 4.3.7 (without rowhammer) and it found nothing.
Nevertheless, using the memtest86+ 5.01 with rowhammer test (https://github.com/CMU-SAFARI/rowhammer) it found always addresses with bitflips. In both computers.
So, to sum up:
1) we have some evidence that there is a RAM problem: database corruption with bitflips
2) the google rowhammer_test linux program finds bitflips very quickly in one computer, and not so quick in the other computer, yet finds.
3) the memtest86+ 5.01 with rowhammer tests finds the bitflips in every run, in both computers.
4) memtest86 6.3.0 doesn't find any problem in those computers, having run the set of tests two or three times in each.
5) Replacing the RAM with good modules, memtest86+ 5.01 and the google rowhammer_test do not find any problem.
That makes me not trust much memtest86 6.3.0, definitely. The evidence happens in two computers, spanning 4 DIMMs. Do you have any idea of what can be going on? Is there a bug in memtest86 6.3.0?
Comment