I purchased an AMD Ryzen 5 5600 6-Core 2200MHz processor about 4 months ago. Today I got the following messages on my console:
Code:
Message from syslogd@ at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: Corrected error, no action required.
Message from syslogd@ at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|MiscV|AddrV|-|-|CECC|-|Poison|-]: 0xccccccc35b08c483
Message from syslogd@ at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: IPID: 0x0000000000000000
Message from syslogd@ at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: Error Addr: 0x0000000000000000
Message from syslogd@ at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: Coherent Slave Ext. Error Code: 8, SDP read response had no match in the CS queue.
Message from syslogd@ at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: cache level: L3/GEN, tx: INSN
And the following in syslog:
Code:
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: Corrected error, no action required.
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|MiscV|AddrV|-|-|CECC|-|Poison|-]: 0xccccccc35b08c483
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: Error Addr: 0x0000000000000000
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: IPID: 0x0000000000000000
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: Coherent Slave Ext. Error Code: 8, SDP read response had no match in the CS queue.
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: cache level: L3/GEN, tx: INSN
Is this something to worry about? Someone posted about the same or similar errors about a year and a half ago, https://www.remy.org.uk/tech.php?tech=1657321200, for an AMD Ryzen 5 5600X. He was seeing these messages every 2 to 3 weeks. He returned his CPU for exchange.
I'd really rather not go through the trouble of returning this processor and trying to make do in the meantime using some other computer. If this is not that serious, I'd rather keep using it until something bad happens.
I may just bite the bullet on this for now. This is my main work machine with 4 monitors and a Windows VM and I'd be hard pressed to put together a computer system that would let me work comfortably while waiting for a replacement. I've only seen this error once in the 4+ months I've been using it, running 24x7. Sadly, it might be more worthwhile to buy a new CPU for $130-ish and be back up the same day rather than go without for several weeks waiting on a replacement.
I'll leave this thread open and post back with any new developments.
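To keep track of how often the error comes back, the syslog lines shown earlier can be grouped into one event per timestamp. This is only a rough sketch: it assumes the traditional "Mon DD HH:MM:SS host kernel: [Hardware Error]: ..." layout seen in this thread, and `group_hardware_errors` is a made-up helper name, not part of any tool.

```python
import re

# Assumed line layout (taken from the syslog excerpt in this thread):
#   Dec 30 04:57:26 quadmon kernel: [Hardware Error]: <message>
LINE_RE = re.compile(
    r"^(\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+\S+\s+kernel:\s+"
    r"\[Hardware Error\]:\s+(.*)$"
)

def group_hardware_errors(lines):
    """Group [Hardware Error] lines by their syslog timestamp.

    Returns a dict mapping timestamp string -> list of messages,
    in the order the timestamps first appear.
    """
    events = {}
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match:
            events.setdefault(match.group(1), []).append(match.group(2))
    return events

if __name__ == "__main__":
    import sys
    # Usage (the log path is an assumption, it varies by distribution):
    #   python3 hwerr.py /var/log/syslog
    with open(sys.argv[1], errors="replace") as log:
        for stamp, messages in group_hardware_errors(log).items():
            print(f"{stamp}: {len(messages)} line(s)")
```

Lines that do not match the assumed layout are silently skipped, so a different syslog format would need the pattern adjusted.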
I must have jinxed myself. A few hours after posting that last message it happened again. The only difference is "Error Code: 18" instead of "Error Code: 8".
Code:
Message from syslogd@ at Wed Jan 3 05:12:51 2024 ...
: [Hardware Error]: Corrected error, no action required.
Message from syslogd@ at Wed Jan 3 05:12:51 2024 ...
: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|-|AddrV|PCC|-|-|-|Scrub]: 0xc748013bf892e95b
Message from syslogd@ at Wed Jan 3 05:12:51 2024 ...
: [Hardware Error]: IPID: 0x0000000000000000
Message from syslogd@ at Wed Jan 3 05:12:51 2024 ...
: [Hardware Error]: Error Addr: 0x0000000000000000
Message from syslogd@ at Wed Jan 3 05:12:51 2024 ...
: [Hardware Error]: Coherent Slave Ext. Error Code: 18
Message from syslogd@ at Wed Jan 3 05:12:51 2024 ...
: [Hardware Error]: cache level: L3/GEN, tx: GEN
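For anyone who wants to see where the bracketed flags and the "Ext. Error Code" come from, here is a minimal Python sketch that unpacks a raw MC20_STATUS value. The bit positions and the 6-bit extended error code field are assumptions based on my reading of the Linux kernel's x86 MCE decoding code, and `decode_status` is a hypothetical helper name, so check them against your kernel source and AMD's documentation before trusting the result.

```python
# Assumed bit positions in the MCA status register (not taken from this
# thread; verify against the kernel's x86 MCE headers).
FLAG_BITS = {
    "OVER": 62, "UC": 61, "MISCV": 59, "ADDRV": 58, "PCC": 57,
    "SYNDV": 53, "DEFERRED": 44, "POISON": 43, "SCRUB": 40,
}

def bit(status, name):
    """Return True if the named flag bit is set in the status word."""
    return bool((status >> FLAG_BITS[name]) & 1)

def decode_status(status):
    """Return (flag_string, extended_error_code) for a raw status value."""
    fields = [
        "Over" if bit(status, "OVER") else "-",
        # Uncorrected, deferred, or (otherwise) corrected error.
        "UE" if bit(status, "UC")
        else ("-" if bit(status, "DEFERRED") else "CE"),
        "MiscV" if bit(status, "MISCV") else "-",
        "AddrV" if bit(status, "ADDRV") else "-",
        "PCC" if bit(status, "PCC") else "-",
        "SyndV" if bit(status, "SYNDV") else "-",
    ]
    # Bits 46:45 are treated as a pair; the field is left out when both are 0.
    ecc = (status >> 45) & 0x3
    if ecc:
        fields.append("CECC" if ecc == 2 else "UECC")
    fields.append("Deferred" if bit(status, "DEFERRED") else "-")
    fields.append("Poison" if bit(status, "POISON") else "-")
    fields.append("Scrub" if bit(status, "SCRUB") else "-")
    # Extended error code: assumed to live in bits 21:16.
    xec = (status >> 16) & 0x3F
    return "|".join(fields), xec

if __name__ == "__main__":
    for raw in (0xCCCCCCC35B08C483, 0xC748013BF892E95B):
        flags, xec = decode_status(raw)
        print(f"{raw:#018x}: [{flags}] ext. error code {xec}")
```

Fed the two status values from this thread, it reproduces the flag strings the kernel printed and gives extended error codes 8 and 18, which suggests the assumed layout matches what the kernel used here.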
Although it's extremely rare, computer components (i.e. the CPU or RAM, and the chips in M.2 NVMe drives and SSDs) can fail. It happens in about 1 out of 100,000 or more units, but you can get the odd manufacturing defect. The manufacturer's quality assurance department can get lazy, and when that happens defective units can slip by and go on to an unsuspecting customer/end user.
I replaced the CPU with a new one of the same model last week and am still getting the Hardware Error:
Code:
Message from syslogd@ at Sun May 5 07:03:05 2024 ...
: [Hardware Error]: Corrected error, no action required.
Message from syslogd@ at Sun May 5 07:03:05 2024 ...
: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|-|AddrV|PCC|-|-|-|Scrub]: 0xc748013bf892e95b
Message from syslogd@ at Sun May 5 07:03:05 2024 ...
: [Hardware Error]: Error Addr: 0x0000000000000000
Message from syslogd@ at Sun May 5 07:03:05 2024 ...
: [Hardware Error]: IPID: 0x0000000000000000
Message from syslogd@ at Sun May 5 07:03:05 2024 ...
: [Hardware Error]: Coherent Slave Ext. Error Code: 18
Message from syslogd@ at Sun May 5 07:03:05 2024 ...
: [Hardware Error]: cache level: L3/GEN, tx: GEN
I'm very doubtful it's the CPU, especially given that I've upgraded the office workstations with 8 of these CPUs (AMD Ryzen 5 5600 6-Core) in the last two months and have seen no similar problem. The memory is new as of October 2023. The only things kept from the previous system are the graphics cards (NVIDIA Corporation GK208B [GeForce GT 710]). What could the problem be? The messages aren't very helpful. What is a "Coherent Slave"? Memory? Graphics? ... ?
Is it overclocked? Is the PSU "good" enough? Is the RAM OK? What about the temperature? And yes, it can be any other [faulty] hardware component, but it can be the CPU itself too. It is probably also worth reseating all the cards you have (RAM, video, whatever).
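For the temperature question, one quick check is to read the kernel's hwmon sensors directly. The following is a small sketch, assuming the standard sysfs layout under /sys/class/hwmon with `temp*_input` files in millidegrees Celsius; `read_hwmon_temps` is a hypothetical helper name, and which chips show up (for example `k10temp` on AMD CPUs) depends on the drivers loaded on your machine.

```python
from pathlib import Path

def read_hwmon_temps(root="/sys/class/hwmon"):
    """Return {(chip_name, label): degrees_celsius} for each temp*_input file."""
    temps = {}
    for chip in sorted(Path(root).glob("hwmon*")):
        name_file = chip / "name"
        chip_name = name_file.read_text().strip() if name_file.exists() else chip.name
        for sensor in sorted(chip.glob("temp*_input")):
            # Optional human-readable label, e.g. temp1_label next to temp1_input.
            label_file = sensor.with_name(sensor.name.replace("_input", "_label"))
            label = label_file.read_text().strip() if label_file.exists() else sensor.name
            try:
                millidegrees = int(sensor.read_text().strip())
            except (OSError, ValueError):
                continue  # sensor not readable right now; skip it
            temps[(chip_name, label)] = millidegrees / 1000.0
    return temps

if __name__ == "__main__":
    for (chip, label), degrees in read_hwmon_temps().items():
        print(f"{chip:12s} {label:16s} {degrees:6.1f} C")
```

Running it now and then while the machine is under load gives a rough idea of whether heat is a plausible factor; it says nothing about the PSU or RAM, which need their own tests.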