LinuxQuestions.org
Visit Jeremy's Blog.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices


Reply
  Search this Thread
Old 12-30-2023, 01:06 PM   #1
mfoley
Senior Member
 
Registered: Oct 2008
Location: Columbus, Ohio USA
Distribution: Slackware
Posts: 2,589

Rep: Reputation: 179Reputation: 179
Coherent Slave Ext. Error Code: 8


I purchased an AMD Ryzen 5 5600 6-Core 2200MHz processor about 4 months ago. Today I got the following messages on my console:
Code:
Message from syslogd@  at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: Corrected error, no action required.

Message from syslogd@  at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|MiscV|AddrV|-|-|CECC|-|Poison|-]: 0xccccccc35b08c483

Message from syslogd@  at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: IPID: 0x0000000000000000

Message from syslogd@  at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: Error Addr: 0x0000000000000000

Message from syslogd@  at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: Coherent Slave Ext. Error Code: 8, SDP read response had no match in the CS queue.

Message from syslogd@  at Sat Dec 30 04:57:26 2023 ...
: [Hardware Error]: cache level: L3/GEN, tx: INSN
And the following in syslog:
Code:
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: Corrected error, no action required.
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|MiscV|AddrV|-|-|CECC|-|Poison|-]: 0xccccccc35b08c483
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: Error Addr: 0x0000000000000000
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: IPID: 0x0000000000000000
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: Coherent Slave Ext. Error Code: 8, SDP read response had no match in the CS queue.
Dec 30 04:57:26 quadmon kernel: [Hardware Error]: cache level: L3/GEN, tx: INSN
Is this something to worry about? A fellow posted about these same/similar error about a year and a half ago, https://www.remy.org.uk/tech.php?tech=1657321200, for an AMD Ryzen 5 5600X. He was seeing these messages every 2 to 3 weeks. He returned his CPU for exchange.

I'd really rather not go through the trouble of returning this process and trying to make-do in the meantime using some other computer. If this is not that serious, I'd rather keep using it until something bad happens.

Thoughts?
 
Old 01-01-2024, 12:50 AM   #2
beachboy2
Senior Member
 
Registered: Jan 2007
Location: Wild West Wales, UK
Distribution: Linux Mint 21 MATE, EndeavourOS, antiX, MX Linux
Posts: 3,987
Blog Entries: 33

Rep: Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470
mfoley,

The vendor of a similar AMD 5 5600 CPU confirmed that the unit was faulty after its return by Remy:
https://www.remy.org.uk/tech.php?tech=1657321200

I would definitely return it under warranty as soon as possible.
 
Old 01-03-2024, 12:19 AM   #3
mfoley
Senior Member
 
Registered: Oct 2008
Location: Columbus, Ohio USA
Distribution: Slackware
Posts: 2,589

Original Poster
Rep: Reputation: 179Reputation: 179
I may just bite the bullet on this for now. This is my main work machine with 4 monitors and a Windows VM and I'd be hard pressed to put together a computer system that would let me work comfortably while waiting for a replacement. I've only seen this error once in the 4+ months I've been using it, running 24x7. Sadly, it might be more worthwhile to buy a new CPU for $130-ish and be back up the same day rather than go without for several weeks waiting on a replacement.

I'll leave this thread open and post back with any new developments.
 
Old 01-03-2024, 10:49 AM   #4
mfoley
Senior Member
 
Registered: Oct 2008
Location: Columbus, Ohio USA
Distribution: Slackware
Posts: 2,589

Original Poster
Rep: Reputation: 179Reputation: 179
I must have jinx myself. A few hours after posting that last message it happened again. The only difference is "Error Code: 18" instead of "Error Code 8".
Code:
Message from syslogd@  at Wed Jan  3 05:12:51 2024 ...
: [Hardware Error]: Corrected error, no action required.

Message from syslogd@  at Wed Jan  3 05:12:51 2024 ...
: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|-|AddrV|PCC|-|-|-|Scrub]: 0xc748013bf892e95b

Message from syslogd@  at Wed Jan  3 05:12:51 2024 ...
: [Hardware Error]: IPID: 0x0000000000000000

Message from syslogd@  at Wed Jan  3 05:12:51 2024 ...
: [Hardware Error]: Error Addr: 0x0000000000000000

Message from syslogd@  at Wed Jan  3 05:12:51 2024 ...
: [Hardware Error]: Coherent Slave Ext. Error Code: 18

Message from syslogd@  at Wed Jan  3 05:12:51 2024 ...
: [Hardware Error]: cache level: L3/GEN, tx: GEN
 
Old 01-03-2024, 11:22 AM   #5
beachboy2
Senior Member
 
Registered: Jan 2007
Location: Wild West Wales, UK
Distribution: Linux Mint 21 MATE, EndeavourOS, antiX, MX Linux
Posts: 3,987
Blog Entries: 33

Rep: Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470Reputation: 1470
mfoley,

Time for a replacement CPU.
 
Old 01-03-2024, 12:34 PM   #6
mfoley
Senior Member
 
Registered: Oct 2008
Location: Columbus, Ohio USA
Distribution: Slackware
Posts: 2,589

Original Poster
Rep: Reputation: 179Reputation: 179
Yup. It happened again today. This time, the desktop crashed and restarted. I'll get a replacement this week.
 
Old 01-04-2024, 05:25 PM   #7
niceflipper8827
Member
 
Registered: Sep 2023
Location: Washington State,USA
Distribution: ChromeOS,SlackWare,Android and Lubuntu
Posts: 68

Rep: Reputation: 2
All though it's extremely rare computer components i.e. CPU or RAM) and those in m.2 nvmes and SSDs can fail most of the time this rare occurrence does happen in about 1 out of 100,000 or more units you can get a couple of odd manufacturing defects such. The manufacturer's quality assurance department can get lazy and when that happens and defective units can slip by and go onto an unsuspecting customer/end user.
 
Old 05-06-2024, 01:54 PM   #8
mfoley
Senior Member
 
Registered: Oct 2008
Location: Columbus, Ohio USA
Distribution: Slackware
Posts: 2,589

Original Poster
Rep: Reputation: 179Reputation: 179
I replaced the CPU with a new one of the same model last week and am still getting the Hardware Error:
Code:
Message from syslogd@  at Sun May  5 07:03:05 2024 ...
: [Hardware Error]: Corrected error, no action required.

Message from syslogd@  at Sun May  5 07:03:05 2024 ...
: [Hardware Error]: CPU:1 (19:21:2) MC20_STATUS[Over|CE|-|AddrV|PCC|-|-|-|Scrub]: 0xc748013bf892e95b

Message from syslogd@  at Sun May  5 07:03:05 2024 ...
: [Hardware Error]: Error Addr: 0x0000000000000000

Message from syslogd@  at Sun May  5 07:03:05 2024 ...
: [Hardware Error]: IPID: 0x0000000000000000

Message from syslogd@  at Sun May  5 07:03:05 2024 ...
: [Hardware Error]: Coherent Slave Ext. Error Code: 18

Message from syslogd@  at Sun May  5 07:03:05 2024 ...
: [Hardware Error]: cache level: L3/GEN, tx: GEN
I'm very doubtful it's the CPU. Especially given that I've upgraded the office workstations with 8 of these CPUs (AMD Ryzen 5 5600 6-Core) in the last two months and have seen no such similar problem. The memory is new as of last October 2023. The only things kept from the previous system are the graphics cards (NVIDIA Corporation GK208B [GeForce GT 710]). What could the problem be? The messages aren't very helpful. What is a "Coherent Slave"? Memory? Graphcs? ... ?

Last edited by mfoley; 05-06-2024 at 01:55 PM.
 
Old 05-09-2024, 10:30 AM   #9
kilgoretrout
Senior Member
 
Registered: Oct 2003
Posts: 2,988

Rep: Reputation: 388Reputation: 388Reputation: 388Reputation: 388
Check the ram by running memtest overnight. See if you get any errors. Could be the motherboard or power supply as well.
 
Old 05-09-2024, 10:40 AM   #10
pan64
LQ Addict
 
Registered: Mar 2012
Location: Hungary
Distribution: debian/ubuntu/suse ...
Posts: 21,981

Rep: Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337Reputation: 7337
is it overclocked? PSU is "good" enough? RAM is ok? What about the temperature? And yes, it can be any other [faulty] hardware component, but can be the CPU itself too. And probably reseat all the cards you have (ram, video, whatever).

Last edited by pan64; 05-09-2024 at 10:42 AM.
 
  


Reply

Tags
fault, ryzen



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Coherent Unix source code download BenCollver Linux - News 0 04-05-2015 08:22 AM
LXer: My life with Coherent, part 2 LXer Syndicated Linux News 0 05-04-2012 09:00 PM
LXer: My life with Coherent, part 1 LXer Syndicated Linux News 0 02-08-2012 09:01 PM
LXer: Your Company Needs to Develop a Coherent Mobile App Plan LXer Syndicated Linux News 0 12-01-2011 05:20 PM
ndiswrapper troubles: "Failed to allocate DMA coherent memory." Oodini Linux - Wireless Networking 1 01-12-2005 12:20 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware

All times are GMT -5. The time now is 10:02 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration