[sles-beta] Problems with mcelog on HP-Proliant Server with AMD Opteron processors

Michael Krapp mkrapp at suse.com
Tue Mar 5 03:06:08 MST 2013


Hi,

mcelog does not support newer AMD processors (family 16 and greater).
On SP2, mcelog was simply missing the check and the error message.
The message is not really helpful at the moment, since we don't ship
edac_mce_amd with the SLES kernel. For now, please just disable mcelog.
On the other hand, no harm is done if mcelog starts on AMD systems: it
just bails out with that message, and nothing else is affected.
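
A minimal sketch of how to tell in advance whether a host is affected,
assuming the cutoff is exactly the one stated above (AMD, family 16 and
greater). The function names are made up for this illustration and are
not part of mcelog; the actual check inside mcelog may differ. Vendor
and family are read from /proc/cpuinfo.

```shell
#!/bin/sh
# Hypothetical helper (not part of mcelog): succeeds if the given
# vendor_id / "cpu family" pair falls into the range described above
# (AMD, family 16 and greater).
mcelog_expected_unsupported() {
    [ "$1" = "AuthenticAMD" ] && [ "${2:-0}" -ge 16 ]
}

# Hypothetical helper: print the first value of a /proc/cpuinfo key.
# $1 = key (e.g. "cpu family"), $2 = file (defaults to /proc/cpuinfo).
cpuinfo_field() {
    awk -F': *' -v key="$1" '
        { k = $1; sub(/[ \t]+$/, "", k) }   # strip padding after the key
        k == key { print $2; exit }
    ' "${2:-/proc/cpuinfo}"
}

# Only report; this sketch does not change any service state.
if [ -r /proc/cpuinfo ]; then
    vendor=$(cpuinfo_field "vendor_id")
    family=$(cpuinfo_field "cpu family")
    if mcelog_expected_unsupported "$vendor" "$family"; then
        echo "mcelog is expected to exit with 'CPU is unsupported' here"
    else
        echo "no AMD family >= 16 CPU detected"
    fi
fi
```

If the check reports an affected CPU, the running service can be stopped
with the init script from the original report (/etc/init.d/mcelog stop).
On SLES 11, "chkconfig mcelog off" should then keep it from starting at
boot; treat that as an assumption about a standard SysV init setup and
verify it on your system.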

We're tracking this as bnc#807336; AMD is involved.

-- 
Kind regards / Mit freundlichen Grüßen

Michael Krapp, Technical Support Engineer
Worldwide Support Services Linux
SUSE LINUX GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
Maxfeldstr. 5, D-90409 Nürnberg
T: +49-911-74053-0 F: +49-911-74053-679


* Greg.Lehmann at csiro.au wrote 20130305 08:08:
> I noticed this on an AMD server too, but since I had not installed SP2 on it before, I didn't know the behavior had changed.
> 
> Greg
> 
> > -----Original Message-----
> > From: sles-beta-bounces at lists.suse.com [mailto:sles-beta-bounces at lists.suse.com] On Behalf Of urs.frey at post.ch
> > Sent: Monday, 4 March 2013 7:57 PM
> > To: sles-beta at lists.suse.com
> > Subject: [sles-beta] Problems with mcelog on HP-Proliant Server with
> > AMD Opteron processors
> > 
> > Hi
> > 
> > Installation of SLES11-SP3 beta2 works smoothly.
> > I am testing on:
> > HP Proliant DL385-G5
> > HP Proliant BL465c-G7 Blade: AMD Opteron(tm) Processor 6172
> > HP Proliant BL465c-Gen8 Blade : AMD Opteron(TM) Processor 6220
> > HP Proliant BL460c-Gen8 Blade : Intel(R) Xeon(R) CPU E5-2660 0 @ 2.20GHz
> > 
> > The only thing that bothers me is the mcelog failure message on my
> > console.
> > 
> > On all AMD Opteron processor models I get this failure: "CPU is
> > unsupported"
> > 
> > h04syy:/etc/init.d # /etc/init.d/mcelog start
> > Starting mcelog... CPU is unsupported
> > startproc:  exit status of parent of /usr/sbin/mcelog: 1
> > failed
> > h04syy:/etc/init.d #
> > 
> > Under SLES11-SP2, mcelog runs without any failure.
> > SLES11-SP2 ships an older release of mcelog:
> > 
> > h04wwl:~ # uname -a
> > Linux h04wwl 3.0.58-0.6.6-default #1 SMP Tue Feb 19 11:07:00 UTC 2013 (1576ecd) x86_64 x86_64 x86_64 GNU/Linux
> > h04wwl:~ # cat /etc/SuSE-release
> > SUSE Linux Enterprise Server 11 (x86_64)
> > VERSION = 11
> > PATCHLEVEL = 2
> > h04wwl:~ #
> > h04wwl:~ # cat /proc/cpuinfo | grep "model name"
> > model name      : AMD Opteron(tm) Processor 6172
> > h04wwl:~ # ps -ef | grep mcelog
> > root      5603     1  0 Feb27 ?        00:00:00 /usr/sbin/mcelog --daemon --config-file /etc/mcelog/mcelog.conf
> > root     31534 31503  0 10:27 pts/0    00:00:00 grep mcelog
> > h04wwl:~ # cat /var/run/mcelog.pid
> > 5603
> > h04wwl:~ #
> > freyu at h04wwl:~> rpm -q --changelog mcelog-1.0.2011.06.08-0.11.1 | less
> > * Tue Apr 17 2012 trenn at suse.de
> > - Add Ivy Bridge support (bnc#748484)
> > - Add forgotten CPU model 0x25 (bnc#742716)
> > 
> > On Intel, mcelog works on SLES11-SP3 beta2:
> > model name      : Intel(R) Xeon(R) CPU E5-2660 0 @ 2.20GHz
> > h05cni:~ #
> > h05cni:~ # uname -a
> > Linux h05cni 3.0.65-0.9-default #1 SMP Mon Feb 25 07:21:23 UTC 2013 (055263a) x86_64 x86_64 x86_64 GNU/Linux
> > h05cni:~ # cat /etc/SuSE-release
> > SUSE Linux Enterprise Server 11 (x86_64)
> > VERSION = 11
> > PATCHLEVEL = 3
> > h05cni:~ # ps -ef | grep mcelog | grep -v grep
> > root      4949     1  0 10:33 ?        00:00:00 /usr/sbin/mcelog --daemon --config-file /etc/mcelog/mcelog.conf
> > h05cni:~ # cat /var/run/mcelog.pid
> > 4949
> > h05cni:~ #
> > 
> > But on AMD processors, mcelog no longer works on SLES11-SP3 beta2:
> > model name      : AMD Opteron(TM) Processor 6220
> > h05cnh:~ # ps -ef | grep mcelog | grep -v grep
> > h05cnh:~ # uname -a
> > Linux h05cnh 3.0.65-0.9-default #1 SMP Mon Feb 25 07:21:23 UTC 2013 (055263a) x86_64 x86_64 x86_64 GNU/Linux
> > h05cnh:~ # cat /etc/SuSE-release
> > SUSE Linux Enterprise Server 11 (x86_64)
> > VERSION = 11
> > PATCHLEVEL = 3
> > h05cnh:~ #
> > 
> > 
> > SLES11-SP3 includes a new release of mcelog. The RPM changelog
> > shows this new entry:
> > h05cnh:~ # rpm -qa | grep mcelog
> > mcelog-1.0.2013.01.18-0.7.7
> > h05cnh:~ # rpm -q --changelog mcelog-1.0.2013.01.18-0.7.7 | less
> > * Mon Jan 21 2013 trenn at suse.de
> > - Updated to latest git state (2013-01-18) (fate#313746)
> >   and removed patches which went upstream meanwhile (for example
> >   enabling ivybridge, fate#313745). This one has latest IvyBridge EX/EP
> >   CPU decoding support
> > - Removed wrongly used getdomainname call and replaced it with
> >   a dnslookup call as done in factory for a while
> > 
> > QUESTION:
> > Is there now a special configuration needed for AMD Opteron processors?
> > When I diff /etc/mcelog/mcelog.conf from SLES11-SP2 and SLES11-SP3
> > there is no difference at all.
> > 
> > So why does the message "CPU is unsupported" appear only on AMD Opteron
> > processors?
> > Could it be that some AMD-specific code was removed along with the
> > patches?
> > 
> > Thanks for your feedback
> > Best regards
> > 
> > Urs Frey
> > Die Schweizerische Post
> > Services
> > Informationstechnologie
> > Webergutstrasse 12
> > 3030 Bern (Zollikofen)
> > Telefon : ++41 (0)58 338 58 70
> > FAX     : ++41 (0)58 667 30 07
> > E-Mail:   urs.frey at post.ch
> > 
> > 
> > _______________________________________________
> > sles-beta mailing list
> > sles-beta at lists.suse.com
> > http://lists.suse.com/mailman/listinfo/sles-beta

-- 
Michael

Abschmecken ist feige...

