用ipmitool获取的管理卡日志没有完全解码,可读性差
19d | 03/03/2015 | 11:57:19 | Unknown #0x28 | 19e | 03/03/2015 | 11:57:19 | Unknown #0x28 | 19f | 03/03/2015 | 11:57:20 | Unknown #0x28 | 1a0 | 03/03/2015 | 11:57:20 | Unknown #0x28 | 1a1 | 03/03/2015 | 11:57:20 | Unknown #0x28 | 1a2 | 03/03/2015 | 11:57:20 | Unknown #0x28 | 1a3 | 03/03/2015 | 11:57:20 | Memory #0x02 | Uncorrectable ECC | Asserted
而每次登陆IDRAC的web界面也挺麻烦,这时用racadm就比较方便了。racadm是DELL提供的一个命令行工具,可以在DELL官网下载到。
The Dell RACADM (Remote Access Controller Admin) utility is a command line tool that allows for remote or local management of Dell Servers via the iDRAC or DRAC. RACADM provides similar functionality to the iDRAC/DRAC Graphical User Interface (GUI). The Dell Chassis Management Controller (CMC) can also be managed remotely with RACADM.
RACADM commands can be run remotely from a management station and/or locally on the managed system. [1]
用racadm获取管理卡日志的方法如下:
$ racadm -r 10.127.208.23 -u root -p calvin help getsel Security Alert: Certificate is invalid - Certificate is not signed by Trusted Third Party Continuing execution. Use -S option for racadm to stop execution on certificate-related errors. getsel -- display records from the System Event Log (SEL) Usage: racadm getsel [-s <start>] [-c <count>] [-A] [-o] [-E] [-R] racadm getsel -i [-A] ------------------------------------------------------------------------------- Valid Options: -s : starting record number -c : number of records to display -i : display number of total records available -m : display a screen at a time -A : do not display headers or labels -o : display each record on a single line -E : display RAW SEL data along with other record data -R : display only RAW SEL data for each record ------------------------------------------------------------------------------- Usage Examples: - Display entire log: racadm getsel - Display number of records in log: racadm getsel -i -------------------------------------------------------------------------------
在windows上使用msys,可以用脚本包装一下,更加便于使用。新建 /bin/sel,脚本内容如下
#!/bin/bash IP=$1 echo > /tmp/$IP.log racadm -r $IP -u root -p passwd getsel -o 2>&1 |tee -a /tmp/$IP.log npp /tmp/$IP.log &
使用演示
$ sel 10.127.208.110 Security Alert: Certificate is invalid - Certificate is not signed by Trusted Third Party Continuing execution. Use -S option for racadm to stop execution on certificate-related errors. 03/03/2015 11:57:19 system Ok MSR Info Log: OEM sensor, OEM Diagnostic data event was asserted 03/03/2015 11:57:19 system Ok MSR Info Log: OEM sensor, OEM Diagnostic data event was asserted 03/03/2015 11:57:20 system Ok MSR Info Log: OEM sensor, OEM Diagnostic data event was asserted 03/03/2015 11:57:20 system Ok MSR Info Log: OEM sensor, OEM Diagnostic data event was asserted 03/03/2015 11:57:20 system Ok MSR Info Log: OEM sensor, OEM Diagnostic data event was asserted 03/03/2015 11:57:20 system Ok MSR Info Log: OEM sensor, OEM Diagnostic data event was asserted 03/03/2015 11:57:20 system Critical ECC Uncorr Err: Memory sensor, uncorrectable ECC ( DIMM_A2 ) was asserted
跟文章开头ipmitool操作的是同一台机器,racadm可以明确的看到是哪个槽位的内存有问题。
参考资料
[1]. RACADM Command Line Interface for DRAC http://en.community.dell.com/techcenter/systems-management/w/wiki/3205.racadm-command-line-interface-for-drac
发表回复