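Upgrading RAID Controller Firmware on a Dell Server

After a disk was replaced, one of our servers was left in an abnormal state. I rebooted it into the RAID management interface, where the new disk was shown as Blocked and could not be configured as a hot spare. The log contained the following: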

Copyback cannot be started on PD 03(e0x20/s3) from PD 05(e0x20/s5), as SAS/SATA is not supported in an array
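
The log line appears to point at SAS/SATA mixing, so it can be worth checking which interface type each drive reports. The following is a minimal sketch, assuming the full `-pdlist` output carries one `PD Type:` line per drive (that line is not visible in the filtered output in this post); `pd_type_summary` is a hypothetical helper name.

```shell
# Count drives per interface type from `MegaCli64 -pdlist -aall` output on stdin.
# Assumption: each drive block contains a line such as "PD Type: SAS".
pd_type_summary() {
    awk -F': *' '/^PD Type/ { count[$2]++ }
        END { for (t in count) printf "%s %d\n", t, count[t] }' | sort
}

# Intended use on the server (path as used elsewhere in this post):
# /opt/MegaRAID/MegaCli/MegaCli64 -pdlist -aall | pd_type_summary
```

More than one type in the summary would only be a hint; in this case the fix turned out to be the firmware upgrade described below.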

Check the disk state:

[root@d10045101 ~]# /opt/MegaRAID/MegaCli/MegaCli64 -pdlist -aall |grep "Firmware st"
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Unconfigured(good), Spun Up
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
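
The `grep` above shows the states but not which slot each one belongs to. A small sketch that pairs the two, assuming the unfiltered `-pdlist` output has a `Slot Number:` line before each `Firmware state:` line; `pd_states_by_slot` is a name made up for illustration.

```shell
# Print "slot N: <firmware state>" for every drive in pdlist output read from stdin.
pd_states_by_slot() {
    awk -F': *' '
        /^Slot Number/    { slot = $2 }                              # remember the current slot
        /^Firmware state/ { printf "slot %s: %s\n", slot, $2 }       # emit it with its state
    '
}

# Intended use on the server:
# /opt/MegaRAID/MegaCli/MegaCli64 -pdlist -aall | pd_states_by_slot
```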

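A colleague said this was a RAID firmware problem and that the firmware needed to be upgraded.

Pre-upgrade preparation

Check the current RAID controller firmware information. The output shows that the controller model is PERC H700 Integrated.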

[root@d10045101 hean]# lspci |grep -i raid
03:00.0 RAID bus controller: LSI Logic / Symbios Logic LSI MegaSAS 9260 (rev 04)

[root@d10045101 ~]# /opt/MegaRAID/MegaCli/MegaCli64 -adpallinfo -a0

Adapter #0

==============================================================================
Versions
================
Product Name : PERC H700 Integrated
Serial No : 9CO00AV
FW Package Build: 12.0.1-0091

Mfg. Data
================
Mfg. Date : 12/25/09
Rework Date : 12/25/09
Revision No : A00
Battery FRU : N/A

Image Versions in Flash:
================
BIOS Version : 3.09.00
FW Version : 2.0.03-0772
Preboot CLI Version: 02.00-013:#%00008
Ctrl-R Version : 2.00-0024
NVDATA Version : 2.02.0037
Boot Block Version : 2.00.00.00-0018
BOOT Version : 01.250.04.219
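
To record the installed version before upgrading, the `FW Package Build` line can be pulled out of the adapter report. A minimal sketch, assuming the line format shown above; `fw_package_build` is a hypothetical helper.

```shell
# Print the value of the first "FW Package Build" line found on stdin.
fw_package_build() {
    awk -F': *' '/^FW Package Build/ { print $2; exit }'
}

# Intended use on the server:
# /opt/MegaRAID/MegaCli/MegaCli64 -adpallinfo -a0 | fw_package_build
```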

Check the machine model and serial number (SN).

[root@d10045101 ~]# dmidecode -t system
# dmidecode 2.9
SMBIOS 2.6 present.

Handle 0x0100, DMI type 1, 27 bytes
System Information
	Manufacturer: Dell Inc.
	Product Name: PowerEdge R710
	Version: Not Specified
	Serial Number: 1XXXXXX
	Wake-up Type: Power Switch
	SKU Number: Not Specified
	Family: Not Specified
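
Only two fields of that report are needed for the download. As a sketch, a single field can be extracted from the `dmidecode -t system` output like this; `system_field` is a hypothetical helper. On versions of dmidecode that support it, `dmidecode -s system-product-name` and `dmidecode -s system-serial-number` print the same values directly.

```shell
# Print the value of one "Label: value" field from `dmidecode -t system` output on stdin.
# $1 is the label, e.g. "Product Name" or "Serial Number".
system_field() {
    awk -F': *' -v key="$1" '
        { sub(/^[ \t]+/, "", $1) }        # drop the leading indentation of the label
        $1 == key { print $2; exit }
    '
}

# Intended use on the server (needs root):
# dmidecode -t system | system_field "Serial Number"
```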

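Go to Dell's website and download the matching firmware package, using the SN and the RAID controller model.

Upgrade process

Make the downloaded firmware package executable, then run it directly. The output of the upgrade follows.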

[root@d10045101 ~]# ./SAS-RAID_Firmware_C3X7D_LN_12.10.6-0001_A12.BIN 
Collecting inventory...
.......
Running validation...

PERC H700 Integrated Controller 0

The version of this Update Package is newer than the currently installed version.
Software application name: PERC H700 Integrated Controller 0 Firmware
Package version: 12.10.6-0001
Installed version: 12.0.1-0091


Continue? Y/N:y
Executing update...
WARNING: DO NOT STOP THIS PROCESS OR INSTALL OTHER DELL PRODUCTS WHILE UPDATE IS IN PROGRESS.
THESE ACTIONS MAY CAUSE YOUR SYSTEM TO BECOME UNSTABLE!
................................................................................................................................................................................................................
Device: PERC H700 Integrated Controller 0
  Application: PERC H700 Integrated Controller 0 Firmware
  The operation was successful.

Would you like to reboot your system now?
Continue? Y/N:y

Broadcast message from root (pts/0) (Wed Nov  5 15:40:44 2014):

The system is going down for reboot NOW!
[root@d10045101 ~]# 
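
The session above starts after the package had already been made executable. A minimal sketch of that preparation step, with a hypothetical helper `prepare_package` that only sets the permission bit and does not start the update:

```shell
# Check that the downloaded package exists and mark it executable.
# It deliberately does not run the package; that stays a manual step.
prepare_package() {
    pkg=$1
    [ -f "$pkg" ] || { echo "not found: $pkg" >&2; return 1; }
    chmod +x "$pkg" || return 1
    echo "ready: $pkg"
}

# Intended use, with the file name from the session above:
# prepare_package ./SAS-RAID_Firmware_C3X7D_LN_12.10.6-0001_A12.BIN
# ./SAS-RAID_Firmware_C3X7D_LN_12.10.6-0001_A12.BIN
```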

Upgrade result

After the upgrade and reboot, check the controller firmware version again. FW Package Build has changed from 12.0.1-0091 to 12.10.6-0001.

[root@d10045101 ~]# lspci |grep -i raid
03:00.0 RAID bus controller: LSI Logic / Symbios Logic LSI MegaSAS 9260 (rev 04)
[root@d10045101 ~]# /opt/MegaRAID/MegaCli/MegaCli64 -adpallinfo -a0 |more
                                     
OSSpecificInitialize: Failed to load libsysfs.so.2.0.2 Please ensure that libsfs is present in the system.
The dependent library libsysfs.so.2.0.1 not available. Please contact LSI for distribution of the package
Adapter #0

==============================================================================
                    Versions
                ================
Product Name    : PERC H700 Integrated
Serial No       : 9CO00AV
FW Package Build: 12.10.6-0001

                    Mfg. Data
                ================
Mfg. Date       : 12/25/09
Rework Date     : 12/25/09
Revision No     : A00
Battery FRU     : N/A

                Image Versions in Flash:
                ================
BIOS Version       : 3.18.00_4.09.05.00_0x0416A000
FW Version         : 2.100.03-2514
Preboot CLI Version: 04.04-010:#%00008
Ctrl-R Version     : 2.02-0025.1
NVDATA Version     : 2.07.03-0003
Boot Block Version : 2.02.00.00-0000
BOOT Version       : 01.250.04.219
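
The before and after package versions can also be compared in a script rather than by eye. A sketch assuming GNU `sort` with the `-V` (version sort) option is available; `version_is_newer` is a hypothetical helper.

```shell
# Succeed when version $1 sorts strictly after version $2 under GNU `sort -V`.
version_is_newer() {
    [ "$1" != "$2" ] &&
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | tail -n 1)" = "$1" ]
}

# Example with the two versions from this post:
# version_is_newer 12.10.6-0001 12.0.1-0091 && echo "firmware was upgraded"
```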

Check the disk state.

[root@d10045101 ~]# /opt/MegaRAID/MegaCli/MegaCli64 -pdlist -aall |grep "Firmware st"
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Copyback
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up

Disk state after the Copyback process has finished:

[root@d10045101 ~]# /opt/MegaRAID/MegaCli/MegaCli64 -pdlist -aall |grep "Firmware st"
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Online, Spun Up
Firmware state: Hotspare, Spun Up
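
Instead of re-running the command by hand until Copyback disappears, the check can be polled. A rough sketch, assuming a drive that is still copying reports `Firmware state: Copyback` as in the output above; `copyback_in_progress`, `wait_for_copyback`, and `list_pds` are names invented here.

```shell
# Succeed if the pdlist output on stdin shows any drive in the Copyback state.
copyback_in_progress() {
    grep -q "Firmware state: Copyback"
}

# Poll until no drive reports Copyback.
# $1: name of a command or function that prints the pdlist output
# $2: seconds to sleep between polls
wait_for_copyback() {
    while "$1" | copyback_in_progress; do
        sleep "$2"
    done
    echo "no drive reports Copyback"
}

# Wrapper around the command used in this post; defined here, not run.
list_pds() { /opt/MegaRAID/MegaCli/MegaCli64 -pdlist -aall; }

# Intended use on the server:
# wait_for_copyback list_pds 60
```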

About Copyback

Typically, a drive fails or is expected to fail, and the data is rebuilt on a hot spare.
The failed drive is replaced with a new drive. Then the data is copied from the hot spare to the new drive, and the hot spare reverts from a rebuild drive to its original hot spare status. The copyback operation runs as a background activity, and the virtual drive is still available online to the host.

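To explain: Copyback is only available on RAID arrays that have a hot spare (an idle standby disk).

Suppose an array has a hot spare and the drive in slot 5 fails. The hot spare drive in slot 12 then starts rebuilding.
Once the rebuild completes, the array no longer has a hot spare, the member drive positions have changed, and the Disk Group (DG) has changed with them.

If copyback is enabled, then when a new, healthy drive is inserted into slot 5, the former hot spare in slot 12 syncs its data to slot 5. Importantly, any data written to the VD during this sync is kept in step between slot 5 and slot 12. After the copyback completes, the drive in slot 12 can carry on as the hot spare, and the original array layout and DG are unchanged.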
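
The slot 5 / slot 12 example can be walked through as a toy model. This sketch only tracks the role of each slot at each step as described above; it does not talk to a controller, and the role names are labels chosen for illustration.

```shell
# Toy model of the example: roles of slot 5 and slot 12 at each step.
slot5="member"
slot12="hotspare"

show() { printf '%s: slot5=%s slot12=%s\n' "$1" "$slot5" "$slot12"; }

copyback_walkthrough() {
    show "initial"
    slot5="failed"; slot12="rebuilding";  show "slot 5 fails"
    slot12="member";                      show "rebuild done"          # array now has no hot spare
    slot5="copyback-target";              show "new drive in slot 5"   # slot 12 syncs to slot 5
    slot5="member"; slot12="hotspare";    show "copyback done"         # original layout restored
}

copyback_walkthrough
```

The last line printed matches the first one apart from its label, which is the point of copyback: the array ends up with the same layout it started with.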

References

[1]. LSI RAID controller Copyback feature documentation http://www.0li0.com/html/20121116/548.html
