RAID0 drama on RedHat 6.2
Juan Pablo Tamayo
jptamayo at uno.cl
Tue Jun 1 16:06:53 CLT 2004
Hi all,
The situation:
On a machine with two SCSI controllers connected to two external storage
arrays, the first array (a Clarion) died, and Linux no longer sees anything
on that controller (scsi0), but it does see the disks on the second
controller, which is connected to a Sun storage array.
Since the disk names changed as a result (what used to be /dev/sdc is now
/dev/sda), I simply edited /etc/raidtab and tried to bring up md0, but it
does not work. Maybe someone has experience with this and can give me some
advice, since I have never run into a dead array before.
Please skip the comments about RedHat being too old and so on; this is a
customer's machine and I do not administer it.
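A quick way to cross-check the renaming described above is to pull the old
and new names out of the md "device name has changed" messages that appear
in the dmesg output later in this message. The sketch below is only
illustrative: the regular expression, the helper names (rename_map,
unreported) and the hard-coded raidtab list are assumptions made for the
example, not part of raidtools.

```python
import re

# md messages copied verbatim from the dmesg output in this post.
LOG = """\
md: device name has changed from sdi1 to sdg1 since last import!
md: device name has changed from sdh1 to sdf1 since last import!
md: device name has changed from sdg1 to sde1 since last import!
md: device name has changed from sdf1 to sdd1 since last import!
md: device name has changed from sde1 to sdc1 since last import!
md: device name has changed from sdc1 to sda1 since last import!
"""

# Devices listed in the edited /etc/raidtab, in raid-disk order.
RAIDTAB = ["sda1", "sdb1", "sdc1", "sdd1", "sde1", "sdf1", "sdg1"]

# Hypothetical pattern for the rename message; it matches the lines above.
RENAME = re.compile(
    r"device name has changed from (\w+) to (\w+) since last import"
)


def rename_map(log):
    """Return {old_name: new_name} for every rename message in the log."""
    return {old: new for old, new in RENAME.findall(log)}


def unreported(raidtab, renames):
    """Return raidtab devices that never show up as a new name."""
    seen = set(renames.values())
    return [dev for dev in raidtab if dev not in seen]


renames = rename_map(LOG)
missing = unreported(RAIDTAB, renames)

if __name__ == "__main__":
    for old, new in sorted(renames.items()):
        print(f"{old} -> {new}")
    print("no rename message for:", missing)
```

On the log in this post it finds a rename for six of the seven raidtab
entries and none for sdb1, the same disk that has no "(read)" line in the
raidstart output.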
[matrix:root:~]uname -a
Linux matrix.dominio.cl 2.2.19-6.2.16 #1 Wed Mar 13 13:03:05 EST 2002 i686 unknown
[matrix:root:~]lsmod
Module Size Used by
3c59x 21584 1 (autoclean)
[matrix:root:~]cat /etc/raidtab
raiddev /dev/md0
raid-level 0
nr-raid-disks 7
chunk-size 128
persistent-superblock 1
device /dev/sda1
raid-disk 0
device /dev/sdb1
raid-disk 1
device /dev/sdc1
raid-disk 2
device /dev/sdd1
raid-disk 3
device /dev/sde1
raid-disk 4
device /dev/sdf1
raid-disk 5
device /dev/sdg1
raid-disk 6
[matrix:root:~]modprobe -av aic7xxx
/sbin/insmod /lib/modules/2.2.19-6.2.16/scsi/aic7xxx.o
Using /lib/modules/2.2.19-6.2.16/scsi/aic7xxx.o
Symbol version prefix ''
[matrix:root:~]modprobe -av raid0
/sbin/insmod /lib/modules/2.2.19-6.2.16/block/raid0.o
Using /lib/modules/2.2.19-6.2.16/block/raid0.o
Symbol version prefix ''
[matrix:root:~]lsmod
Module Size Used by
raid0 3072 0 (unused)
aic7xxx 131808 0 (unused)
3c59x 21584 1 (autoclean)
[matrix:root:~]dmesg
[...]
(scsi0) <Adaptec AHA-2944 Ultra SCSI host adapter> found at PCI 0/9/0
(scsi0) Wide Channel, SCSI ID=7, 16/255 SCBs
(scsi0) Cables present (Int-50 NO, Int-68 NO, Ext-68 NO)
(scsi0) Downloading sequencer code... 436 instructions downloaded
(scsi1) <Adaptec AHA-2944 Ultra SCSI host adapter> found at PCI 0/11/0
(scsi1) Wide Channel, SCSI ID=7, 16/255 SCBs
(scsi1) Cables present (Int-50 NO, Int-68 NO, Ext-68 YES)
(scsi1) Downloading sequencer code... 422 instructions downloaded
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.33/3.2.4
<Adaptec AHA-2944 Ultra SCSI host adapter>
scsi1 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.33/3.2.4
<Adaptec AHA-2944 Ultra SCSI host adapter>
scsi : 2 hosts.
Vendor: FUJITSU Model: M2949ESP SUN9.0G Rev: 2848
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sda at scsi1, channel 0, id 0, lun 0
Vendor: FUJITSU Model: M2949ESP SUN9.0G Rev: 2848
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdb at scsi1, channel 0, id 1, lun 0
Vendor: FUJITSU Model: M2949ESP SUN9.0G Rev: 2848
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdc at scsi1, channel 0, id 2, lun 0
Vendor: FUJITSU Model: M2949ESP SUN9.0G Rev: 2848
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdd at scsi1, channel 0, id 3, lun 0
Vendor: FUJITSU Model: M2949ESP SUN9.0G Rev: 2848
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sde at scsi1, channel 0, id 4, lun 0
Vendor: FUJITSU Model: M2949ESP SUN9.0G Rev: 2848
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdf at scsi1, channel 0, id 5, lun 0
Vendor: FUJITSU Model: M2949ESP SUN9.0G Rev: 2848
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdg at scsi1, channel 0, id 6, lun 0
(scsi1:0:0:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sda: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6 GB]
sda: sda1
(scsi1:0:1:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdb: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6 GB]
sdb: sdb1
(scsi1:0:2:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdc: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6 GB]
sdc: sdc1
(scsi1:0:3:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdd: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6 GB]
sdd: sdd1
(scsi1:0:4:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sde: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6 GB]
sde: sde1
(scsi1:0:5:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdf: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6 GB]
sdf: sdf1
(scsi1:0:6:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdg: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6 GB]
sdg: sdg1
raid0 personality registered
[matrix:root:~]raidstart -a
[matrix:root:~]dmesg
[...]
(read) sda1's sb offset: 8843648 [events: 00000009]
(read) sdc1's sb offset: 8843648 [events: 00000009]
(read) sdd1's sb offset: 8843648 [events: 00000009]
(read) sde1's sb offset: 8843648 [events: 00000009]
(read) sdf1's sb offset: 8843648 [events: 00000009]
(read) sdg1's sb offset: 8843648 [events: 00000009]
blkdev_open() failed: -6
md: could not lock sdh1, zero-size? Marking faulty.
could not import sdh1, trying to run array nevertheless.
blkdev_open() failed: -6
md: could not lock sdi1, zero-size? Marking faulty.
could not import sdi1, trying to run array nevertheless.
autorun ...
considering sdg1 ...
adding sdg1 ...
adding sdf1 ...
adding sde1 ...
adding sdd1 ...
adding sdc1 ...
adding sda1 ...
created md0
bind<sda1,1>
bind<sdc1,2>
bind<sdd1,3>
bind<sde1,4>
bind<sdf1,5>
bind<sdg1,6>
running: <sdg1><sdf1><sde1><sdd1><sdc1><sda1>
now!
sdg1's event counter: 00000009
sdf1's event counter: 00000009
sde1's event counter: 00000009
sdd1's event counter: 00000009
sdc1's event counter: 00000009
sda1's event counter: 00000009
md: device name has changed from sdi1 to sdg1 since last import!
md: device name has changed from sdh1 to sdf1 since last import!
md: device name has changed from sdg1 to sde1 since last import!
md: device name has changed from sdf1 to sdd1 since last import!
md: device name has changed from sde1 to sdc1 since last import!
md: device name has changed from sdc1 to sda1 since last import!
md0: former device sdd1 is unavailable, removing from array!
mask ffffff80
rdev->size: 8843648
masked rdev->size: 8843648
new md_size: 8843648
rdev->size: 8843648
masked rdev->size: 8843648
new md_size: 17687296
rdev->size: 8843648
masked rdev->size: 8843648
new md_size: 26530944
rdev->size: 8843648
masked rdev->size: 8843648
new md_size: 35374592
rdev->size: 8843648
masked rdev->size: 8843648
new md_size: 44218240
rdev->size: 8843648
masked rdev->size: 8843648
new md_size: 53061888
md0: max total readahead window set to 3584k
md0: 7 data-disks, max readahead per data-disk: 512k
md: md0, array needs 7 disks, has 6, aborting.
raid0: disks are not ordered, aborting!
pers->run() failed ...
do_md_run() returned -22
unbind<sdg1,5>
export_rdev(sdg1)
unbind<sdf1,4>
export_rdev(sdf1)
unbind<sde1,3>
export_rdev(sde1)
unbind<sdd1,2>
export_rdev(sdd1)
unbind<sdc1,1>
export_rdev(sdc1)
unbind<sda1,0>
export_rdev(sda1)
md0 stopped.
... autorun DONE.
[matrix:root:~]fdisk -l /dev/sdd
Disk /dev/sdd: 255 heads, 63 sectors, 1101 cylinders
Units = cylinders of 16065 * 512 bytes
Device Boot Start End Blocks Id System
/dev/sdd1 1 1101 8843751 83 Linux
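The sizes in the output above can be sanity-checked with a little
arithmetic. This sketch assumes the 0.90 persistent-superblock layout as I
understand it (the superblock occupies the last 64 KiB-aligned 64 KiB block
of the partition); that layout and the function name sb_offset_kb are
assumptions, not something stated in the log itself.

```python
# Assumed: the md 0.90 superblock reserves 64 KiB, aligned to 64 KiB.
MD_RESERVED_KB = 64


def sb_offset_kb(partition_kb):
    """Round the partition size down to a 64 KiB boundary, then step back 64 KiB."""
    return (partition_kb & ~(MD_RESERVED_KB - 1)) - MD_RESERVED_KB


# "Blocks" column (1 KiB units) reported by fdisk for /dev/sdd1.
PARTITION_KB = 8843751

# Compare with "sb offset: 8843648" and "rdev->size: 8843648" in dmesg.
offset = sb_offset_kb(PARTITION_KB)

# Six members were imported; compare with the last "new md_size" line.
total_six_kb = 6 * offset

# raidtab declares 7 disks at 512k readahead each; compare with
# "max total readahead window set to 3584k".
readahead_kb = 7 * 512

if __name__ == "__main__":
    print("superblock offset (KiB):", offset)
    print("md_size with 6 members (KiB):", total_six_kb)
    print("total readahead (KiB):", readahead_kb)
```

If the assumption holds, the numbers agree with the log: the per-disk size
matches the sb offset, and md_size stops at six members' worth of space
even though the array is declared with seven disks.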
Questions:
Why is it still trying to access /dev/sdh and /dev/sdi?
Why are there no messages about /dev/sdb?
Why does it say /dev/sdd is unavailable, when fdisk sees it just like the
other six disks?
With persistent-superblock set to 1, does it ignore /etc/raidtab? How do I
force it to use it?
Awaiting your enlightenment.
Regards,
JPT