RAID0 con drama en RedHat 6.2

Juan Pablo Tamayo jptamayo en uno.cl
Mar Jun 1 16:06:53 CLT 2004


Estimados

La situacion:
En una maquina con dos controladoras SCSI conectadas a dos Storages
externos, se murio el primero de ellos (un Clarion), y el linux no ve nada
por esa controladora (scsi0), pero si ve los discos de la segunda
controladora, conectada a un Storage Sun.
Luego, como cambio la identificacion de los discos (lo que antes era
/dev/sdc ahora quedo como /dev/sda), simplemente cambie el /etc/raidtab y
trate de subir el md0 pero no funciona, tal vez alguien tiene experiencia y
me pueda dar un consejo, ya que nunca me habia topado con un arreglo muerto.

Por favor omitir comentarios de que el RedHat esta muy viejo et al, ya que
es un equipo de cliente y no administrado por mi.

[matrix:root:~]uname -a
Linux matrix.dominio.cl 2.2.19-6.2.16 #1 Wed Mar 13 13:03:05 EST 2002 i686
unknown

[matrix:root:~]lsmod
Module                  Size  Used by
3c59x                  21584   1  (autoclean)

[matrix:root:~]cat /etc/raidtab
raiddev /dev/md0
        raid-level 0
        nr-raid-disks   7
        chunk-size      128
        persistent-superblock 1
        device /dev/sda1
        raid-disk 0
        device /dev/sdb1
        raid-disk 1
        device /dev/sdc1
        raid-disk 2
        device /dev/sdd1
        raid-disk 3
        device /dev/sde1
        raid-disk 4
        device /dev/sdf1
        raid-disk 5
        device /dev/sdg1
        raid-disk 6

[matrix:root:~]modprobe -av aic7xxx
/sbin/insmod /lib/modules/2.2.19-6.2.16/scsi/aic7xxx.o
Using /lib/modules/2.2.19-6.2.16/scsi/aic7xxx.o
Symbol version prefix ''

[matrix:root:~]modprobe -av raid0
/sbin/insmod /lib/modules/2.2.19-6.2.16/block/raid0.o
Using /lib/modules/2.2.19-6.2.16/block/raid0.o
Symbol version prefix ''

[matrix:root:~]lsmod
Module                  Size  Used by
raid0                   3072   0  (unused)
aic7xxx               131808   0  (unused)
3c59x                  21584   1  (autoclean)

[matrix:root:~]dmesg
[...]
(scsi0) <Adaptec AHA-2944 Ultra SCSI host adapter> found at PCI 0/9/0
(scsi0) Wide Channel, SCSI ID=7, 16/255 SCBs
(scsi0) Cables present (Int-50 NO, Int-68 NO, Ext-68 NO)
(scsi0) Downloading sequencer code... 436 instructions downloaded
(scsi1) <Adaptec AHA-2944 Ultra SCSI host adapter> found at PCI 0/11/0
(scsi1) Wide Channel, SCSI ID=7, 16/255 SCBs
(scsi1) Cables present (Int-50 NO, Int-68 NO, Ext-68 YES)
(scsi1) Downloading sequencer code... 422 instructions downloaded
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.33/3.2.4
       <Adaptec AHA-2944 Ultra SCSI host adapter>
scsi1 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.33/3.2.4
       <Adaptec AHA-2944 Ultra SCSI host adapter>
scsi : 2 hosts.
  Vendor: FUJITSU   Model: M2949ESP SUN9.0G  Rev: 2848
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sda at scsi1, channel 0, id 0, lun 0
  Vendor: FUJITSU   Model: M2949ESP SUN9.0G  Rev: 2848
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sdb at scsi1, channel 0, id 1, lun 0
  Vendor: FUJITSU   Model: M2949ESP SUN9.0G  Rev: 2848
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sdc at scsi1, channel 0, id 2, lun 0
  Vendor: FUJITSU   Model: M2949ESP SUN9.0G  Rev: 2848
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sdd at scsi1, channel 0, id 3, lun 0
  Vendor: FUJITSU   Model: M2949ESP SUN9.0G  Rev: 2848
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sde at scsi1, channel 0, id 4, lun 0
  Vendor: FUJITSU   Model: M2949ESP SUN9.0G  Rev: 2848
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sdf at scsi1, channel 0, id 5, lun 0
  Vendor: FUJITSU   Model: M2949ESP SUN9.0G  Rev: 2848
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sdg at scsi1, channel 0, id 6, lun 0
(scsi1:0:0:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sda: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6
GB]
 sda: sda1
(scsi1:0:1:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdb: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6
GB]
 sdb: sdb1
(scsi1:0:2:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdc: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6
GB]
 sdc: sdc1
(scsi1:0:3:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdd: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6
GB]
 sdd: sdd1
(scsi1:0:4:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sde: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6
GB]
 sde: sde1
(scsi1:0:5:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdf: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6
GB]
 sdf: sdf1
(scsi1:0:6:0) Synchronous at 20.0 Mbyte/sec, offset 8.
SCSI device sdg: hdwr sector= 512 bytes. Sectors= 17689267 [8637 MB] [8.6
GB]
 sdg: sdg1
raid0 personality registered

[matrix:root:~]raidstart -a

[matrix:root:~]dmesg
[...]
(read) sda1's sb offset: 8843648 [events: 00000009]
(read) sdc1's sb offset: 8843648 [events: 00000009]
(read) sdd1's sb offset: 8843648 [events: 00000009]
(read) sde1's sb offset: 8843648 [events: 00000009]
(read) sdf1's sb offset: 8843648 [events: 00000009]
(read) sdg1's sb offset: 8843648 [events: 00000009]
blkdev_open() failed: -6
md: could not lock sdh1, zero-size? Marking faulty.
could not import sdh1, trying to run array nevertheless.
blkdev_open() failed: -6
md: could not lock sdi1, zero-size? Marking faulty.
could not import sdi1, trying to run array nevertheless.
autorun ...
considering sdg1 ...
  adding sdg1 ...
  adding sdf1 ...
  adding sde1 ...
  adding sdd1 ...
  adding sdc1 ...
  adding sda1 ...
created md0
bind<sda1,1>
bind<sdc1,2>
bind<sdd1,3>
bind<sde1,4>
bind<sdf1,5>
bind<sdg1,6>
running: <sdg1><sdf1><sde1><sdd1><sdc1><sda1>
now!
sdg1's event counter: 00000009
sdf1's event counter: 00000009
sde1's event counter: 00000009
sdd1's event counter: 00000009
sdc1's event counter: 00000009
sda1's event counter: 00000009
md: device name has changed from sdi1 to sdg1 since last import!
md: device name has changed from sdh1 to sdf1 since last import!
md: device name has changed from sdg1 to sde1 since last import!
md: device name has changed from sdf1 to sdd1 since last import!
md: device name has changed from sde1 to sdc1 since last import!
md: device name has changed from sdc1 to sda1 since last import!
md0: former device sdd1 is unavailable, removing from array!
mask ffffff80
 rdev->size: 8843648
 masked rdev->size: 8843648
  new md_size: 8843648
 rdev->size: 8843648
 masked rdev->size: 8843648
  new md_size: 17687296
 rdev->size: 8843648
 masked rdev->size: 8843648
  new md_size: 26530944
 rdev->size: 8843648
 masked rdev->size: 8843648
  new md_size: 35374592
 rdev->size: 8843648
 masked rdev->size: 8843648
  new md_size: 44218240
 rdev->size: 8843648
 masked rdev->size: 8843648
  new md_size: 53061888
md0: max total readahead window set to 3584k
md0: 7 data-disks, max readahead per data-disk: 512k
md: md0, array needs 7 disks, has 6, aborting.
raid0: disks are not ordered, aborting!
pers->run() failed ...
do_md_run() returned -22
unbind<sdg1,5>
export_rdev(sdg1)
unbind<sdf1,4>
export_rdev(sdf1)
unbind<sde1,3>
export_rdev(sde1)
unbind<sdd1,2>
export_rdev(sdd1)
unbind<sdc1,1>
export_rdev(sdc1)
unbind<sda1,0>
export_rdev(sda1)
md0 stopped.
... autorun DONE.

[matrix:root:~]fdisk -l /dev/sdd

Disk /dev/sdd: 255 heads, 63 sectors, 1101 cylinders
Units = cylinders of 16065 * 512 bytes

   Device Boot    Start       End    Blocks   Id  System
/dev/sdd1             1      1101   8843751   83  Linux


Dudas:
Por que aun trata de acceder a /dev/sdh y /dev/sdi ?
Por que no se muestran mensajes sobre /dev/sdb ?
Por que dice que /dev/sdd no esta disponible si lo ve con fdisk al igual que
los otros seis discos ?
Por el persistent-superblock en 1 no toma /etc/raidtab? Como lo fuerzo?

Esperando su iluminacion
Saludos!

JPT



Más información sobre la lista de distribución Linux