blaaat
23/04/08, 22:13
Ik heb een degraded raid 5 array op 1 van mijn servers.
Nu ben ik een beetje bang om op onbekende manieren te gaan rebuilden.
Ik heb een 9650SE-4LPML met 4 HD's, 3 worden gebruikt, en 1 stond als hot-spare, maar automatisch recoveren hierop is mislukt.
c0 [Sun Apr 20 11:16:29 2008] WARNING Sector repair completed: port=3, LBA=0x7680C45
c0 [Sun Apr 20 11:17:03 2008] WARNING Sector repair completed: port=3, LBA=0x766D745
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x760BA05
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x7680C45
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x7687035
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x760BA05
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x766D745
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x7687035
c0 [Sun Apr 20 11:17:04 2008] ERROR Degraded unit: unit=0, port=3
c0 [Sun Apr 20 11:20:54 2008] INFO Rebuild started: unit=0
c0 [Sun Apr 20 11:43:15 2008] ERROR Drive timeout detected: port=3, unit=0
c0 [Sun Apr 20 11:43:30 2008] ERROR Rebuild failed: unit=0
c0 [Sun Apr 20 11:43:30 2008] ERROR Degraded unit: unit=0, port=3
c0 [Sun Apr 20 11:44:50 2008] WARNING Drive removed: port=3
c0 [Sun Apr 20 11:44:50 2008] ERROR Degraded unit: unit=0, port=3
c0 [Wed Apr 23 19:09:58 2008] INFO Drive inserted: port=1
c0 [Wed Apr 23 20:51:23 2008] INFO Drive inserted: port=1
3 disks, verspreid over 2 units.
Unit UnitType Status %RCmpl %V/I/M Stripe Size(GB) Cache AVrfy
------------------------------------------------------------------------------
u0 RAID-5 DEGRADED - - 64K 148.99 OFF OFF
u1 RAID-5 INOPERABLE - - 64K 148.99 OFF OFF
Port Status Unit Size Blocks Serial
---------------------------------------------------------------
p0 OK u0 74.53 GB 156301488 WD-WMAP97754657
p1 OK u1 74.53 GB 156301488 WD-WMAP97958231
p2 OK u0 74.53 GB 156301488 WD-WMAP97797408
p3 UNKNOWN - 74.53 GB 156301488 WD-WMAP97771570
Unit 0 bevat 1 degraded disk.
//server15> /c0/u0 show
Unit UnitType Status %RCmpl %V/I/M Port Stripe Size(GB)
------------------------------------------------------------------------
u0 RAID-5 DEGRADED - - - 64K 148.99
u0-0 DISK OK - - p2 - 74.4951
u0-1 DISK DEGRADED - - - - 74.4951
u0-2 DISK OK - - p0 - 74.4951
u0/v0 Volume - - - - - 148.99
Unit 1 bevat er 2.
//server15> /c0/u1 show
Unit UnitType Status %RCmpl %V/I/M Port Stripe Size(GB)
------------------------------------------------------------------------
u1 RAID-5 INOPERABLE - - - 64K 148.99
u1-0 DISK DEGRADED - - - - 74.4951
u1-1 DISK OK - - p1 - 74.4951
u1-2 DISK DEGRADED - - - - 74.4951
u1/v0 Volume - - - - - 148.99
Nou probeer ik om disk3 weer als spare toe te voegen aan unit0.
maar zonder succes.
//server15> /c0 add type=spare disk=3
Creating new unit on controller /c0 ... Failed.
(0x0B:0x0020): Drive error
Pogingen om te rebuilden geven ook errors:
//server15> maint rebuild c0 u0 p1
Sending rebuild start request to /c0/u0 on 1 disk(s) [1] ... Failed.
(0x0B:0x0035): Replacement drive configuration is invalid for rebuild operation
//server15> maint rebuild c0 u0 p3
Sending rebuild start request to /c0/u0 on 1 disk(s) [3] ... Failed.
(0x0B:0x0020): Drive error
Het lijkt een beetje of de poortjes door elkaar zijn gehusseld.
Aangezien er bij de alarms word gewaarschuwd voor een kapotte disk3, maar dat was de spare.
En tijdens het rescannen zie ik nu tijdens alarms dat nieuwe disk @ port1 is. (terwijl deze als p3 UNKOWN) er staat.
Wat kan ik nu het beste doen?
Nu ben ik een beetje bang om op onbekende manieren te gaan rebuilden.
Ik heb een 9650SE-4LPML met 4 HD's, 3 worden gebruikt, en 1 stond als hot-spare, maar automatisch recoveren hierop is mislukt.
c0 [Sun Apr 20 11:16:29 2008] WARNING Sector repair completed: port=3, LBA=0x7680C45
c0 [Sun Apr 20 11:17:03 2008] WARNING Sector repair completed: port=3, LBA=0x766D745
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x760BA05
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x7680C45
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x7687035
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x760BA05
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x766D745
c0 [Sun Apr 20 11:17:04 2008] WARNING Sector repair completed: port=3, LBA=0x7687035
c0 [Sun Apr 20 11:17:04 2008] ERROR Degraded unit: unit=0, port=3
c0 [Sun Apr 20 11:20:54 2008] INFO Rebuild started: unit=0
c0 [Sun Apr 20 11:43:15 2008] ERROR Drive timeout detected: port=3, unit=0
c0 [Sun Apr 20 11:43:30 2008] ERROR Rebuild failed: unit=0
c0 [Sun Apr 20 11:43:30 2008] ERROR Degraded unit: unit=0, port=3
c0 [Sun Apr 20 11:44:50 2008] WARNING Drive removed: port=3
c0 [Sun Apr 20 11:44:50 2008] ERROR Degraded unit: unit=0, port=3
c0 [Wed Apr 23 19:09:58 2008] INFO Drive inserted: port=1
c0 [Wed Apr 23 20:51:23 2008] INFO Drive inserted: port=1
3 disks, verspreid over 2 units.
Unit UnitType Status %RCmpl %V/I/M Stripe Size(GB) Cache AVrfy
------------------------------------------------------------------------------
u0 RAID-5 DEGRADED - - 64K 148.99 OFF OFF
u1 RAID-5 INOPERABLE - - 64K 148.99 OFF OFF
Port Status Unit Size Blocks Serial
---------------------------------------------------------------
p0 OK u0 74.53 GB 156301488 WD-WMAP97754657
p1 OK u1 74.53 GB 156301488 WD-WMAP97958231
p2 OK u0 74.53 GB 156301488 WD-WMAP97797408
p3 UNKNOWN - 74.53 GB 156301488 WD-WMAP97771570
Unit 0 bevat 1 degraded disk.
//server15> /c0/u0 show
Unit UnitType Status %RCmpl %V/I/M Port Stripe Size(GB)
------------------------------------------------------------------------
u0 RAID-5 DEGRADED - - - 64K 148.99
u0-0 DISK OK - - p2 - 74.4951
u0-1 DISK DEGRADED - - - - 74.4951
u0-2 DISK OK - - p0 - 74.4951
u0/v0 Volume - - - - - 148.99
Unit 1 bevat er 2.
//server15> /c0/u1 show
Unit UnitType Status %RCmpl %V/I/M Port Stripe Size(GB)
------------------------------------------------------------------------
u1 RAID-5 INOPERABLE - - - 64K 148.99
u1-0 DISK DEGRADED - - - - 74.4951
u1-1 DISK OK - - p1 - 74.4951
u1-2 DISK DEGRADED - - - - 74.4951
u1/v0 Volume - - - - - 148.99
Nou probeer ik om disk3 weer als spare toe te voegen aan unit0.
maar zonder succes.
//server15> /c0 add type=spare disk=3
Creating new unit on controller /c0 ... Failed.
(0x0B:0x0020): Drive error
Pogingen om te rebuilden geven ook errors:
//server15> maint rebuild c0 u0 p1
Sending rebuild start request to /c0/u0 on 1 disk(s) [1] ... Failed.
(0x0B:0x0035): Replacement drive configuration is invalid for rebuild operation
//server15> maint rebuild c0 u0 p3
Sending rebuild start request to /c0/u0 on 1 disk(s) [3] ... Failed.
(0x0B:0x0020): Drive error
Het lijkt een beetje of de poortjes door elkaar zijn gehusseld.
Aangezien er bij de alarms word gewaarschuwd voor een kapotte disk3, maar dat was de spare.
En tijdens het rescannen zie ik nu tijdens alarms dat nieuwe disk @ port1 is. (terwijl deze als p3 UNKOWN) er staat.
Wat kan ik nu het beste doen?