@p last week we racked a server full of 12x 10TB disks and 3/4 dozen NVMe drives. I finally started working on it after getting home, and one of the drives shit out errors like crazy. it got marked as failed in the soft raid10, so i removed it via mdadm, used wipefs to clear the RAID headers off the disk, and re-added it to the array; it's rebuilding without issues
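roughly what that looked like, with /dev/md0 and /dev/sdX as stand-ins for the actual array and disk:

```
# the kernel had already flagged the disk as failed, so pull it from the array
mdadm /dev/md0 --remove /dev/sdX
# wipe the old RAID superblock/signatures off the disk
wipefs -a /dev/sdX
# re-add it and let the raid10 rebuild onto it
mdadm /dev/md0 --add /dev/sdX
# keep an eye on the resync progress
cat /proc/mdstat
```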
hoping the disk passes this recovery; it would be shitty to have to replace a drive within literally the first couple hours of using it
@graf Well, if there's a 0.1% defect rate, that's about a 1.2% chance of at least one defective drive in a batch of 12. Not completely unheard-of, but not likely either.
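(assuming independent failures, that's 1 − 0.999^12 ≈ 0.0119, so right around 1.2%.)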
@p@splitshockvirus i think the failure might have happened when i was making other raids on the same bus. the drive seems fine and passes smart tests, so i'm rebuilding on the assumption that resetting its status is enough. i don't think the drive is at fault right now, but something definitely fucked up, and in production i can't have that
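for the smart check, this is roughly it (smartmontools; /dev/sdX is a placeholder for the actual disk):

```
# run a short self-test, then dump the attributes and self-test log
smartctl -t short /dev/sdX
smartctl -a /dev/sdX
```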