View Issue Details

IDProjectCategoryView StatusLast Update
0015801CentOS-7kernelpublic2019-04-11 06:36
Reporternaveen.centos 
PriorityhighSeveritymajorReproducibilityrandom
Status assignedResolutionopen 
Product Version7.4.1708 
Target VersionFixed in Version 
Summary0015801: Disks are getting offline on HP server running with Centos7.4
DescriptionWe are experiencing disks are going offline with below messages, this happens randomly and only on HP servers which was newly added. we have also upgraded to latest firmware version and also installed kmod-smartpqi-1.1.4-133.rhel7u4.x86_64.rpm driver recommended by HP. Still this issue keep re-occurring randomly.

kernel: smartpqi 0000:5c:00.0: resetting scsi 1:1:0:4
kernel: smartpqi 0000:5c:00.0: reset of scsi 1:1:0:4: SUCCESS
kernel: sd 1:1:0:4: [sde] Medium access timeout failure. Offlining disk!
kernel: smartpqi 0000:5c:00.0: resetting scsi 1:1:0:6
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: [sde] killing request
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: Buffer I/O error on dev sde1, logical block 330494541, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494542, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494543, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494544, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494545, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494546, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494547, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494548, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494549, lost async page write
kernel: Buffer I/O error on dev sde1, logical block 330494550, lost async page write
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device
kernel: sd 1:1:0:4: rejecting I/O to offline device

Steps To ReproduceNA
Additional InformationNA
Tags7.4
abrt_hash
URL

Activities

naveen.centos

naveen.centos

2019-02-08 07:29

reporter   ~0033791

Similar issue reported as RHEL bug

https://bugzilla.redhat.com/show_bug.cgi?id=1666912
naveen.centos

naveen.centos

2019-03-12 08:56

reporter   ~0033982

Can anyone help on this issue.
tigalch

tigalch

2019-03-12 09:14

manager   ~0033983

Before anything, you should update to 7.6.
TrevorH

TrevorH

2019-03-12 16:00

manager   ~0033984

And no-one can help because the fix, according to the bugzilla entry you posted, is not scheduled to be included in a RHEL kernel until 7.7 drops and only Redhat knows when that will be.
toracat

toracat

2019-03-14 17:53

manager   ~0034000

kernel-plus-3.10.0-957.10.1.el7.centos.plus which will be released soon has the patch in the referenced RHBZ.
toracat

toracat

2019-03-18 18:36

manager   ~0034034

kernel-plus-3.10.0-957.10.1.el7.centos.plus has just been released and is sync'ing to mirrors.
toracat

toracat

2019-03-18 18:39

manager   ~0034035

@naveen.centos

Your feedback appreciated.
naveen.centos

naveen.centos

2019-03-19 04:58

reporter   ~0034039

Hi All,

Thanks for your replies, @toracat we will get kernel-plus-3.10.0-957.10.1.el7.centos.plus installed and tested.
naveen.centos

naveen.centos

2019-04-10 08:05

reporter   ~0034176

Hi All,

After we upgraded to Centos 7.6 with kernel-3.10.0-957.10.1.el7.x86_64, disk offline issue has not reoccurred, we have kept under observation for more than 2 weeks, system seems to be stable.

Thanks for your support.
pgreco

pgreco

2019-04-10 09:47

developer   ~0034177

@naveen.centos using stock kernel or plus kernel?
naveen.centos

naveen.centos

2019-04-11 06:36

reporter   ~0034180

we used stock kernel, since we have production servers.

Issue History

Date Modified Username Field Change
2019-02-08 06:44 naveen.centos New Issue
2019-02-08 07:29 naveen.centos Note Added: 0033791
2019-02-18 10:05 naveen.centos Tag Attached: 7.4
2019-03-12 08:56 naveen.centos Note Added: 0033982
2019-03-12 09:14 tigalch Note Added: 0033983
2019-03-12 16:00 TrevorH Note Added: 0033984
2019-03-14 17:53 toracat Note Added: 0034000
2019-03-14 18:01 toracat Status new => assigned
2019-03-18 18:36 toracat Note Added: 0034034
2019-03-18 18:39 toracat Status assigned => feedback
2019-03-18 18:39 toracat Note Added: 0034035
2019-03-19 04:58 naveen.centos Note Added: 0034039
2019-03-19 04:58 naveen.centos Status feedback => assigned
2019-03-28 23:43 toracat Status assigned => feedback
2019-04-10 08:05 naveen.centos Note Added: 0034176
2019-04-10 08:05 naveen.centos Status feedback => assigned
2019-04-10 09:47 pgreco Note Added: 0034177
2019-04-11 06:36 naveen.centos Note Added: 0034180