[gpfsug-discuss] [EXTERNAL] Kernel crashes with Spectrum Scale and RHEL 7.7 3.10.0-1062.18.1.el7 kernel
Schuler, Laurence (GSFC-606.4)[ADNET SYSTEMS INC]
laurence.schuler at nasa.gov
Wed Apr 15 16:49:59 BST 2020
Will this impact *any* version of Spectrum Scale?
-Laurence
From: <gpfsug-discuss-bounces at spectrumscale.org> on behalf of Felipe Knop <knop at us.ibm.com>
Reply-To: gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
Date: Wednesday, April 15, 2020 at 11:30 AM
To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
Subject: [EXTERNAL] [gpfsug-discuss] Kernel crashes with Spectrum Scale and RHEL 7.7 3.10.0-1062.18.1.el7 kernel
All,
A problem has been identified with Spectrum Scale when running on RHEL 7.7 with kernel 3.10.0-1062.18.1.el7. While a fix is currently being developed, customers should not move up to this kernel level.
The new kernel was issued on March 17 via the following errata: https://access.redhat.com/errata/RHSA-2020:0834
When this kernel is used with Scale, system crashes have been observed. The following are a couple of examples of kernel stack traces for the crash:
[ 2915.625015] BUG: unable to handle kernel NULL pointer dereference at 0000000000000040
[ 2915.633770] IP: [<ffffffffc0e2cf90>] cxiDropSambaDCacheEntry+0x190/0x1b0 [mmfslinux]
[ 2915.914097] [<ffffffffc0e3d28c>] gpfs_i_rmdir+0x29c/0x310 [mmfslinux]
[ 2915.921381] [<ffffffffb9663130>] ? take_dentry_name_snapshot+0xf0/0xf0
[ 2915.928760] [<ffffffffb9664f60>] ? shrink_dcache_parent+0x60/0x90
[ 2915.935656] [<ffffffffb96577cc>] vfs_rmdir+0xdc/0x150
[ 2915.941388] [<ffffffffb965cca1>] do_rmdir+0x1f1/0x220
[ 2915.947119] [<ffffffffb964ce66>] ? __fput+0x186/0x260
[ 2915.952849] [<ffffffffb964d02e>] ? ____fput+0xe/0x10
[ 2915.958484] [<ffffffffb94c2e60>] ? task_work_run+0xc0/0xe0
[ 2915.964701] [<ffffffffb965df05>] SyS_unlinkat+0x25/0x40
[1224278.495993] [<ffffffff88e63918>] __dentry_kill+0x128/0x190
[1224278.496678] [<ffffffff88e63a36>] dput+0xb6/0x1a0
[1224278.497378] [<ffffffff88e64116>] d_prune_aliases+0xb6/0xf0
[1224278.498083] [<ffffffffc0c2c0ea>] cxiPruneDCacheEntry+0x13a/0x1c0 [mmfslinux]
[1224278.498798] [<ffffffffc0eba608>] _ZN10gpfsNode_t16invalidateOSNodeEPS_Pvij+0x108/0x350 [mmfs26]
RHEL 7.8 is also impacted by the same problem, but validation of Scale with 7.8 is still under way.
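For anyone who wants to check nodes before or after patching, a minimal sketch along these lines may help. It only compares the running kernel release against the one level named in this note. The function name `is_affected_kernel` and the constant are illustrative, not part of any Scale tooling, and the RHEL 7.8 kernel level is not listed because it is not stated here.

```python
import platform

# Kernel level named in this advisory. The RHEL 7.8 kernel is also reported
# as impacted, but its exact level is not given here, so it is not listed.
AFFECTED_KERNEL = "3.10.0-1062.18.1.el7"


def is_affected_kernel(release: str) -> bool:
    """Return True if a kernel release string (as printed by `uname -r`)
    matches the affected level, with or without an arch suffix."""
    return release == AFFECTED_KERNEL or release.startswith(AFFECTED_KERNEL + ".")


if __name__ == "__main__":
    running = platform.release()  # e.g. "3.10.0-1062.18.1.el7.x86_64"
    if is_affected_kernel(running):
        print(f"{running}: matches the kernel level flagged in this advisory")
    else:
        print(f"{running}: does not match the flagged kernel level")
```

This checks the running kernel only. A newer kernel that is installed but not yet booted would not be detected by this sketch.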
Felipe
----
Felipe Knop knop at us.ibm.com
GPFS Development and Security
IBM Systems
IBM Building 008
2455 South Rd, Poughkeepsie, NY 12601
(845) 433-9314 T/L 293-9314