From UWEFALKE at de.ibm.com  Tue Nov  1 08:41:37 2016
From: UWEFALKE at de.ibm.com (Uwe Falke)
Date: Tue, 1 Nov 2016 09:41:37 +0100
Subject: [gpfsug-discuss] Recent Whitepapers from Yuri Volobuev
In-Reply-To:
References:
Message-ID:

Another serious loss ...

Mit freundlichen Grüßen / Kind regards

Dr. Uwe Falke
IT Specialist
High Performance Computing Services / Integrated Technology Services / Data Center Services
-------------------------------------------------------------------------------------------------------------------------------------------
IBM Deutschland
Rathausstr. 7
09111 Chemnitz
Phone: +49 371 6978 2165
Mobile: +49 175 575 2877
E-Mail: uwefalke at de.ibm.com
-------------------------------------------------------------------------------------------------------------------------------------------
IBM Deutschland Business & Technology Services GmbH / Geschäftsführung: Frank Hammer, Thorsten Moehring
Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, HRB 17122

From:   "Oesterlin, Robert"
To:     gpfsug main discussion list
Date:   10/31/2016 10:54 AM
Subject:        [gpfsug-discuss] Recent Whitepapers from Yuri Volobuev
Sent by:        gpfsug-discuss-bounces at spectrumscale.org

For those of you who may not know, Yuri Volobuev has left IBM to pursue new challenges. I, along with many others, received so much help and keen insight from Yuri on all things GPFS. He will be missed.

Bob Oesterlin
Sr Principal Storage Engineer, Nuance

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From makaplan at us.ibm.com  Tue Nov  1 16:37:17 2016
From: makaplan at us.ibm.com (Marc A Kaplan)
Date: Tue, 1 Nov 2016 11:37:17 -0500
Subject: [gpfsug-discuss] wanted...gpfs policy that places larger files onto a pool based on size
In-Reply-To:
References: <21BC488F0AEA2245B2C3E83FC0B33DBB063A1D4A@CHI-EXCHANGEW1.w2k.jumptrading.com>
Message-ID:

Placement policy rules, SET POOL ..., are evaluated at open/create time, before any write(2) calls have been made, so GPFS has no "idea" how big the file will ultimately be. In other words, at file creation time FILE_SIZE, had we implemented it, would always be 0. Rather than mislead you and have to answer the question "why is FILE_SIZE==0?", we left FILE_SIZE undefined in SET POOL rules.

Of course, we at IBM have thought of at least some other scenarios, and we are listening here... As the last so many years show, GPFS continues to add features, etc, etc.

-- marc

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
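To illustrate the distinction Marc describes, here is a minimal policy sketch (the pool names are hypothetical): a SET POOL placement rule fires at file creation and cannot test the eventual size, while a MIGRATE rule evaluated later by mmapplypolicy can use FILE_SIZE.

/* Placement: evaluated at open/create, before any data is written */
RULE 'place-all' SET POOL 'fast'

/* Migration: evaluated later by mmapplypolicy, when FILE_SIZE is known */
RULE 'move-big' MIGRATE FROM POOL 'fast' TO POOL 'capacity'
     WHERE FILE_SIZE > 1073741824   /* roughly, files larger than 1 GiB */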
From billowen at us.ibm.com  Wed Nov  2 14:28:15 2016
From: billowen at us.ibm.com (Bill Owen)
Date: Wed, 2 Nov 2016 07:28:15 -0700
Subject: [gpfsug-discuss] unified file and object
In-Reply-To:
References:
Message-ID:

Hi Leslie,
Can you also send the /etc/swift/object-server-sof.conf file from this system?

Here is a sample of the file from my working system - it sounds like the config file may not be complete on your system:

[root at spectrumscale ~]# cat /etc/swift/object-server-sof.conf
[DEFAULT]
bind_ip = 127.0.0.1
bind_port = 6203
workers = 3
mount_check = false
log_name = object-server-sof
log_level = ERROR
id_mgmt = unified_mode
retain_acl = yes
retain_winattr = yes
retain_xattr = yes
retain_owner = yes
tempfile_prefix = .ibmtmp_
disable_fallocate = true
log_statsd_host = localhost
log_statsd_port = 8125
log_statsd_default_sample_rate = 1.0
log_statsd_sample_rate_factor = 1.0
log_statsd_metric_prefix =
devices = /gpfs/fs1/object_fileset/o

[pipeline:main]
pipeline = object-server

[app:object-server]
use = egg:swiftonfile#object
disk_chunk_size = 65536
network_chunk_size = 65536

[object-replicator]

[object-updater]

[object-auditor]

[object-reconstructor]


Bill Owen
billowen at us.ibm.com
Spectrum Scale Object Storage
520-799-4829


From:   leslie elliott
To:     gpfsug main discussion list
Date:   10/29/2016 03:53 AM
Subject:        Re: [gpfsug-discuss] unified file and object
Sent by:        gpfsug-discuss-bounces at spectrumscale.org

Bill

to be clear the file access I mentioned was in relation to SMB and NFS using mmuserauth rather than the unification with the object store since it is required as well

but I did try to do this for object as well using the Administration and Programming Reference from page 142, was using unified_mode rather than local_mode

mmobj config change --ccrfile spectrum-scale-object.conf --section capabilities --property file-access-enabled --value true

the mmuserauth failed as you are aware, we have created test accounts without spaces in the DN and were successful with this step, so eagerly await a fix to be able to use the correct accounts

mmobj config change --ccrfile object-server-sof.conf --section DEFAULT --property id_mgmt --value unified_mode
mmobj config change --ccrfile object-server-sof.conf --section DEFAULT --property ad_domain --value DOMAIN

we have successfully tested object stores on this cluster with simple auth

the output you asked for is as follows

[root at pren-gs7k-vm4 ~]# cat /etc/swift/object-server-sof.conf
[DEFAULT]
devices = /gpfs/pren01/ObjectFileset/o
log_level = ERROR

[root at pren-gs7k-vm4 ~]# systemctl -l status openstack-swift-object-sof
● openstack-swift-object-sof.service - OpenStack Object Storage (swift) - Object Server
   Loaded: loaded (/usr/lib/systemd/system/openstack-swift-object-sof.service; disabled; vendor preset: disabled)
   Active: failed (Result: exit-code) since Sat 2016-10-29 10:30:22 UTC; 27s ago
  Process: 8086 ExecStart=/usr/bin/swift-object-server-sof /etc/swift/object-server-sof.conf (code=exited, status=1/FAILURE)
 Main PID: 8086 (code=exited, status=1/FAILURE)

Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Started OpenStack Object Storage (swift) - Object Server.
Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Starting OpenStack Object Storage (swift) - Object Server...
Oct 29 10:30:22 pren-gs7k-vm4 swift-object-server-sof[8086]: Error trying to load config from /etc/swift/object-server-sof.conf: No section 'object-server' (prefixed by 'app' or 'application' or 'composite' or 'composit' or 'pipeline' or 'filter-app') found in config /etc/swift/object-server-sof.conf
Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service: main process exited, code=exited, status=1/FAILURE
Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Unit openstack-swift-object-sof.service entered failed state.
Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service failed. I am happy to help you or for you to help to debug this problem via a short call thanks leslie On 29 October 2016 at 00:37, Bill Owen wrote: 2. Can you provide more details on how you configured file access? The normal procedure is to use "mmobj file-access enable", and this will set up the required settings in the config file. Can you send us: - the steps used to configure file access - the resulting /etc/swift/object-server-sof.conf - log files from /var/log/swift or output of "systemctl status openstack-swift-object-sof" We can schedule a short call to help debug if needed. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From leslie.james.elliott at gmail.com Wed Nov 2 22:00:25 2016 From: leslie.james.elliott at gmail.com (leslie elliott) Date: Thu, 3 Nov 2016 08:00:25 +1000 Subject: [gpfsug-discuss] unified file and object In-Reply-To: References: Message-ID: Bill you are correct about it missing details [root at pren-gs7k-vm4 ~]# cat /etc/swift/object-server-sof.conf [DEFAULT] devices = /gpfs/pren01/ObjectFileset/o log_level = ERROR now that I have yours for reference I have updated the file and the service starts, but I am unsure why it was not provisioned correctly initially leslie On 3 November 2016 at 00:28, Bill Owen wrote: > Hi Leslie, > Can you also send the /etc/swift/object-server-sof.conf file from this > system? 
> > Here is a sample of the file from my working system - it sounds like the > config file may not be complete on your system: > [root at spectrumscale ~]# cat /etc/swift/object-server-sof.conf > [DEFAULT] > bind_ip = 127.0.0.1 > bind_port = 6203 > workers = 3 > mount_check = false > log_name = object-server-sof > log_level = ERROR > id_mgmt = unified_mode > retain_acl = yes > retain_winattr = yes > retain_xattr = yes > retain_owner = yes > tempfile_prefix = .ibmtmp_ > disable_fallocate = true > log_statsd_host = localhost > log_statsd_port = 8125 > log_statsd_default_sample_rate = 1.0 > log_statsd_sample_rate_factor = 1.0 > log_statsd_metric_prefix = > devices = /gpfs/fs1/object_fileset/o > > [pipeline:main] > pipeline = object-server > > [app:object-server] > use = egg:swiftonfile#object > disk_chunk_size = 65536 > network_chunk_size = 65536 > > [object-replicator] > > [object-updater] > > [object-auditor] > > [object-reconstructor] > > > Bill Owen > billowen at us.ibm.com > Spectrum Scale Object Storage > 520-799-4829 > > > [image: Inactive hide details for leslie elliott ---10/29/2016 03:53:48 > AM---Bill to be clear the file access I mentioned was in relat]leslie > elliott ---10/29/2016 03:53:48 AM---Bill to be clear the file access I > mentioned was in relation to SMB and NFS > > From: leslie elliott > To: gpfsug main discussion list > Date: 10/29/2016 03:53 AM > Subject: Re: [gpfsug-discuss] unified file and object > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------ > > > > Bill > > to be clear the file access I mentioned was in relation to SMB and NFS > using mmuserauth rather than the unification with the object store since it > is required as well > > but I did try to do this for object as well using the Administration and > Programming Reference from page 142, was using unified_mode rather than > local_mode > > mmobj config change --ccrfile spectrum-scale-object.conf --section > capabilities --property file-access-enabled --value true > > the mmuserauth failed as you are aware, we have created test accounts > without spaces in the DN and were successful with this step, so eagerly > await a fix to be able to use the correct accounts > > mmobj config change --ccrfile object-server-sof.conf --section DEFAULT > --property id_mgmt --value unified_mode > mmobj config change --ccrfile object-server-sof.conf --section DEFAULT > --property ad_domain --value DOMAIN > > > we have successfully tested object stores on this cluster with simple auth > > > the output you asked for is as follows > > [root at pren-gs7k-vm4 ~]# cat /etc/swift/object-server-sof.conf > [DEFAULT] > devices = /gpfs/pren01/ObjectFileset/o > log_level = ERROR > > > [root at pren-gs7k-vm4 ~]# systemctl -l status openstack-swift-object-sof > ? openstack-swift-object-sof.service - OpenStack Object Storage (swift) - > Object Server > Loaded: loaded (/usr/lib/systemd/system/openstack-swift-object-sof.service; > disabled; vendor preset: disabled) > Active: failed (Result: exit-code) since Sat 2016-10-29 10:30:22 UTC; > 27s ago > Process: 8086 ExecStart=/usr/bin/swift-object-server-sof > /etc/swift/object-server-sof.conf (code=exited, status=1/FAILURE) > Main PID: 8086 (code=exited, status=1/FAILURE) > > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Started OpenStack Object Storage > (swift) - Object Server. > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Starting OpenStack Object > Storage (swift) - Object Server... 
> Oct 29 10:30:22 pren-gs7k-vm4 swift-object-server-sof[8086]: Error trying > to load config from /etc/swift/object-server-sof.conf: No section > 'object-server' (prefixed by 'app' or 'application' or 'composite' or > 'composit' or 'pipeline' or 'filter-app') found in config > /etc/swift/object-server-sof.conf > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service: > main process exited, code=exited, status=1/FAILURE > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Unit openstack-swift-object-sof.service > entered failed state. > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service > failed. > > > > > I am happy to help you or for you to help to debug this problem via a > short call > > > thanks > > leslie > > > > On 29 October 2016 at 00:37, Bill Owen <*billowen at us.ibm.com* > > wrote: > > > 2. Can you provide more details on how you configured file access? The > normal procedure is to use "mmobj file-access enable", and this will set up > the required settings in the config file. Can you send us: > - the steps used to configure file access > - the resulting /etc/swift/object-server-sof.conf > - log files from /var/log/swift or output of "systemctl status > openstack-swift-object-sof" > > We can schedule a short call to help debug if needed. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From billowen at us.ibm.com Wed Nov 2 23:39:48 2016 From: billowen at us.ibm.com (Bill Owen) Date: Wed, 2 Nov 2016 16:39:48 -0700 Subject: [gpfsug-discuss] unified file and object In-Reply-To: References: Message-ID: > now that I have yours for reference I have updated the file and the service starts, but I am unsure why it was not provisioned correctly initially Do you have the log from the original installation? Did you install using the spectrumscale install toolkit? Thanks, Bill Owen billowen at us.ibm.com Spectrum Scale Object Storage 520-799-4829 From: leslie elliott To: gpfsug main discussion list Date: 11/02/2016 03:00 PM Subject: Re: [gpfsug-discuss] unified file and object Sent by: gpfsug-discuss-bounces at spectrumscale.org Bill you are correct about it missing details [root at pren-gs7k-vm4 ~]# cat /etc/swift/object-server-sof.conf [DEFAULT] devices = /gpfs/pren01/ObjectFileset/o log_level = ERROR now that I have yours for reference I have updated the file and the service starts, but I am unsure why it was not provisioned correctly initially leslie On 3 November 2016 at 00:28, Bill Owen wrote: Hi Leslie, Can you also send the /etc/swift/object-server-sof.conf file from this system? 
Here is a sample of the file from my working system - it sounds like the config file may not be complete on your system: [root at spectrumscale ~]# cat /etc/swift/object-server-sof.conf [DEFAULT] bind_ip = 127.0.0.1 bind_port = 6203 workers = 3 mount_check = false log_name = object-server-sof log_level = ERROR id_mgmt = unified_mode retain_acl = yes retain_winattr = yes retain_xattr = yes retain_owner = yes tempfile_prefix = .ibmtmp_ disable_fallocate = true log_statsd_host = localhost log_statsd_port = 8125 log_statsd_default_sample_rate = 1.0 log_statsd_sample_rate_factor = 1.0 log_statsd_metric_prefix = devices = /gpfs/fs1/object_fileset/o [pipeline:main] pipeline = object-server [app:object-server] use = egg:swiftonfile#object disk_chunk_size = 65536 network_chunk_size = 65536 [object-replicator] [object-updater] [object-auditor] [object-reconstructor] Bill Owen billowen at us.ibm.com Spectrum Scale Object Storage 520-799-4829 Inactive hide details for leslie elliott ---10/29/2016 03:53:48 AM---Bill to be clear the file access I mentioned was in relatleslie elliott ---10/29/2016 03:53:48 AM---Bill to be clear the file access I mentioned was in relation to SMB and NFS From: leslie elliott To: gpfsug main discussion list Date: 10/29/2016 03:53 AM Subject: Re: [gpfsug-discuss] unified file and object Sent by: gpfsug-discuss-bounces at spectrumscale.org Bill to be clear the file access ?I mentioned was in relation to SMB and NFS using mmuserauth rather than the unification with the object store since it is required as well but I did try to do this for object as well using the Administration and Programming Reference from page 142, was using unified_mode rather than local_mode mmobj config change --ccrfile spectrum-scale-object.conf --section capabilities --property file-access-enabled --value true the mmuserauth failed as you are aware, we have created test accounts without spaces in the DN and were successful with this step, so eagerly await a fix to be able to use the correct accounts mmobj config change --ccrfile object-server-sof.conf --section DEFAULT --property id_mgmt --value unified_mode mmobj config change --ccrfile object-server-sof.conf --section DEFAULT --property ad_domain --value DOMAIN we have successfully tested object stores on this cluster with simple auth the output you asked for is as follows [root at pren-gs7k-vm4 ~]# cat /etc/swift/object-server-sof.conf [DEFAULT] devices = /gpfs/pren01/ObjectFileset/o log_level = ERROR [root at pren-gs7k-vm4 ~]# systemctl -l status openstack-swift-object-sof ? openstack-swift-object-sof.service - OpenStack Object Storage (swift) - Object Server ? ?Loaded: loaded (/usr/lib/systemd/system/openstack-swift-object-sof.service; disabled; vendor preset: disabled) ? ?Active: failed (Result: exit-code) since Sat 2016-10-29 10:30:22 UTC; 27s ago ? Process: 8086 ExecStart=/usr/bin/swift-object-server-sof /etc/swift/object-server-sof.conf (code=exited, status=1/FAILURE) ?Main PID: 8086 (code=exited, status=1/FAILURE) Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Started OpenStack Object Storage (swift) - Object Server. Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Starting OpenStack Object Storage (swift) - Object Server... 
Oct 29 10:30:22 pren-gs7k-vm4 swift-object-server-sof[8086]: Error trying to load config from /etc/swift/object-server-sof.conf: No section 'object-server' (prefixed by 'app' or 'application' or 'composite' or 'composit' or 'pipeline' or 'filter-app') found in config /etc/swift/object-server-sof.conf Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service: main process exited, code=exited, status=1/FAILURE Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Unit openstack-swift-object-sof.service entered failed state. Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service failed. I am happy to help you or for you to help to debug this problem via a short call thanks leslie On 29 October 2016 at 00:37, Bill Owen wrote: 2. Can you provide more details on how you configured file access? The normal procedure is to use "mmobj file-access enable", and this will set up the required settings in the config file. Can you send us: - the steps used to configure file access - the resulting /etc/swift/object-server-sof.conf - log files from /var/log/swift or output of "systemctl status openstack-swift-object-sof" We can schedule a short call to help debug if needed. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From leslie.james.elliott at gmail.com Thu Nov 3 04:37:43 2016 From: leslie.james.elliott at gmail.com (leslie elliott) Date: Thu, 3 Nov 2016 14:37:43 +1000 Subject: [gpfsug-discuss] unified file and object In-Reply-To: References: Message-ID: Sorry I don't have an install log This is a DDN installation so while I believe the use the spectrum scale toolkit I can not confirm this Thanks Leslie On Thursday, 3 November 2016, Bill Owen wrote: > > now that I have yours for reference I have updated the file and the > service starts, but I am unsure why it was not provisioned correctly > initially > Do you have the log from the original installation? Did you install using > the spectrumscale install toolkit? 
> > Thanks, > Bill Owen > billowen at us.ibm.com > Spectrum Scale Object Storage > 520-799-4829 > > > [image: Inactive hide details for leslie elliott ---11/02/2016 03:00:55 > PM---Bill you are correct about it missing details]leslie elliott > ---11/02/2016 03:00:55 PM---Bill you are correct about it missing details > > From: leslie elliott > > To: gpfsug main discussion list > > Date: 11/02/2016 03:00 PM > Subject: Re: [gpfsug-discuss] unified file and object > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------ > > > > Bill > > you are correct about it missing details > > > [root at pren-gs7k-vm4 ~]# cat /etc/swift/object-server-sof.conf > [DEFAULT] > devices = /gpfs/pren01/ObjectFileset/o > log_level = ERROR > > > > now that I have yours for reference I have updated the file and the > service starts, but I am unsure why it was not provisioned correctly > initially > > leslie > > > On 3 November 2016 at 00:28, Bill Owen <*billowen at us.ibm.com* > > wrote: > > Hi Leslie, > Can you also send the /etc/swift/object-server-sof.conf file from this > system? > > Here is a sample of the file from my working system - it sounds like > the config file may not be complete on your system: > [root at spectrumscale ~]# cat /etc/swift/object-server-sof.conf > [DEFAULT] > bind_ip = 127.0.0.1 > bind_port = 6203 > workers = 3 > mount_check = false > log_name = object-server-sof > log_level = ERROR > id_mgmt = unified_mode > retain_acl = yes > retain_winattr = yes > retain_xattr = yes > retain_owner = yes > tempfile_prefix = .ibmtmp_ > disable_fallocate = true > log_statsd_host = localhost > log_statsd_port = 8125 > log_statsd_default_sample_rate = 1.0 > log_statsd_sample_rate_factor = 1.0 > log_statsd_metric_prefix = > devices = /gpfs/fs1/object_fileset/o > > [pipeline:main] > pipeline = object-server > > [app:object-server] > use = egg:swiftonfile#object > disk_chunk_size = 65536 > network_chunk_size = 65536 > > [object-replicator] > > [object-updater] > > [object-auditor] > > [object-reconstructor] > > > Bill Owen > *billowen at us.ibm.com* > > Spectrum Scale Object Storage > 520-799-4829 > > > [image: Inactive hide details for leslie elliott ---10/29/2016 > 03:53:48 AM---Bill to be clear the file access I mentioned was in relat]leslie > elliott ---10/29/2016 03:53:48 AM---Bill to be clear the file access I > mentioned was in relation to SMB and NFS > > From: leslie elliott <*leslie.james.elliott at gmail.com* > > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Date: 10/29/2016 03:53 AM > Subject: Re: [gpfsug-discuss] unified file and object > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > ------------------------------ > > > > Bill > > to be clear the file access I mentioned was in relation to SMB and > NFS using mmuserauth rather than the unification with the object store > since it is required as well > > but I did try to do this for object as well using the Administration > and Programming Reference from page 142, was using unified_mode rather than > local_mode > > mmobj config change --ccrfile spectrum-scale-object.conf --section > capabilities --property file-access-enabled --value true > > the mmuserauth failed as you are aware, we have created test accounts > without spaces in the DN and were successful with this step, so eagerly > await a fix to be able to use the correct accounts > > mmobj config change --ccrfile object-server-sof.conf --section DEFAULT > --property id_mgmt --value unified_mode > mmobj config change 
--ccrfile object-server-sof.conf --section DEFAULT > --property ad_domain --value DOMAIN > > > we have successfully tested object stores on this cluster with simple > auth > > > the output you asked for is as follows > > [root at pren-gs7k-vm4 ~]# cat /etc/swift/object-server-sof.conf > [DEFAULT] > devices = /gpfs/pren01/ObjectFileset/o > log_level = ERROR > > > [root at pren-gs7k-vm4 ~]# systemctl -l status openstack-swift-object-sof > ? openstack-swift-object-sof.service - OpenStack Object Storage > (swift) - Object Server > Loaded: loaded (/usr/lib/systemd/system/openstack-swift-object-sof.service; > disabled; vendor preset: disabled) > Active: failed (Result: exit-code) since Sat 2016-10-29 10:30:22 > UTC; 27s ago > Process: 8086 ExecStart=/usr/bin/swift-object-server-sof > /etc/swift/object-server-sof.conf (code=exited, status=1/FAILURE) > Main PID: 8086 (code=exited, status=1/FAILURE) > > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Started OpenStack Object > Storage (swift) - Object Server. > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Starting OpenStack Object > Storage (swift) - Object Server... > Oct 29 10:30:22 pren-gs7k-vm4 swift-object-server-sof[8086]: Error > trying to load config from /etc/swift/object-server-sof.conf: No > section 'object-server' (prefixed by 'app' or 'application' or 'composite' > or 'composit' or 'pipeline' or 'filter-app') found in config > /etc/swift/object-server-sof.conf > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service: > main process exited, code=exited, status=1/FAILURE > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: Unit > openstack-swift-object-sof.service entered failed state. > Oct 29 10:30:22 pren-gs7k-vm4 systemd[1]: openstack-swift-object-sof.service > failed. > > > > > I am happy to help you or for you to help to debug this problem via a > short call > > > thanks > > leslie > > > > On 29 October 2016 at 00:37, Bill Owen <*billowen at us.ibm.com* > > wrote: > > 2. Can you provide more details on how you configured file > access? The normal procedure is to use "mmobj file-access enable", and this > will set up the required settings in the config file. Can you send us: > - the steps used to configure file access > - the resulting /etc/swift/object-server-sof.conf > - log files from /var/log/swift or output of "systemctl status > openstack-swift-object-sof" > > We can schedule a short call to help debug if needed. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
URL:

From Mark.Bush at siriuscom.com  Fri Nov  4 16:18:17 2016
From: Mark.Bush at siriuscom.com (Mark.Bush at siriuscom.com)
Date: Fri, 4 Nov 2016 16:18:17 +0000
Subject: [gpfsug-discuss] CES and IP's that disappear
Message-ID: <451248FD-D116-4C3D-A439-73967C287F6C@siriuscom.com>

I continue to run into a problem where after I get CES setup properly and the ces-ip addresses show up, I then add an NFS export and all of a sudden the ces-ips disappear from the protocol nodes. I have been scouring the problem determination guide but can't seem to find out what is going on. It makes no sense to me that the IPs would disappear, especially after a simple task of just adding an NFS export. I first thought this was just a GUI issue, but I just got done trying it all from the CLI and the same thing happens. Has anyone seen anything like this?

Mark

This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you.

Sirius Computer Solutions
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From leslie.james.elliott at gmail.com  Sat Nov  5 06:09:34 2016
From: leslie.james.elliott at gmail.com (leslie elliott)
Date: Sat, 5 Nov 2016 16:09:34 +1000
Subject: [gpfsug-discuss] HAWC and LROC
Message-ID:

Hi, I am curious if anyone has run these together on a client and whether it helped

If we wanted to have these functions out at the client to optimise compute IO in a couple of special cases

can both exist at the same time on the same nonvolatile hardware or do the two functions need independent devices

and what would be the process to disestablish them on the clients as the requirement was satisfied

thanks

leslie
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From olaf.weiser at de.ibm.com  Sat Nov  5 13:39:58 2016
From: olaf.weiser at de.ibm.com (Olaf Weiser)
Date: Sat, 5 Nov 2016 13:39:58 +0000
Subject: [gpfsug-discuss] HAWC and LROC
Message-ID:

You can use both - HAWC, LROC - on the same node... but you need dedicated, independent block devices ...
In addition, for HAWC you could consider replication and use 2 devices, even across 2 nodes. ...

Gesendet von IBM Verse

leslie elliott --- [gpfsug-discuss] HAWC and LROC ---

Von: "leslie elliott"
An: "gpfsug main discussion list"
Datum: Sa. 05.11.2016 02:09
Betreff: [gpfsug-discuss] HAWC and LROC

Hi, I am curious if anyone has run these together on a client and whether it helped

If we wanted to have these functions out at the client to optimise compute IO in a couple of special cases

can both exist at the same time on the same nonvolatile hardware or do the two functions need independent devices

and what would be the process to disestablish them on the clients as the requirement was satisfied

thanks

leslie
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
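As a rough sketch of Olaf's point (and of the follow-ups below), the two functions are set up through separate NSD stanzas on the client node. The device names, NSD names and node name here are made up, and the exact attributes for the HAWC log disk (system.log pool) should be checked against the LROC/HAWC documentation:

# LROC: a local partition used only as a read cache on this node
%nsd: device=/dev/sdb2 nsd=node1_lroc servers=node1 usage=localCache

# HAWC: a partition holding a replica of the recovery log in the system.log pool
%nsd: device=/dev/sdb1 nsd=node1_hawc servers=node1 usage=metadataOnly pool=system.log

After mmcrnsd (and, for the HAWC disk, mmadddisk into the file system), HAWC is switched on by raising the write cache threshold, for example "mmchfs fs1 --write-cache-threshold 64K"; setting the threshold back to 0 and removing the disks again is, broadly, the way to disestablish it on a client when no longer needed.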
From oehmes at gmail.com  Sat Nov  5 16:17:52 2016
From: oehmes at gmail.com (Sven Oehme)
Date: Sat, 05 Nov 2016 16:17:52 +0000
Subject: Re: [gpfsug-discuss] HAWC and LROC
In-Reply-To:
References:
Message-ID:

Yes and no :)

While Olaf is right that it needs two independent block devices, partitions are just fine. So one could in fact have a 200g SSD as a boot device and partition it, let's say 30g OS, 20g HAWC, 150g LROC.

You have to keep in mind that LROC and HAWC have 2 very different requirements on the 'device'. If you lose HAWC, you lose one copy of critical data (that's why the log needs to be replicated); if you lose LROC, you only lose cached data stored somewhere else. So the recommendation is to use somewhat reliable 'devices' for HAWC, while for LROC it could be simple consumer-grade SSDs. So if you use one device for both, it should be reliable.

Sven

On Sat, Nov 5, 2016, 6:40 AM Olaf Weiser wrote:

> You can use both - HAWC, LROC - on the same node... but you need
> dedicated, independent block devices ...
> In addition, for HAWC you could consider replication and use 2 devices,
> even across 2 nodes. ...
>
> Gesendet von IBM Verse
>
> leslie elliott --- [gpfsug-discuss] HAWC and LROC ---
>
> Von: "leslie elliott"
> An: "gpfsug main discussion list"
> Datum: Sa. 05.11.2016 02:09
> Betreff: [gpfsug-discuss] HAWC and LROC
> ------------------------------
>
> Hi I am curious if anyone has run these together on a client and whether
> it helped
>
> If we wanted to have these functions out at the client to optimise compute
> IO in a couple of special cases
>
> can both exist at the same time on the same nonvolatile hardware or do the
> two functions need independent devices
>
> and what would be the process to disestablish them on the clients as the
> requirement was satisfied
>
> thanks
>
> leslie
>
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> http://gpfsug.org/mailman/listinfo/gpfsug-discuss
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From r.sobey at imperial.ac.uk  Mon Nov  7 11:08:00 2016
From: r.sobey at imperial.ac.uk (Sobey, Richard A)
Date: Mon, 7 Nov 2016 11:08:00 +0000
Subject: [gpfsug-discuss] How to clear stale entries in GUI log
Message-ID:

Hi all

Since upgrading to 4.2.1 and all the in-between work that comes with it, I've got an error (warning) in my GUI with the following:

Event ID: MS8071
Time: 13/09/2016 17:47:00
Message: DISK [disk_down] ICSAN_GPFS_FSD_QUORUM is DEGRADED
Details:
System response: -
Administrator response: Use the command 'mmhealth node show' to get further details.
Component NSD There's no fix for it, and neither has it recognised the problem doesn't exist anymore: [root at quorum ~]# mmhealth node show Node name: icgpfsq1 Node status: HEALTHY Component Status Reasons ------------------------------------------------------------------- GPFS HEALTHY - NETWORK HEALTHY - FILESYSTEM HEALTHY - DISK HEALTHY - GUI HEALTHY - Apart from possibly restarting the GUI services, shouldn't this just go away by itself? Cheers Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From dhildeb at us.ibm.com Mon Nov 7 19:29:27 2016 From: dhildeb at us.ibm.com (Dean Hildebrand) Date: Mon, 7 Nov 2016 11:29:27 -0800 Subject: [gpfsug-discuss] HAWC and LROC In-Reply-To: References: Message-ID: Just adding in that with HAWC, you can also use a shared fast storage device (instead of a node local SSD). So, for example, if you already have your metadata stored in a shared SSD server, then you can just enable HAWC without any additional replication requirements. Dean From: Sven Oehme To: gpfsug main discussion list Date: 11/05/2016 09:18 AM Subject: Re: [gpfsug-discuss] HAWC and LROC Sent by: gpfsug-discuss-bounces at spectrumscale.org Yes and no :) While olaf is right, it needs two independent blockdevices, partitions are just fine. So one could have in fact have a 200g ssd as a boot device and partitions it lets say 30g os 20g hawc 150g lroc you have to keep in mind that lroc and hawc have 2 very different requirements on the 'device'. if you loose hawc, you loose one copy of critical data (that's why the log needs to be replicated), if you loose lroc, you only loose cached data stored somewhere else, so the recommendation is to use soemwhat reliable 'devices' for hawc, while for lroc it could be simple consumer grade ssd's. So if you use one for both, it should be reliable. Sven On Sat, Nov 5, 2016, 6:40 AM Olaf Weiser wrote: You can use both -HAWC ,LROC- on the same node... but you need dedicated ,independent ,block devices ... In addition for hawc, you could consider replication and use 2 devices, even across 2 nodes. ... Gesendet von IBM Verse leslie elliott --- [gpfsug-discuss] HAWC and LROC --- Von: "leslie elliott" An: "gpfsug main discussion list" Datum: Sa. 05.11.2016 02:09 Betreff [gpfsug-discuss] HAWC and LROC : Hi I am curious if anyone has run these together on a client and whether it helped If we wanted to have these functions out at the client to optimise compute IO in a couple of special cases can both exist at the same time on the same nonvolatile hardware or do the two functions need independent devices and what would be the process to disestablish them on the clients as the requirement was satisfied thanks leslie _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From MDIETZ at de.ibm.com Mon Nov 7 20:14:10 2016 From: MDIETZ at de.ibm.com (Mathias Dietz) Date: Mon, 7 Nov 2016 21:14:10 +0100 Subject: [gpfsug-discuss] CES and IP's that disappear In-Reply-To: <451248FD-D116-4C3D-A439-73967C287F6C@siriuscom.com> References: <451248FD-D116-4C3D-A439-73967C287F6C@siriuscom.com> Message-ID: Hi Mark, this sounds like a CES IP failover happened in the background. With Spectrum Scale 4.2.1 you can use the command "mmhealth node eventlog" on the failing node to see if a failover happened and what has triggered the failover. Prior to 4.2.1 use the command "mmces events list" or look into the mmfs.log for errors. Mit freundlichen Gr??en / Kind regards Mathias Dietz Spectrum Scale - Release Lead Architect (4.2.2 Release) System Health and Problem Determination Architect IBM Certified Software Engineer ---------------------------------------------------------------------------------------------------------- IBM Deutschland Hechtsheimer Str. 2 55131 Mainz Phone: +49-6131-84-2027 Mobile: +49-15152801035 E-Mail: mdietz at de.ibm.com ---------------------------------------------------------------------------------------------------------- IBM Deutschland Research & Development GmbH Vorsitzender des Aufsichtsrats: Martina Koederitz, Gesch?ftsf?hrung: Dirk Wittkopp Sitz der Gesellschaft: B?blingen / Registergericht: Amtsgericht Stuttgart, HRB 243294 From: "Mark.Bush at siriuscom.com" To: "gpfsug-discuss at spectrumscale.org" Date: 11/04/2016 05:18 PM Subject: [gpfsug-discuss] CES and IP's that disappear Sent by: gpfsug-discuss-bounces at spectrumscale.org I continue to run into a problem where after I get CES setup properly and the ces-ip addresses show up, I then add an NFS export and all of a sudden the ces-ip?s disappear from the protocol nodes. I have been scouring the problem determination guide but can?t seem to find out what is going on. It makes no sense to me that the IP?s would disappear. Especially after a simple task of just adding an nfs export. I first thought this was just a gui issue but just got done trying it all from the cli and the same thing happens. Has anyone seen anything like this? Mark This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Robert.Oesterlin at nuance.com Mon Nov 7 21:12:50 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Mon, 7 Nov 2016 21:12:50 +0000 Subject: [gpfsug-discuss] SC16: GPFS User Group Meeting location Information Message-ID: <55E57C1F-B20C-4AAE-9FE9-A2D87C9AE61C@nuance.com> IBM Spectrum Scale User Group Meeting - SC16 Please register if you have not done so: https://www-01.ibm.com/events/wwe/grp/grp305.nsf/Registration.xsp?openform&seminar=357M7UES&locale=en_US&auth=anonymous Date: Sunday, November 13th Time: 12:30p - 5:30p - Please be on time! Location: Grand Ballroom Salon F Reception Following: Salon E Salt Lake Marriott Downtown at City Creek 75 S W Temple Salt Lake City, Utah 84101 United States The Salt Lake Marriott Downtown at City Creek is located across the street from the Salt Palace Convention Center. Bob Oesterlin Sr Principal Storage Engineer, Nuance -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Nov 8 13:40:43 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 8 Nov 2016 08:40:43 -0500 Subject: [gpfsug-discuss] subnets confusion Message-ID: All, I have a tricky (at least to me) subnets question. I have 2 NSD Server clusters: Serv1 -> daemon on 10.51 with high speed network on 10.82 Serv2 -> daemon on 10.42 a high speed network and 2 client clusters: Cli1 -> daemon on 10.81 with high speed network on 10.82 Cli2 -> daemon on 10.41 with high speed network on 10.42 Serv1 has the following subnets operand: subnets 10.82.0.0/Serv1;Cli1 10.41.0.0/Cli2 Cli1 has the following subnets subnets 10.82.0.0/Serv1;Cli1 Cli2 has the following subnets subnets 10.51.0.0/Serv1 10.41.0.0/Cli2 10.42.0.0/Serv2 Problem: Sometimes Serv1 will try to contact Cli2 nodes on the 10.42 address which they don't have access to. I get errors like Close connection to 10.42.1 0.1 hs001.cluster.ib (Connection timed out) Cli2 nodes can connect/re-connect to Serv1 once the server cluster kicks them out. Serv1 has Cli2 listed on its 10.41 subnets operand, so I don't fully understand why Serv1 does not use 10.41 to connect Possible Solution?? I think to fix this I either need to add Serv1 to the 10.41 subnet of Cli2 OR move the 10.42 operand on Cli2 to the front of the list. I am working from this link https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General+Parallel+File+System+(GPFS)/page/GPFS+Network+Communication+Overview Please let me know if you need more info. I have tried to strip this down to the bare minimum and in doing so may have left out good details. Thank you, Brian Marshall -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Tue Nov 8 13:56:10 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 8 Nov 2016 13:56:10 +0000 Subject: [gpfsug-discuss] "waiting for exclusive use of connection for sending msg" Message-ID: <24DB6954-F972-4022-9A7C-539E048E0680@nuance.com> This is one of those RPC waiters whose real cause and solution elude me. I know from various sources that it's network congestion related. But the documentation doesn't *really* give me a clue as to where to look next. If the NSD server is running well, with no obvious network issues then this may be a simple matter of network congestion. Any member out there who might know in detail where I should be looking? 
Bob Oesterlin Sr Principal Storage Engineer, Nuance 507-269-0413 -------------- next part -------------- An HTML attachment was scrubbed... URL: From rohwedder at de.ibm.com Tue Nov 8 14:51:02 2016 From: rohwedder at de.ibm.com (Markus Rohwedder) Date: Tue, 8 Nov 2016 15:51:02 +0100 Subject: [gpfsug-discuss] How to clear stale entries in GUI log In-Reply-To: References: Message-ID: Hello, you ran into a defect which is fixed with the upcoming 4.2.1.2 PTF Here is a workaround: You can clear the eventlog of the system health component using mmsysmonc clearDB This is a per node database, so you need to run this on all the nodes which have stale entries. It will clear all the events on this node, if you want to save them run: mmhealth node eventlog > log.save On the GUI node, run systemctl restart gpfsgui afterwards. The mmhealth command suppresses events during startup. So in case a bad condition turns OK during a restart phase, the bad event will remain stale. Regards, Markus Rohwedder IBM Spectrum Scale GUI development -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Tue Nov 8 15:09:51 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 8 Nov 2016 15:09:51 +0000 Subject: [gpfsug-discuss] How to clear stale entries in GUI log In-Reply-To: References: Message-ID: Thanks. I've run that on, I assume, our quorum server where this disk is mounted, but the error is still showing up. The event itself doesn't say which node is affected. ICSAN_GPFS_FSD_QUORUM nsd 512 103 no no ready up system That looks ok to me. Maybe I misunderstood your line "This is a per node database, so you need to run this on all the nodes which have stale entries.". Should I just run it on all the nodes in the cluster instead... there's not many so won't take long but wondering if that's really necessary? Thanks Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Markus Rohwedder Sent: 08 November 2016 14:51 To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] How to clear stale entries in GUI log Hello, you ran into a defect which is fixed with the upcoming 4.2.1.2 PTF Here is a workaround: You can clear the eventlog of the system health component using mmsysmonc clearDB This is a per node database, so you need to run this on all the nodes which have stale entries. It will clear all the events on this node, if you want to save them run: mmhealth node eventlog > log.save On the GUI node, run systemctl restart gpfsgui afterwards. The mmhealth command suppresses events during startup. So in case a bad condition turns OK during a restart phase, the bad event will remain stale. Regards, Markus Rohwedder IBM Spectrum Scale GUI development -------------- next part -------------- An HTML attachment was scrubbed... URL: From andreas.koeninger at de.ibm.com Tue Nov 8 16:49:50 2016 From: andreas.koeninger at de.ibm.com (Andreas Koeninger) Date: Tue, 8 Nov 2016 16:49:50 +0000 Subject: [gpfsug-discuss] How to clear stale entries in GUI log In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... 
URL:

From S.J.Thompson at bham.ac.uk  Wed Nov  9 09:10:47 2016
From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services))
Date: Wed, 9 Nov 2016 09:10:47 +0000
Subject: [gpfsug-discuss] SC16: GPFS User Group Meeting location Information
Message-ID:

Please do sign up and come along to the user group meeting.

Important note: the start time is 12:30pm (it was originally advertised as 1pm on the IBM website).

I was on a planning call yesterday, and I'm very happy to hear that IBM are specifically vetting their slides to ensure they are technical talks (not sales) for the user group session. We also have some great sounding user talks on the agenda.

The programme is available on the Spectrum Scale UG website as well at:
http://www.spectrumscale.org/ssug-at-sc16/

We're all looking forward to seeing you on Sunday

Simon

From: on behalf of "Oesterlin, Robert"
Reply-To: "gpfsug-discuss at spectrumscale.org"
Date: Monday, 7 November 2016 at 21:12
To: "gpfsug-discuss at spectrumscale.org"
Subject: [gpfsug-discuss] SC16: GPFS User Group Meeting location Information

IBM Spectrum Scale User Group Meeting - SC16

Please register if you have not done so:
https://www-01.ibm.com/events/wwe/grp/grp305.nsf/Registration.xsp?openform&seminar=357M7UES&locale=en_US&auth=anonymous

Date: Sunday, November 13th
Time: 12:30p - 5:30p - Please be on time!
Location: Grand Ballroom Salon F
Reception Following: Salon E

Salt Lake Marriott Downtown at City Creek
75 S W Temple
Salt Lake City, Utah 84101 United States

The Salt Lake Marriott Downtown at City Creek is located across the street from the Salt Palace Convention Center.

Bob Oesterlin
Sr Principal Storage Engineer, Nuance

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From pascal+gpfsug at blue-onyx.ch  Wed Nov  9 16:23:06 2016
From: pascal+gpfsug at blue-onyx.ch (Pascal Jermini)
Date: Wed, 9 Nov 2016 17:23:06 +0100
Subject: [gpfsug-discuss] LROC and Spectrum Scale Express
Message-ID: <7d2dcea5-0d28-2357-e5fe-8419aeaaf30b@blue-onyx.ch>

Dear all,

by looking at the documentation it is not clear whether Spectrum Scale Express edition supports the LROC feature. As far as I understand it, the client license is sufficient; however, no word is given about which edition supports that feature.

Any idea and/or pointer?

Many thanks,
Pascal

From jake.carroll at uq.edu.au  Wed Nov  9 17:39:05 2016
From: jake.carroll at uq.edu.au (Jake Carroll)
Date: Wed, 9 Nov 2016 17:39:05 +0000
Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances
Message-ID: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au>

Hi.

I've got a GPFS to GPFS AFM cache/home (IW) relationship set up over a really long distance. About 180ms of latency between the two clusters and around 13,000km of optical path. Fortunately for me, I've actually got near theoretical maximum IO over the NICs between the clusters and I'm iPerf'ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 all the way through.

Anyway - I'm finding my AFM traffic to be dragging its feet and I don't really understand why that might be. I've verified the links and transports ability as I said above with iPerf, and CERN's FDT to near 10Gbit/sec.

I also verified the clusters on both sides in terms of disk IO and they both seem easily capable in IOZone and IOR tests of multiple GB/sec of throughput.

So - my questions:

1. Are there very specific tunings AFM needs for high latency/long distance IO?
2.
Are there very specific NIC/TCP-stack tunings (beyond the type of thing we already have in place) that benefits AFM over really long distances and high latency? 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in the home mount. It sometimes takes 20 to 30 seconds before the command line will report back with a long listing of files. Any ideas why it?d take that long to get a response from ?home?. We?ve got our TCP stack setup fairly aggressively, on all hosts that participate in these two clusters. ethtool -C enp2s0f0 adaptive-rx off ifconfig enp2s0f0 txqueuelen 10000 sysctl -w net.core.rmem_max=536870912 sysctl -w net.core.wmem_max=536870912 sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" sysctl -w net.core.netdev_max_backlog=250000 sysctl -w net.ipv4.tcp_congestion_control=htcp sysctl -w net.ipv4.tcp_mtu_probing=1 I modified a couple of small things on the AFM ?cache? side to see if it?d make a difference such as: mmchconfig afmNumWriteThreads=4 mmchconfig afmNumReadThreads=4 But no difference so far. Thoughts would be appreciated. I?ve done this before over much shorter distances (30Km) and I?ve flattened a 10GbE wire without really tuning?anything. Are my large in-flight-packets numbers/long-time-to-acknowledgement semantics going to hurt here? I really thought AFM might be well designed for exactly this kind of work at long distance *and* high throughput ? so I must be missing something! -jc -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Wed Nov 9 18:05:21 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Wed, 09 Nov 2016 18:05:21 +0000 Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances In-Reply-To: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au> References: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au> Message-ID: Mostly curious, don't have experience in such environments, but ... Is this AFM over NFS or NSD protocol? Might be interesting to try the other option -- and also check how nsdperf performs over such distance/latency. -jf ons. 9. nov. 2016 kl. 18.39 skrev Jake Carroll : > Hi. > > > > I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a > really long distance. About 180ms of latency between the two clusters and > around 13,000km of optical path. Fortunately for me, I?ve actually got near > theoretical maximum IO over the NIC?s between the clusters and I?m > iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 > all the way through. > > > > Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t > really understand why that might be. I?ve verified the links and transports > ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. > > > > I also verified the clusters on both sides in terms of disk IO and they > both seem easily capable in IOZone and IOR tests of multiple GB/sec of > throughput. > > > > So ? my questions: > > > > 1. Are there very specific tunings AFM needs for high latency/long > distance IO? > > 2. Are there very specific NIC/TCP-stack tunings (beyond the type > of thing we already have in place) that benefits AFM over really long > distances and high latency? > > 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? > in the home mount. It sometimes takes 20 to 30 seconds before the command > line will report back with a long listing of files. 
Any ideas why it?d take > that long to get a response from ?home?. > > > > We?ve got our TCP stack setup fairly aggressively, on all hosts that > participate in these two clusters. > > > > ethtool -C enp2s0f0 adaptive-rx off > > ifconfig enp2s0f0 txqueuelen 10000 > > sysctl -w net.core.rmem_max=536870912 > > sysctl -w net.core.wmem_max=536870912 > > sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" > > sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" > > sysctl -w net.core.netdev_max_backlog=250000 > > sysctl -w net.ipv4.tcp_congestion_control=htcp > > sysctl -w net.ipv4.tcp_mtu_probing=1 > > > > I modified a couple of small things on the AFM ?cache? side to see if it?d > make a difference such as: > > > > mmchconfig afmNumWriteThreads=4 > > mmchconfig afmNumReadThreads=4 > > > > But no difference so far. > > > > Thoughts would be appreciated. I?ve done this before over much shorter > distances (30Km) and I?ve flattened a 10GbE wire without really > tuning?anything. Are my large in-flight-packets > numbers/long-time-to-acknowledgement semantics going to hurt here? I really > thought AFM might be well designed for exactly this kind of work at long > distance **and** high throughput ? so I must be missing something! > > > > -jc > > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sfadden at us.ibm.com Wed Nov 9 18:08:42 2016 From: sfadden at us.ibm.com (Scott Fadden) Date: Wed, 9 Nov 2016 10:08:42 -0800 Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances In-Reply-To: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au> References: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au> Message-ID: Jake, If AFM is using NFS it is all about NFS tuning. The copy from one side to the other is basically just a client writing to an NFS mount. Thee are a few things you can look at: 1. NFS Transfer size (Make is 1MiB, I think that is the max) 2. TCP Tuning for large window size. This is discussed on Tuning active file management home communications in the docs. On this page you will find some discussion on increasing gateway threads, and other things similar that may help as well. We can discuss further as I understand we will be meeting at SC16. Scott Fadden Spectrum Scale - Technical Marketing Phone: (503) 880-5833 sfadden at us.ibm.com http://www.ibm.com/systems/storage/spectrum/scale From: Jake Carroll To: "gpfsug-discuss at spectrumscale.org" Date: 11/09/2016 09:39 AM Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi. I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a really long distance. About 180ms of latency between the two clusters and around 13,000km of optical path. Fortunately for me, I?ve actually got near theoretical maximum IO over the NIC?s between the clusters and I?m iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 all the way through. Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t really understand why that might be. I?ve verified the links and transports ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. 
I also verified the clusters on both sides in terms of disk IO and they both seem easily capable in IOZone and IOR tests of multiple GB/sec of throughput. So ? my questions: 1. Are there very specific tunings AFM needs for high latency/long distance IO? 2. Are there very specific NIC/TCP-stack tunings (beyond the type of thing we already have in place) that benefits AFM over really long distances and high latency? 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in the home mount. It sometimes takes 20 to 30 seconds before the command line will report back with a long listing of files. Any ideas why it?d take that long to get a response from ?home?. We?ve got our TCP stack setup fairly aggressively, on all hosts that participate in these two clusters. ethtool -C enp2s0f0 adaptive-rx off ifconfig enp2s0f0 txqueuelen 10000 sysctl -w net.core.rmem_max=536870912 sysctl -w net.core.wmem_max=536870912 sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" sysctl -w net.core.netdev_max_backlog=250000 sysctl -w net.ipv4.tcp_congestion_control=htcp sysctl -w net.ipv4.tcp_mtu_probing=1 I modified a couple of small things on the AFM ?cache? side to see if it?d make a difference such as: mmchconfig afmNumWriteThreads=4 mmchconfig afmNumReadThreads=4 But no difference so far. Thoughts would be appreciated. I?ve done this before over much shorter distances (30Km) and I?ve flattened a 10GbE wire without really tuning?anything. Are my large in-flight-packets numbers/long-time-to-acknowledgement semantics going to hurt here? I really thought AFM might be well designed for exactly this kind of work at long distance *and* high throughput ? so I must be missing something! -jc _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From jake.carroll at uq.edu.au Wed Nov 9 18:09:14 2016 From: jake.carroll at uq.edu.au (Jake Carroll) Date: Wed, 9 Nov 2016 18:09:14 +0000 Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances (Jan-Frode Myklebust) Message-ID: <5D327C63-84EC-4F59-86E7-158308E91013@uq.edu.au> Hi jf? >> Mostly curious, don't have experience in such environments, but ... Is this AFM over NFS or NSD protocol? Might be interesting to try the other option -- and also check how nsdperf performs over such distance/latency. As it turns out, it seems, very few people do. I will test nsdperf over it and see how it performs. And yes, it is AFM ? AFM. No NFS involved here! -jc ------------------------------ Message: 2 Date: Wed, 9 Nov 2016 17:39:05 +0000 From: Jake Carroll To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances Message-ID: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0 at uq.edu.au> Content-Type: text/plain; charset="utf-8" Hi. I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a really long distance. About 180ms of latency between the two clusters and around 13,000km of optical path. Fortunately for me, I?ve actually got near theoretical maximum IO over the NIC?s between the clusters and I?m iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 all the way through. Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t really understand why that might be. 
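For reference, the sort of first nsdperf pass I have in mind is sketched below. This is only a rough outline: nsdperf ships as source under /usr/lpp/mmfs/samples/net, the build line and interactive subcommand names are from memory of the comments and help in nsdperf.C, and the hostnames, thread count and socket size are placeholders rather than recommendations. The motivation is the bandwidth-delay product: at roughly 10Gbit/sec (about 1.25GB/sec) and 180ms RTT there is on the order of 220MB in flight, so the interesting question is whether a small number of TCP connections can open windows that large between them.

cd /usr/lpp/mmfs/samples/net
g++ -O2 -o nsdperf -lpthread -lrt nsdperf.C   # assumed build line; check the header of nsdperf.C
./nsdperf -s                                  # start in server mode on the far-side test node(s)
./nsdperf                                     # on a near-side node, then at the prompt:
  server far-node01                           # placeholder hostnames
  client near-node01
  ttime 60
  threads 8
  socksize 268435456                          # illustrative; bounded by net.core.rmem_max/wmem_max
  test
  quit

If nsdperf can fill the pipe over this latency but AFM still cannot, that points at the AFM/gateway layer rather than the network itself.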
I?ve verified the links and transports ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. I also verified the clusters on both sides in terms of disk IO and they both seem easily capable in IOZone and IOR tests of multiple GB/sec of throughput. So ? my questions: 1. Are there very specific tunings AFM needs for high latency/long distance IO? 2. Are there very specific NIC/TCP-stack tunings (beyond the type of thing we already have in place) that benefits AFM over really long distances and high latency? 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in the home mount. It sometimes takes 20 to 30 seconds before the command line will report back with a long listing of files. Any ideas why it?d take that long to get a response from ?home?. We?ve got our TCP stack setup fairly aggressively, on all hosts that participate in these two clusters. ethtool -C enp2s0f0 adaptive-rx off ifconfig enp2s0f0 txqueuelen 10000 sysctl -w net.core.rmem_max=536870912 sysctl -w net.core.wmem_max=536870912 sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" sysctl -w net.core.netdev_max_backlog=250000 sysctl -w net.ipv4.tcp_congestion_control=htcp sysctl -w net.ipv4.tcp_mtu_probing=1 I modified a couple of small things on the AFM ?cache? side to see if it?d make a difference such as: mmchconfig afmNumWriteThreads=4 mmchconfig afmNumReadThreads=4 But no difference so far. Thoughts would be appreciated. I?ve done this before over much shorter distances (30Km) and I?ve flattened a 10GbE wire without really tuning?anything. Are my large in-flight-packets numbers/long-time-to-acknowledgement semantics going to hurt here? I really thought AFM might be well designed for exactly this kind of work at long distance *and* high throughput ? so I must be missing something! -jc -------------- next part -------------- An HTML attachment was scrubbed... URL: ------------------------------ Message: 3 Date: Wed, 09 Nov 2016 18:05:21 +0000 From: Jan-Frode Myklebust To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances Message-ID: Content-Type: text/plain; charset="utf-8" Mostly curious, don't have experience in such environments, but ... Is this AFM over NFS or NSD protocol? Might be interesting to try the other option -- and also check how nsdperf performs over such distance/latency. -jf ons. 9. nov. 2016 kl. 18.39 skrev Jake Carroll : > Hi. > > > > I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a > really long distance. About 180ms of latency between the two clusters and > around 13,000km of optical path. Fortunately for me, I?ve actually got near > theoretical maximum IO over the NIC?s between the clusters and I?m > iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 > all the way through. > > > > Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t > really understand why that might be. I?ve verified the links and transports > ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. > > > > I also verified the clusters on both sides in terms of disk IO and they > both seem easily capable in IOZone and IOR tests of multiple GB/sec of > throughput. > > > > So ? my questions: > > > > 1. Are there very specific tunings AFM needs for high latency/long > distance IO? > > 2. 
Are there very specific NIC/TCP-stack tunings (beyond the type > of thing we already have in place) that benefits AFM over really long > distances and high latency? > > 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? > in the home mount. It sometimes takes 20 to 30 seconds before the command > line will report back with a long listing of files. Any ideas why it?d take > that long to get a response from ?home?. > > > > We?ve got our TCP stack setup fairly aggressively, on all hosts that > participate in these two clusters. > > > > ethtool -C enp2s0f0 adaptive-rx off > > ifconfig enp2s0f0 txqueuelen 10000 > > sysctl -w net.core.rmem_max=536870912 > > sysctl -w net.core.wmem_max=536870912 > > sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" > > sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" > > sysctl -w net.core.netdev_max_backlog=250000 > > sysctl -w net.ipv4.tcp_congestion_control=htcp > > sysctl -w net.ipv4.tcp_mtu_probing=1 > > > > I modified a couple of small things on the AFM ?cache? side to see if it?d > make a difference such as: > > > > mmchconfig afmNumWriteThreads=4 > > mmchconfig afmNumReadThreads=4 > > > > But no difference so far. > > > > Thoughts would be appreciated. I?ve done this before over much shorter > distances (30Km) and I?ve flattened a 10GbE wire without really > tuning?anything. Are my large in-flight-packets > numbers/long-time-to-acknowledgement semantics going to hurt here? I really > thought AFM might be well designed for exactly this kind of work at long > distance **and** high throughput ? so I must be missing something! > > > > -jc > > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 58, Issue 12 ********************************************** From sfadden at us.ibm.com Wed Nov 9 18:24:15 2016 From: sfadden at us.ibm.com (Scott Fadden) Date: Wed, 9 Nov 2016 10:24:15 -0800 Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances (Jan-Frode Myklebust) In-Reply-To: <5D327C63-84EC-4F59-86E7-158308E91013@uq.edu.au> References: <5D327C63-84EC-4F59-86E7-158308E91013@uq.edu.au> Message-ID: So you are using the NSD protocol for data transfers over multi-cluster? If so the TCP and thread tuning should help as well. Scott Fadden Spectrum Scale - Technical Marketing Phone: (503) 880-5833 sfadden at us.ibm.com http://www.ibm.com/systems/storage/spectrum/scale From: Jake Carroll To: "gpfsug-discuss at spectrumscale.org" Date: 11/09/2016 10:09 AM Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances (Jan-Frode Myklebust) Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi jf? >> Mostly curious, don't have experience in such environments, but ... Is this AFM over NFS or NSD protocol? Might be interesting to try the other option -- and also check how nsdperf performs over such distance/latency. As it turns out, it seems, very few people do. I will test nsdperf over it and see how it performs. And yes, it is AFM ? AFM. No NFS involved here! 
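To make the "thread tuning" part of that concrete, the knobs that usually come up for a link with this much bandwidth-delay product are the per-fileset flush threads and the parallel data transfer settings, roughly along these lines (a sketch only: parameter names as in the AFM tuning documentation, the file system and fileset names are placeholders, and the values are illustrative rather than recommendations -- check the documented defaults and units for your release):

mmchfileset cachefs afmcache01 -p afmNumFlushThreads=32
mmchconfig afmParallelWriteThreshold=1024 -i
mmchconfig afmParallelWriteChunkSize=134217728 -i
mmchconfig afmParallelReadThreshold=1024 -i
mmchconfig afmParallelReadChunkSize=134217728 -i

Parallel transfers only help once more than one gateway node is available to the fileset and the files are large enough to cross the thresholds, so this is something to measure rather than assume.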
-jc ------------------------------ Message: 2 Date: Wed, 9 Nov 2016 17:39:05 +0000 From: Jake Carroll To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances Message-ID: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0 at uq.edu.au> Content-Type: text/plain; charset="utf-8" Hi. I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a really long distance. About 180ms of latency between the two clusters and around 13,000km of optical path. Fortunately for me, I?ve actually got near theoretical maximum IO over the NIC?s between the clusters and I?m iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 all the way through. Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t really understand why that might be. I?ve verified the links and transports ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. I also verified the clusters on both sides in terms of disk IO and they both seem easily capable in IOZone and IOR tests of multiple GB/sec of throughput. So ? my questions: 1. Are there very specific tunings AFM needs for high latency/long distance IO? 2. Are there very specific NIC/TCP-stack tunings (beyond the type of thing we already have in place) that benefits AFM over really long distances and high latency? 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in the home mount. It sometimes takes 20 to 30 seconds before the command line will report back with a long listing of files. Any ideas why it?d take that long to get a response from ?home?. We?ve got our TCP stack setup fairly aggressively, on all hosts that participate in these two clusters. ethtool -C enp2s0f0 adaptive-rx off ifconfig enp2s0f0 txqueuelen 10000 sysctl -w net.core.rmem_max=536870912 sysctl -w net.core.wmem_max=536870912 sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" sysctl -w net.core.netdev_max_backlog=250000 sysctl -w net.ipv4.tcp_congestion_control=htcp sysctl -w net.ipv4.tcp_mtu_probing=1 I modified a couple of small things on the AFM ?cache? side to see if it?d make a difference such as: mmchconfig afmNumWriteThreads=4 mmchconfig afmNumReadThreads=4 But no difference so far. Thoughts would be appreciated. I?ve done this before over much shorter distances (30Km) and I?ve flattened a 10GbE wire without really tuning?anything. Are my large in-flight-packets numbers/long-time-to-acknowledgement semantics going to hurt here? I really thought AFM might be well designed for exactly this kind of work at long distance *and* high throughput ? so I must be missing something! -jc -------------- next part -------------- An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20161109/d4f4d9a7/attachment-0001.html > ------------------------------ Message: 3 Date: Wed, 09 Nov 2016 18:05:21 +0000 From: Jan-Frode Myklebust To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances Message-ID: Content-Type: text/plain; charset="utf-8" Mostly curious, don't have experience in such environments, but ... Is this AFM over NFS or NSD protocol? Might be interesting to try the other option -- and also check how nsdperf performs over such distance/latency. -jf ons. 9. nov. 2016 kl. 18.39 skrev Jake Carroll : > Hi. 
> > > > I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a > really long distance. About 180ms of latency between the two clusters and > around 13,000km of optical path. Fortunately for me, I?ve actually got near > theoretical maximum IO over the NIC?s between the clusters and I?m > iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 > all the way through. > > > > Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t > really understand why that might be. I?ve verified the links and transports > ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. > > > > I also verified the clusters on both sides in terms of disk IO and they > both seem easily capable in IOZone and IOR tests of multiple GB/sec of > throughput. > > > > So ? my questions: > > > > 1. Are there very specific tunings AFM needs for high latency/long > distance IO? > > 2. Are there very specific NIC/TCP-stack tunings (beyond the type > of thing we already have in place) that benefits AFM over really long > distances and high latency? > > 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? > in the home mount. It sometimes takes 20 to 30 seconds before the command > line will report back with a long listing of files. Any ideas why it?d take > that long to get a response from ?home?. > > > > We?ve got our TCP stack setup fairly aggressively, on all hosts that > participate in these two clusters. > > > > ethtool -C enp2s0f0 adaptive-rx off > > ifconfig enp2s0f0 txqueuelen 10000 > > sysctl -w net.core.rmem_max=536870912 > > sysctl -w net.core.wmem_max=536870912 > > sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" > > sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" > > sysctl -w net.core.netdev_max_backlog=250000 > > sysctl -w net.ipv4.tcp_congestion_control=htcp > > sysctl -w net.ipv4.tcp_mtu_probing=1 > > > > I modified a couple of small things on the AFM ?cache? side to see if it?d > make a difference such as: > > > > mmchconfig afmNumWriteThreads=4 > > mmchconfig afmNumReadThreads=4 > > > > But no difference so far. > > > > Thoughts would be appreciated. I?ve done this before over much shorter > distances (30Km) and I?ve flattened a 10GbE wire without really > tuning?anything. Are my large in-flight-packets > numbers/long-time-to-acknowledgement semantics going to hurt here? I really > thought AFM might be well designed for exactly this kind of work at long > distance **and** high throughput ? so I must be missing something! > > > > -jc > > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20161109/f44369ab/attachment.html > ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 58, Issue 12 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jake.carroll at uq.edu.au Wed Nov 9 18:27:50 2016 From: jake.carroll at uq.edu.au (Jake Carroll) Date: Wed, 9 Nov 2016 18:27:50 +0000 Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long In-Reply-To: References: Message-ID: <88B892F7-75AA-4881-B1E3-DDC7500456CD@uq.edu.au> Scott, Nar, very much pure AFM to AFM here, hence we are a little surprised. Last time we did this over a longish link we almost caused an outage with the ease at which we attained throughput - but maybe there are some magic tolerances we are hitting in latency and in flight IO semantics that SS/GPFS/AFM is not well tweaked for (yet...)... Yes - we are catching up at SC. I think it's all been arranged? We are also talking to one of your resources about this AFM throughput behaviour this afternoon. John I believe his name is? Anyway - if you've got any ideas, am all ears! > > > Today's Topics: > > 1. Re: Tuning AFM for high throughput/high IO over _really_ long > distances (Scott Fadden) > 2. Re: Tuning AFM for high throughput/high IO over _really_ long > distances (Jan-Frode Myklebust) (Jake Carroll) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Wed, 9 Nov 2016 10:08:42 -0800 > From: "Scott Fadden" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances > Message-ID: > > > Content-Type: text/plain; charset="utf-8" > > Jake, > > If AFM is using NFS it is all about NFS tuning. The copy from one side to > the other is basically just a client writing to an NFS mount. Thee are a > few things you can look at: > 1. NFS Transfer size (Make is 1MiB, I think that is the max) > 2. TCP Tuning for large window size. This is discussed on Tuning active > file management home communications in the docs. On this page you will > find some discussion on increasing gateway threads, and other things > similar that may help as well. > > We can discuss further as I understand we will be meeting at SC16. > > Scott Fadden > Spectrum Scale - Technical Marketing > Phone: (503) 880-5833 > sfadden at us.ibm.com > http://www.ibm.com/systems/storage/spectrum/scale > > > > From: Jake Carroll > To: "gpfsug-discuss at spectrumscale.org" > > Date: 11/09/2016 09:39 AM > Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Hi. > > I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a > really long distance. About 180ms of latency between the two clusters and > around 13,000km of optical path. Fortunately for me, I?ve actually got > near theoretical maximum IO over the NIC?s between the clusters and I?m > iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 > all the way through. > > Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t > really understand why that might be. I?ve verified the links and > transports ability as I said above with iPerf, and CERN?s FDT to near > 10Gbit/sec. > > I also verified the clusters on both sides in terms of disk IO and they > both seem easily capable in IOZone and IOR tests of multiple GB/sec of > throughput. > > So ? my questions: > > 1. Are there very specific tunings AFM needs for high latency/long > distance IO? > 2. Are there very specific NIC/TCP-stack tunings (beyond the type of > thing we already have in place) that benefits AFM over really long > distances and high latency? > 3. 
We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in > the home mount. It sometimes takes 20 to 30 seconds before the command > line will report back with a long listing of files. Any ideas why it?d > take that long to get a response from ?home?. > > We?ve got our TCP stack setup fairly aggressively, on all hosts that > participate in these two clusters. > > ethtool -C enp2s0f0 adaptive-rx off > ifconfig enp2s0f0 txqueuelen 10000 > sysctl -w net.core.rmem_max=536870912 > sysctl -w net.core.wmem_max=536870912 > sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" > sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" > sysctl -w net.core.netdev_max_backlog=250000 > sysctl -w net.ipv4.tcp_congestion_control=htcp > sysctl -w net.ipv4.tcp_mtu_probing=1 > > I modified a couple of small things on the AFM ?cache? side to see if it?d > make a difference such as: > > mmchconfig afmNumWriteThreads=4 > mmchconfig afmNumReadThreads=4 > > But no difference so far. > > Thoughts would be appreciated. I?ve done this before over much shorter > distances (30Km) and I?ve flattened a 10GbE wire without really > tuning?anything. Are my large in-flight-packets > numbers/long-time-to-acknowledgement semantics going to hurt here? I > really thought AFM might be well designed for exactly this kind of work at > long distance *and* high throughput ? so I must be missing something! > > -jc > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > > ------------------------------ > > Message: 2 > Date: Wed, 9 Nov 2016 18:09:14 +0000 > From: Jake Carroll > To: "gpfsug-discuss at spectrumscale.org" > > Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances (Jan-Frode Myklebust) > Message-ID: <5D327C63-84EC-4F59-86E7-158308E91013 at uq.edu.au> > Content-Type: text/plain; charset="utf-8" > > Hi jf? > > >>> Mostly curious, don't have experience in such environments, but ... Is this > AFM over NFS or NSD protocol? Might be interesting to try the other option > -- and also check how nsdperf performs over such distance/latency. > > As it turns out, it seems, very few people do. > > I will test nsdperf over it and see how it performs. And yes, it is AFM ? AFM. No NFS involved here! > > -jc > > > > ------------------------------ > > Message: 2 > Date: Wed, 9 Nov 2016 17:39:05 +0000 > From: Jake Carroll > To: "gpfsug-discuss at spectrumscale.org" > > Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over > _really_ long distances > Message-ID: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0 at uq.edu.au> > Content-Type: text/plain; charset="utf-8" > > Hi. > > I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a really long distance. About 180ms of latency between the two clusters and around 13,000km of optical path. Fortunately for me, I?ve actually got near theoretical maximum IO over the NIC?s between the clusters and I?m iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 all the way through. > > Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t really understand why that might be. I?ve verified the links and transports ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. 
> > I also verified the clusters on both sides in terms of disk IO and they both seem easily capable in IOZone and IOR tests of multiple GB/sec of throughput. > > So ? my questions: > > > 1. Are there very specific tunings AFM needs for high latency/long distance IO? > > 2. Are there very specific NIC/TCP-stack tunings (beyond the type of thing we already have in place) that benefits AFM over really long distances and high latency? > > 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in the home mount. It sometimes takes 20 to 30 seconds before the command line will report back with a long listing of files. Any ideas why it?d take that long to get a response from ?home?. > > We?ve got our TCP stack setup fairly aggressively, on all hosts that participate in these two clusters. > > ethtool -C enp2s0f0 adaptive-rx off > ifconfig enp2s0f0 txqueuelen 10000 > sysctl -w net.core.rmem_max=536870912 > sysctl -w net.core.wmem_max=536870912 > sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" > sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" > sysctl -w net.core.netdev_max_backlog=250000 > sysctl -w net.ipv4.tcp_congestion_control=htcp > sysctl -w net.ipv4.tcp_mtu_probing=1 > > I modified a couple of small things on the AFM ?cache? side to see if it?d make a difference such as: > > mmchconfig afmNumWriteThreads=4 > mmchconfig afmNumReadThreads=4 > > But no difference so far. > > Thoughts would be appreciated. I?ve done this before over much shorter distances (30Km) and I?ve flattened a 10GbE wire without really tuning?anything. Are my large in-flight-packets numbers/long-time-to-acknowledgement semantics going to hurt here? I really thought AFM might be well designed for exactly this kind of work at long distance *and* high throughput ? so I must be missing something! > > -jc > > > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > > ------------------------------ > > Message: 3 > Date: Wed, 09 Nov 2016 18:05:21 +0000 > From: Jan-Frode Myklebust > To: "gpfsug-discuss at spectrumscale.org" > > Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances > Message-ID: > > Content-Type: text/plain; charset="utf-8" > > Mostly curious, don't have experience in such environments, but ... Is this > AFM over NFS or NSD protocol? Might be interesting to try the other option > -- and also check how nsdperf performs over such distance/latency. > > > > -jf >> ons. 9. nov. 2016 kl. 18.39 skrev Jake Carroll : >> >> Hi. >> >> >> >> I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a >> really long distance. About 180ms of latency between the two clusters and >> around 13,000km of optical path. Fortunately for me, I?ve actually got near >> theoretical maximum IO over the NIC?s between the clusters and I?m >> iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 >> all the way through. >> >> >> >> Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t >> really understand why that might be. I?ve verified the links and transports >> ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. >> >> >> >> I also verified the clusters on both sides in terms of disk IO and they >> both seem easily capable in IOZone and IOR tests of multiple GB/sec of >> throughput. >> >> >> >> So ? my questions: >> >> >> >> 1. Are there very specific tunings AFM needs for high latency/long >> distance IO? >> >> 2. 
Are there very specific NIC/TCP-stack tunings (beyond the type >> of thing we already have in place) that benefits AFM over really long >> distances and high latency? >> >> 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? >> in the home mount. It sometimes takes 20 to 30 seconds before the command >> line will report back with a long listing of files. Any ideas why it?d take >> that long to get a response from ?home?. >> >> >> >> We?ve got our TCP stack setup fairly aggressively, on all hosts that >> participate in these two clusters. >> >> >> >> ethtool -C enp2s0f0 adaptive-rx off >> >> ifconfig enp2s0f0 txqueuelen 10000 >> >> sysctl -w net.core.rmem_max=536870912 >> >> sysctl -w net.core.wmem_max=536870912 >> >> sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" >> >> sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" >> >> sysctl -w net.core.netdev_max_backlog=250000 >> >> sysctl -w net.ipv4.tcp_congestion_control=htcp >> >> sysctl -w net.ipv4.tcp_mtu_probing=1 >> >> >> >> I modified a couple of small things on the AFM ?cache? side to see if it?d >> make a difference such as: >> >> >> >> mmchconfig afmNumWriteThreads=4 >> >> mmchconfig afmNumReadThreads=4 >> >> >> >> But no difference so far. >> >> >> >> Thoughts would be appreciated. I?ve done this before over much shorter >> distances (30Km) and I?ve flattened a 10GbE wire without really >> tuning?anything. Are my large in-flight-packets >> numbers/long-time-to-acknowledgement semantics going to hurt here? I really >> thought AFM might be well designed for exactly this kind of work at long >> distance **and** high throughput ? so I must be missing something! >> >> >> >> -jc >> >> >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 58, Issue 12 > ********************************************** > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 58, Issue 13 > ********************************************** From olaf.weiser at de.ibm.com Wed Nov 9 20:53:11 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 9 Nov 2016 21:53:11 +0100 Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances In-Reply-To: References: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au> Message-ID: An HTML attachment was scrubbed... URL: From kdball at us.ibm.com Wed Nov 9 22:03:04 2016 From: kdball at us.ibm.com (Keith D Ball) Date: Wed, 9 Nov 2016 22:03:04 +0000 Subject: [gpfsug-discuss] LROC and Spectrum Scale Express In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... 
URL: From dhildeb at us.ibm.com Wed Nov 9 22:28:21 2016 From: dhildeb at us.ibm.com (Dean Hildebrand) Date: Wed, 9 Nov 2016 14:28:21 -0800 Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long In-Reply-To: <88B892F7-75AA-4881-B1E3-DDC7500456CD@uq.edu.au> References: <88B892F7-75AA-4881-B1E3-DDC7500456CD@uq.edu.au> Message-ID: Hi Jake, I would tackle this programmatically: a) Mount the remote FS directly from a GPFS client (using multi-cluster) and evaluate the performance using GPFS directly. The key factors affecting performance here will be - number of nsd servers at remote site (home site): The client creates a new TCP connection to each NSD server. The TCP connections will slowly expand their send/receive window as more data is read/written. The more TCP connections the better since it allows multiple windows to better fill the pipe. Note that TCP windows close quickly when no data is sent, and must resume tcp slow start on each benchmark run. - size of tcp buffers that gpfs is using: You want to be able to have the sum of the TCP windows fill the pipe. - type of workload you are running: Small files incur the full latency, heavy metadata incurs the full latency, but large files allow tcp slow start to expand the tcp window to the full size and fill the pipe. b) Once this is done, then move to AFM. Note that with writes the only way to evaluate performance is to monitor the network B/W on the GW. With reads, note that the GW writes data to storage before clients read it off the local disk. So you can either monitor read network B/W on the GW, or run your read tests directly on the GW (since then data is delivered to the application benchmark directly from the pagepool). Also note that AFM writes data in at most (by default) 1GB chunks. This can be increased with afmMaxWriteMergeLen, but be careful since if the network fails in the middle of the write to the home, it may need to restart from the beginning when the network connection is fixed. c) Using multiple GWs with AFM parallel I/O. This allows more available nodes the opportunity to create more TCP connections, all giving a better chance to fill the pipe. The additional TCP connections also mitigate the impact of packet loss or delay since it will only affect one of the connections. Playing with chunk size here can be useful. Dean From: Jake Carroll To: "gpfsug-discuss at spectrumscale.org" Date: 11/09/2016 10:28 AM Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long Sent by: gpfsug-discuss-bounces at spectrumscale.org Scott, Nar, very much pure AFM to AFM here, hence we are a little surprised. Last time we did this over a longish link we almost caused an outage with the ease at which we attained throughput - but maybe there are some magic tolerances we are hitting in latency and in flight IO semantics that SS/GPFS/AFM is not well tweaked for (yet...)... Yes - we are catching up at SC. I think it's all been arranged? We are also talking to one of your resources about this AFM throughput behaviour this afternoon. John I believe his name is? Anyway - if you've got any ideas, am all ears! > > > Today's Topics: > > 1. Re: Tuning AFM for high throughput/high IO over _really_ long > distances (Scott Fadden) > 2.
Re: Tuning AFM for high throughput/high IO over _really_ long > distances (Jan-Frode Myklebust) (Jake Carroll) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Wed, 9 Nov 2016 10:08:42 -0800 > From: "Scott Fadden" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances > Message-ID: > > > Content-Type: text/plain; charset="utf-8" > > Jake, > > If AFM is using NFS it is all about NFS tuning. The copy from one side to > the other is basically just a client writing to an NFS mount. Thee are a > few things you can look at: > 1. NFS Transfer size (Make is 1MiB, I think that is the max) > 2. TCP Tuning for large window size. This is discussed on Tuning active > file management home communications in the docs. On this page you will > find some discussion on increasing gateway threads, and other things > similar that may help as well. > > We can discuss further as I understand we will be meeting at SC16. > > Scott Fadden > Spectrum Scale - Technical Marketing > Phone: (503) 880-5833 > sfadden at us.ibm.com > http://www.ibm.com/systems/storage/spectrum/scale > > > > From: Jake Carroll > To: "gpfsug-discuss at spectrumscale.org" > > Date: 11/09/2016 09:39 AM > Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Hi. > > I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a > really long distance. About 180ms of latency between the two clusters and > around 13,000km of optical path. Fortunately for me, I?ve actually got > near theoretical maximum IO over the NIC?s between the clusters and I?m > iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 > all the way through. > > Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t > really understand why that might be. I?ve verified the links and > transports ability as I said above with iPerf, and CERN?s FDT to near > 10Gbit/sec. > > I also verified the clusters on both sides in terms of disk IO and they > both seem easily capable in IOZone and IOR tests of multiple GB/sec of > throughput. > > So ? my questions: > > 1. Are there very specific tunings AFM needs for high latency/long > distance IO? > 2. Are there very specific NIC/TCP-stack tunings (beyond the type of > thing we already have in place) that benefits AFM over really long > distances and high latency? > 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in > the home mount. It sometimes takes 20 to 30 seconds before the command > line will report back with a long listing of files. Any ideas why it?d > take that long to get a response from ?home?. > > We?ve got our TCP stack setup fairly aggressively, on all hosts that > participate in these two clusters. > > ethtool -C enp2s0f0 adaptive-rx off > ifconfig enp2s0f0 txqueuelen 10000 > sysctl -w net.core.rmem_max=536870912 > sysctl -w net.core.wmem_max=536870912 > sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" > sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" > sysctl -w net.core.netdev_max_backlog=250000 > sysctl -w net.ipv4.tcp_congestion_control=htcp > sysctl -w net.ipv4.tcp_mtu_probing=1 > > I modified a couple of small things on the AFM ?cache? side to see if it?d > make a difference such as: > > mmchconfig afmNumWriteThreads=4 > mmchconfig afmNumReadThreads=4 > > But no difference so far. > > Thoughts would be appreciated. 
I?ve done this before over much shorter > distances (30Km) and I?ve flattened a 10GbE wire without really > tuning?anything. Are my large in-flight-packets > numbers/long-time-to-acknowledgement semantics going to hurt here? I > really thought AFM might be well designed for exactly this kind of work at > long distance *and* high throughput ? so I must be missing something! > > -jc > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20161109/c775cf5a/attachment-0001.html > > > ------------------------------ > > Message: 2 > Date: Wed, 9 Nov 2016 18:09:14 +0000 > From: Jake Carroll > To: "gpfsug-discuss at spectrumscale.org" > > Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances (Jan-Frode Myklebust) > Message-ID: <5D327C63-84EC-4F59-86E7-158308E91013 at uq.edu.au> > Content-Type: text/plain; charset="utf-8" > > Hi jf? > > >>> Mostly curious, don't have experience in such environments, but ... Is this > AFM over NFS or NSD protocol? Might be interesting to try the other option > -- and also check how nsdperf performs over such distance/latency. > > As it turns out, it seems, very few people do. > > I will test nsdperf over it and see how it performs. And yes, it is AFM ? AFM. No NFS involved here! > > -jc > > > > ------------------------------ > > Message: 2 > Date: Wed, 9 Nov 2016 17:39:05 +0000 > From: Jake Carroll > To: "gpfsug-discuss at spectrumscale.org" > > Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over > _really_ long distances > Message-ID: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0 at uq.edu.au> > Content-Type: text/plain; charset="utf-8" > > Hi. > > I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a really long distance. About 180ms of latency between the two clusters and around 13,000km of optical path. Fortunately for me, I?ve actually got near theoretical maximum IO over the NIC?s between the clusters and I?m iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 all the way through. > > Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t really understand why that might be. I?ve verified the links and transports ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. > > I also verified the clusters on both sides in terms of disk IO and they both seem easily capable in IOZone and IOR tests of multiple GB/sec of throughput. > > So ? my questions: > > > 1. Are there very specific tunings AFM needs for high latency/long distance IO? > > 2. Are there very specific NIC/TCP-stack tunings (beyond the type of thing we already have in place) that benefits AFM over really long distances and high latency? > > 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? in the home mount. It sometimes takes 20 to 30 seconds before the command line will report back with a long listing of files. Any ideas why it?d take that long to get a response from ?home?. > > We?ve got our TCP stack setup fairly aggressively, on all hosts that participate in these two clusters. 
> > ethtool -C enp2s0f0 adaptive-rx off > ifconfig enp2s0f0 txqueuelen 10000 > sysctl -w net.core.rmem_max=536870912 > sysctl -w net.core.wmem_max=536870912 > sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" > sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" > sysctl -w net.core.netdev_max_backlog=250000 > sysctl -w net.ipv4.tcp_congestion_control=htcp > sysctl -w net.ipv4.tcp_mtu_probing=1 > > I modified a couple of small things on the AFM ?cache? side to see if it?d make a difference such as: > > mmchconfig afmNumWriteThreads=4 > mmchconfig afmNumReadThreads=4 > > But no difference so far. > > Thoughts would be appreciated. I?ve done this before over much shorter distances (30Km) and I?ve flattened a 10GbE wire without really tuning?anything. Are my large in-flight-packets numbers/long-time-to-acknowledgement semantics going to hurt here? I really thought AFM might be well designed for exactly this kind of work at long distance *and* high throughput ? so I must be missing something! > > -jc > > > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20161109/d4f4d9a7/attachment-0001.html > > > ------------------------------ > > Message: 3 > Date: Wed, 09 Nov 2016 18:05:21 +0000 > From: Jan-Frode Myklebust > To: "gpfsug-discuss at spectrumscale.org" > > Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO > over _really_ long distances > Message-ID: > > Content-Type: text/plain; charset="utf-8" > > Mostly curious, don't have experience in such environments, but ... Is this > AFM over NFS or NSD protocol? Might be interesting to try the other option > -- and also check how nsdperf performs over such distance/latency. > > > > -jf >> ons. 9. nov. 2016 kl. 18.39 skrev Jake Carroll : >> >> Hi. >> >> >> >> I?ve got an GPFS to GPFS AFM cache/home (IW) relationship set up over a >> really long distance. About 180ms of latency between the two clusters and >> around 13,000km of optical path. Fortunately for me, I?ve actually got near >> theoretical maximum IO over the NIC?s between the clusters and I?m >> iPerf?ing at around 8.90 to 9.2Gbit/sec over a 10GbE circuit. All MTU9000 >> all the way through. >> >> >> >> Anyway ? I?m finding my AFM traffic to be dragging its feet and I don?t >> really understand why that might be. I?ve verified the links and transports >> ability as I said above with iPerf, and CERN?s FDT to near 10Gbit/sec. >> >> >> >> I also verified the clusters on both sides in terms of disk IO and they >> both seem easily capable in IOZone and IOR tests of multiple GB/sec of >> throughput. >> >> >> >> So ? my questions: >> >> >> >> 1. Are there very specific tunings AFM needs for high latency/long >> distance IO? >> >> 2. Are there very specific NIC/TCP-stack tunings (beyond the type >> of thing we already have in place) that benefits AFM over really long >> distances and high latency? >> >> 3. We are seeing on the ?cache? side really lazy/sticky ?ls ?als? >> in the home mount. It sometimes takes 20 to 30 seconds before the command >> line will report back with a long listing of files. Any ideas why it?d take >> that long to get a response from ?home?. >> >> >> >> We?ve got our TCP stack setup fairly aggressively, on all hosts that >> participate in these two clusters. 
>> >> >> >> ethtool -C enp2s0f0 adaptive-rx off >> >> ifconfig enp2s0f0 txqueuelen 10000 >> >> sysctl -w net.core.rmem_max=536870912 >> >> sysctl -w net.core.wmem_max=536870912 >> >> sysctl -w net.ipv4.tcp_rmem="4096 87380 268435456" >> >> sysctl -w net.ipv4.tcp_wmem="4096 65536 268435456" >> >> sysctl -w net.core.netdev_max_backlog=250000 >> >> sysctl -w net.ipv4.tcp_congestion_control=htcp >> >> sysctl -w net.ipv4.tcp_mtu_probing=1 >> >> >> >> I modified a couple of small things on the AFM ?cache? side to see if it?d >> make a difference such as: >> >> >> >> mmchconfig afmNumWriteThreads=4 >> >> mmchconfig afmNumReadThreads=4 >> >> >> >> But no difference so far. >> >> >> >> Thoughts would be appreciated. I?ve done this before over much shorter >> distances (30Km) and I?ve flattened a 10GbE wire without really >> tuning?anything. Are my large in-flight-packets >> numbers/long-time-to-acknowledgement semantics going to hurt here? I really >> thought AFM might be well designed for exactly this kind of work at long >> distance **and** high throughput ? so I must be missing something! >> >> >> >> -jc >> >> >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20161109/f44369ab/attachment.html > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 58, Issue 12 > ********************************************** > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 58, Issue 13 > ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From tortay at cc.in2p3.fr Thu Nov 10 06:38:35 2016 From: tortay at cc.in2p3.fr (Loic Tortay) Date: Thu, 10 Nov 2016 07:38:35 +0100 Subject: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances In-Reply-To: References: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au> Message-ID: <582415EB.1030208@cc.in2p3.fr> On 09/11/2016 21:53, Olaf Weiser wrote: > let's say you have a RRT of 180 ms > what you then need is your theoretical link speed - let's say 10 Gbit/s ... > easily let's take 1 GB/s > > this means, you socket must be capable to take your bandwidth (data stream) > during the "first" 180ms because it will take at least this time to get back the > first ACKs .. . > so 1 GB / s x 0,180 s = 1024 MB/s x 0,180 s ==>> 185 MB this means, you have > to allow the operating system to accept socketsizes in that range... 
> > set something like this - but increase these values to 185 MB > sysctl -w net.ipv4.tcp_rmem="12194304 12194304 12194304" > sysctl -w net.ipv4.tcp_wmem="12194304 12194304 12194304" > sysctl -w net.ipv4.tcp_mem="12194304 12194304 12194304" > sysctl -w net.core.rmem_max=12194304 > sysctl -w net.core.wmem_max=12194304 > sysctl -w net.core.rmem_default=12194304 > sysctl -w net.core.wmem_default=12194304 > sysctl -w net.core.optmem_max=12194304 > Hello, In my opinion, some of these changes are, at best, misguided. For instance, the unit for "tcp_mem" is not bytes but pages. It's also not a parameter for buffers but a parameter influencing global kernel memory management for TCP (source: Linux kernel documentation/source). Or setting the maximum TCP ancillary data buffer size ("optmem_max") to a very large value when, as far as I know/saw when testing AFM w/ NFS, there is no ancillary data used. Setting the min, default and max to the same value for the buffers is also, in my opinion, highly debatable (do you really want, for instance, each and every SSH connection to have 185 MB TCP buffers? -- 185 MB being the value suggested above). I have seen the same suggestions in the AFM documentation, and in my opinion, along with the unhelpful "nfsPrefetchStrategy" recommendation ("it's critical: set it to at least 5 to 10", OK but how do I choose between 5 and 10, or should I use 42?, what's the unit?, what are the criteria?), these do not contribute to a good understanding of the configuration (let alone "optimization") required for AFM over NFS. I must add that, in my opinion, I have "enough" experience with setting these "sysctl" parameters of NFS "tuning" (so I'm not overwhelmed by the complexity or whatever), to think something is really not right in that part of the AFM documentation. Loïc. -- | Loïc Tortay - IN2P3 Computing Centre | -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 2931 bytes Desc: S/MIME Cryptographic Signature URL: From olaf.weiser at de.ibm.com Thu Nov 10 08:01:56 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Thu, 10 Nov 2016 09:01:56 +0100 Subject: Re: [gpfsug-discuss] Tuning AFM for high throughput/high IO over _really_ long distances In-Reply-To: <582415EB.1030208@cc.in2p3.fr> References: <83652C3D-0802-4CC2-B636-9FAA31EF5AF0@uq.edu.au> <582415EB.1030208@cc.in2p3.fr> Message-ID: An HTML attachment was scrubbed... URL: From luke.raimbach at googlemail.com Thu Nov 10 09:52:10 2016 From: luke.raimbach at googlemail.com (Luke Raimbach) Date: Thu, 10 Nov 2016 09:52:10 +0000 Subject: [gpfsug-discuss] AFM Licensing Message-ID: Hi All, I have a tantalisingly interesting question about licensing... When installing a couple of AFM gateway nodes into a cluster for data migration, where the AFM filesets will only ever be local-updates, those nodes should just require a client license, right? No GPFS data will leave through those nodes, so I can't see any valid argument for them being server licensed. Anyone want to disagree? Cheers, Luke. -------------- next part -------------- An HTML attachment was scrubbed... URL: From abeattie at au1.ibm.com Thu Nov 10 12:07:25 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Thu, 10 Nov 2016 12:07:25 +0000 Subject: Re: [gpfsug-discuss] AFM Licensing In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed...
Name: Image.14787785499920.png Type: image/png Size: 30777 bytes Desc: not available URL: From ulmer at ulmer.org Thu Nov 10 13:10:06 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Thu, 10 Nov 2016 08:10:06 -0500 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: Message-ID: <6F30DFAF-1BD5-48D0-855E-FB9A3187AD0A@ulmer.org> The table you included was about Editions, not License types. -- Stephen > On Nov 10, 2016, at 7:07 AM, Andrew Beattie wrote: > > I think you will find that AFM in any flavor is a function of the Server license, not a client license. > > i've always found this to be a pretty good guide, although you now need to add Transparent Cloud Tiering into the bottom column > > > > > > Andrew Beattie > Software Defined Storage - IT Specialist > Phone: 614-2133-7927 > E-mail: abeattie at au1.ibm.com > > > ----- Original message ----- > From: Luke Raimbach > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 8:22 PM > > HI All, > > I have a tantalisingly interesting question about licensing... > > When installing a couple of AFM gateway nodes into a cluster for data migration, where the AFM filesets will only ever be local-updates, those nodes should just require a client license, right? No GPFS data will leave through those nodes, so I can't see any valid argument for them being server licensed. > > Anyone want to disagree? > > Cheers, > Luke. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From luke.raimbach at googlemail.com Thu Nov 10 14:11:57 2016 From: luke.raimbach at googlemail.com (Luke Raimbach) Date: Thu, 10 Nov 2016 14:11:57 +0000 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: Message-ID: Thanks for the feature matrix, but it doesn't really say anything about client / server licenses. Surely you can have clients and servers in all three flavours - Express, Standard and Advanced. On Thu, 10 Nov 2016 at 12:07 Andrew Beattie wrote: > I think you will find that AFM in any flavor is a function of the Server > license, not a client license. > > i've always found this to be a pretty good guide, although you now need to > add Transparent Cloud Tiering into the bottom column > > > > > Andrew Beattie > Software Defined Storage - IT Specialist > Phone: 614-2133-7927 > E-mail: abeattie at au1.ibm.com > > > > ----- Original message ----- > From: Luke Raimbach > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 8:22 PM > > HI All, > > I have a tantalisingly interesting question about licensing... > > When installing a couple of AFM gateway nodes into a cluster for data > migration, where the AFM filesets will only ever be local-updates, those > nodes should just require a client license, right? No GPFS data will leave > through those nodes, so I can't see any valid argument for them being > server licensed. > > Anyone want to disagree? > > Cheers, > Luke. 
> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.14787785499920.png Type: image/png Size: 30777 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.14787785499920.png Type: image/png Size: 30777 bytes Desc: not available URL: From kevindjo at us.ibm.com Thu Nov 10 14:20:23 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Thu, 10 Nov 2016 14:20:23 +0000 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.14787856423282.png Type: image/png Size: 30777 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.14787856423283.png Type: image/png Size: 30777 bytes Desc: not available URL: From luke.raimbach at googlemail.com Thu Nov 10 14:37:02 2016 From: luke.raimbach at googlemail.com (Luke Raimbach) Date: Thu, 10 Nov 2016 14:37:02 +0000 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: Message-ID: Hi Kevin, Thanks for the response, but that page is still not helpful. We will not be exporting any data from the GPFS cluster through the AFM gateways. Data will be coming from external NFS data sources, through the gateway nodes INTO the GPFS file systems. Reading that licensing page suggests a client license is acceptable in this situation. There is no mention of AFM explicitly as a function of the server license. Cheers, Luke. On Thu, 10 Nov 2016 at 14:20 Kevin D Johnson wrote: > An AFM gateway node would definitely be a server licensed node. Here are > the working definitions, and yes, this would be true for the various > editions of IBM Spectrum Scale: > > > http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.ins.doc/bl1ins_gpfslicensedesignation.htm > > Kevin D. Johnson, MBA, MAFM > Spectrum Computing, Senior Managing Consultant > > IBM Certified Deployment Professional - Spectrum Scale V4.1.1 > IBM Certified Deployment Professional - Cloud Object Storage V3.8 > IBM Certified Solution Advisor - Spectrum Computing V1 > > 720.349.6199 - kevindjo at us.ibm.com > > > > > ----- Original message ----- > From: Luke Raimbach > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > > Subject: Re: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 9:12 AM > > Thanks for the feature matrix, but it doesn't really say anything about > client / server licenses. Surely you can have clients and servers in all > three flavours - Express, Standard and Advanced. > > On Thu, 10 Nov 2016 at 12:07 Andrew Beattie wrote: > > I think you will find that AFM in any flavor is a function of the Server > license, not a client license. 
> > i've always found this to be a pretty good guide, although you now need to > add Transparent Cloud Tiering into the bottom column > > > > > Andrew Beattie > Software Defined Storage - IT Specialist > Phone: 614-2133-7927 > E-mail: abeattie at au1.ibm.com > > > > > > ----- Original message ----- > From: Luke Raimbach > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 8:22 PM > > HI All, > > I have a tantalisingly interesting question about licensing... > > When installing a couple of AFM gateway nodes into a cluster for data > migration, where the AFM filesets will only ever be local-updates, those > nodes should just require a client license, right? No GPFS data will leave > through those nodes, so I can't see any valid argument for them being > server licensed. > > Anyone want to disagree? > > Cheers, > Luke. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > [image: Image.14787785499920.png][image: Image.14787785499920.png] > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.14787856423282.png Type: image/png Size: 30777 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.14787856423283.png Type: image/png Size: 30777 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.14787856423283.png Type: image/png Size: 30777 bytes Desc: not available URL: From gcorneau at us.ibm.com Thu Nov 10 15:02:55 2016 From: gcorneau at us.ibm.com (Glen Corneau) Date: Thu, 10 Nov 2016 09:02:55 -0600 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: Message-ID: The FAQ item does list "sharing data via NFS" as a Server license function (which is what the gateway node does): The IBM Spectrum Scale Server license permits the licensed virtual server to perform IBM Spectrum Scale management functions such as cluster configuration manager, quorum node, manager node, and Network Shared Disk (NSD) server. In addition, the IBM Spectrum Scale Server license permits the licensed virtual server to share IBM Spectrum Scale data directly through any application, service protocol or method such as Network File System (NFS), Common Internet File System (CIFS), File Transfer Protocol (FTP), Hypertext Transfer Protocol (HTTP), or OpenStack Swift. 
http://www.ibm.com/support/knowledgecenter/en/SSFKCN/com.ibm.cluster.gpfs.doc/gpfs_faqs/gpfsclustersfaq.html?view=kc#lic41 ------------------ Glen Corneau Washington Systems Center - Power Systems gcorneau at us.ibm.com From: Luke Raimbach To: gpfsug main discussion list Date: 11/10/2016 08:37 AM Subject: Re: [gpfsug-discuss] AFM Licensing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Kevin, Thanks for the response, but that page is still not helpful. We will not be exporting any data from the GPFS cluster through the AFM gateways. Data will be coming from external NFS data sources, through the gateway nodes INTO the GPFS file systems. Reading that licensing page suggests a client license is acceptable in this situation. There is no mention of AFM explicitly as a function of the server license. Cheers, Luke. On Thu, 10 Nov 2016 at 14:20 Kevin D Johnson wrote: An AFM gateway node would definitely be a server licensed node. Here are the working definitions, and yes, this would be true for the various editions of IBM Spectrum Scale: http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.ins.doc/bl1ins_gpfslicensedesignation.htm Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage V3.8 IBM Certified Solution Advisor - Spectrum Computing V1 720.349.6199 - kevindjo at us.ibm.com ----- Original message ----- From: Luke Raimbach Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] AFM Licensing Date: Thu, Nov 10, 2016 9:12 AM Thanks for the feature matrix, but it doesn't really say anything about client / server licenses. Surely you can have clients and servers in all three flavours - Express, Standard and Advanced. On Thu, 10 Nov 2016 at 12:07 Andrew Beattie wrote: I think you will find that AFM in any flavor is a function of the Server license, not a client license. i've always found this to be a pretty good guide, although you now need to add Transparent Cloud Tiering into the bottom column Andrew Beattie Software Defined Storage - IT Specialist Phone: 614-2133-7927 E-mail: abeattie at au1.ibm.com ----- Original message ----- From: Luke Raimbach Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: [gpfsug-discuss] AFM Licensing Date: Thu, Nov 10, 2016 8:22 PM HI All, I have a tantalisingly interesting question about licensing... When installing a couple of AFM gateway nodes into a cluster for data migration, where the AFM filesets will only ever be local-updates, those nodes should just require a client license, right? No GPFS data will leave through those nodes, so I can't see any valid argument for them being server licensed. Anyone want to disagree? Cheers, Luke. 
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss[attachment "Image.14787856423282.png" deleted by Glen Corneau/Austin/IBM] [attachment "Image.14787856423283.png" deleted by Glen Corneau/Austin/IBM] [attachment "Image.14787856423283.png" deleted by Glen Corneau/Austin/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 26117 bytes Desc: not available URL: From kevindjo at us.ibm.com Thu Nov 10 15:11:34 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Thu, 10 Nov 2016 15:11:34 +0000 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.147878564232819.png Type: image/png Size: 30777 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.147878564232820.png Type: image/png Size: 30777 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.147878564232821.png Type: image/png Size: 30777 bytes Desc: not available URL: From luke.raimbach at googlemail.com Thu Nov 10 15:17:13 2016 From: luke.raimbach at googlemail.com (Luke Raimbach) Date: Thu, 10 Nov 2016 15:17:13 +0000 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: Message-ID: The gateway nodes will be mounting an external NFS server as a *client*. There will be NO NFS exports from these two AFM nodes. AFM Local Update filesets will cache the remote NFS exported file systems (pretend they are ReiserFS not GPFS to make things easier). On Thu, 10 Nov 2016 at 15:07 Glen Corneau wrote: > The FAQ item does list "sharing data via NFS" as a Server license function > (which is what the gateway node does): > > The IBM Spectrum Scale Server license permits the licensed virtual server > to perform IBM Spectrum Scale management functions such as cluster > configuration manager, quorum node, manager node, and Network Shared Disk > (NSD) server. In addition, the IBM Spectrum Scale Server license permits > the licensed virtual server to *share IBM Spectrum Scale data*directly > through any application, service protocol or method s*uch as Network File > System (NFS)*, Common Internet File System (CIFS), File Transfer Protocol > (FTP), Hypertext Transfer Protocol (HTTP), or OpenStack Swift. 
> > > http://www.ibm.com/support/knowledgecenter/en/SSFKCN/com.ibm.cluster.gpfs.doc/gpfs_faqs/gpfsclustersfaq.html?view=kc#lic41 > > ------------------ > Glen Corneau > Washington Systems Center - Power Systems > gcorneau at us.ibm.com > > > > > > From: Luke Raimbach > To: gpfsug main discussion list > Date: 11/10/2016 08:37 AM > Subject: Re: [gpfsug-discuss] AFM Licensing > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------ > > > > Hi Kevin, > > Thanks for the response, but that page is still not helpful. > > We will not be exporting any data from the GPFS cluster through the AFM > gateways. Data will be coming from external NFS data sources, through the > gateway nodes INTO the GPFS file systems. > > Reading that licensing page suggests a client license is acceptable in > this situation. There is no mention of AFM explicitly as a function of the > server license. > > Cheers, > Luke. > > On Thu, 10 Nov 2016 at 14:20 Kevin D Johnson <*kevindjo at us.ibm.com* > > wrote: > An AFM gateway node would definitely be a server licensed node. Here are > the working definitions, and yes, this would be true for the various > editions of IBM Spectrum Scale: > > > *http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.ins.doc/bl1ins_gpfslicensedesignation.htm* > > > *Kevin D. Johnson, MBA, MAFM* > > *Spectrum Computing, Senior Managing Consultant* > > *IBM Certified Deployment Professional - Spectrum Scale V4.1.1IBM > Certified Deployment Professional - Cloud Object Storage V3.8* > *IBM Certified Solution Advisor - Spectrum Computing V1* > > *720.349.6199 - **kevindjo at us.ibm.com* > > > > ----- Original message ----- > From: Luke Raimbach <*luke.raimbach at googlemail.com* > > > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Cc: > Subject: Re: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 9:12 AM > > Thanks for the feature matrix, but it doesn't really say anything about > client / server licenses. Surely you can have clients and servers in all > three flavours - Express, Standard and Advanced. > > On Thu, 10 Nov 2016 at 12:07 Andrew Beattie <*abeattie at au1.ibm.com* > > wrote: > I think you will find that AFM in any flavor is a function of the Server > license, not a client license. > > i've always found this to be a pretty good guide, although you now need to > add Transparent Cloud Tiering into the bottom column > > > > > > *Andrew Beattie* > *Software Defined Storage - IT Specialist* > *Phone: *614-2133-7927 > *E-mail: **abeattie at au1.ibm.com* > > > > ----- Original message ----- > From: Luke Raimbach <*luke.raimbach at googlemail.com* > > > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Cc: > Subject: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 8:22 PM > > HI All, > > I have a tantalisingly interesting question about licensing... > > When installing a couple of AFM gateway nodes into a cluster for data > migration, where the AFM filesets will only ever be local-updates, those > nodes should just require a client license, right? No GPFS data will leave > through those nodes, so I can't see any valid argument for them being > server licensed. > > Anyone want to disagree? > > Cheers, > Luke. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > *[attachment > "Image.14787856423282.png" deleted by Glen Corneau/Austin/IBM] [attachment > "Image.14787856423283.png" deleted by Glen Corneau/Austin/IBM] [attachment > "Image.14787856423283.png" deleted by Glen Corneau/Austin/IBM] * > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 26117 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 26117 bytes Desc: not available URL: From luke.raimbach at googlemail.com Thu Nov 10 15:55:33 2016 From: luke.raimbach at googlemail.com (Luke Raimbach) Date: Thu, 10 Nov 2016 15:55:33 +0000 Subject: [gpfsug-discuss] AFM Licensing In-Reply-To: References: Message-ID: Thanks! That's what I was looking for. Cheers, Luke. On Thu, 10 Nov 2016 at 15:17 Luke Raimbach wrote: > The gateway nodes will be mounting an external NFS server as a *client*. > There will be NO NFS exports from these two AFM nodes. > > AFM Local Update filesets will cache the remote NFS exported file systems > (pretend they are ReiserFS not GPFS to make things easier). > > > > On Thu, 10 Nov 2016 at 15:07 Glen Corneau wrote: > > The FAQ item does list "sharing data via NFS" as a Server license function > (which is what the gateway node does): > > The IBM Spectrum Scale Server license permits the licensed virtual server > to perform IBM Spectrum Scale management functions such as cluster > configuration manager, quorum node, manager node, and Network Shared Disk > (NSD) server. In addition, the IBM Spectrum Scale Server license permits > the licensed virtual server to *share IBM Spectrum Scale data*directly > through any application, service protocol or method s*uch as Network File > System (NFS)*, Common Internet File System (CIFS), File Transfer Protocol > (FTP), Hypertext Transfer Protocol (HTTP), or OpenStack Swift. 
> > > http://www.ibm.com/support/knowledgecenter/en/SSFKCN/com.ibm.cluster.gpfs.doc/gpfs_faqs/gpfsclustersfaq.html?view=kc#lic41 > > ------------------ > Glen Corneau > Washington Systems Center - Power Systems > gcorneau at us.ibm.com > > > > > > From: Luke Raimbach > To: gpfsug main discussion list > Date: 11/10/2016 08:37 AM > Subject: Re: [gpfsug-discuss] AFM Licensing > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------ > > > > Hi Kevin, > > Thanks for the response, but that page is still not helpful. > > We will not be exporting any data from the GPFS cluster through the AFM > gateways. Data will be coming from external NFS data sources, through the > gateway nodes INTO the GPFS file systems. > > Reading that licensing page suggests a client license is acceptable in > this situation. There is no mention of AFM explicitly as a function of the > server license. > > Cheers, > Luke. > > On Thu, 10 Nov 2016 at 14:20 Kevin D Johnson <*kevindjo at us.ibm.com* > > wrote: > An AFM gateway node would definitely be a server licensed node. Here are > the working definitions, and yes, this would be true for the various > editions of IBM Spectrum Scale: > > > *http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.ins.doc/bl1ins_gpfslicensedesignation.htm* > > > *Kevin D. Johnson, MBA, MAFM* > > *Spectrum Computing, Senior Managing Consultant* > > *IBM Certified Deployment Professional - Spectrum Scale V4.1.1IBM > Certified Deployment Professional - Cloud Object Storage V3.8* > *IBM Certified Solution Advisor - Spectrum Computing V1* > > *720.349.6199 - **kevindjo at us.ibm.com* > > > > ----- Original message ----- > From: Luke Raimbach <*luke.raimbach at googlemail.com* > > > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Cc: > Subject: Re: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 9:12 AM > > Thanks for the feature matrix, but it doesn't really say anything about > client / server licenses. Surely you can have clients and servers in all > three flavours - Express, Standard and Advanced. > > On Thu, 10 Nov 2016 at 12:07 Andrew Beattie <*abeattie at au1.ibm.com* > > wrote: > I think you will find that AFM in any flavor is a function of the Server > license, not a client license. > > i've always found this to be a pretty good guide, although you now need to > add Transparent Cloud Tiering into the bottom column > > > > > > *Andrew Beattie* > *Software Defined Storage - IT Specialist* > *Phone: *614-2133-7927 > *E-mail: **abeattie at au1.ibm.com* > > > > ----- Original message ----- > From: Luke Raimbach <*luke.raimbach at googlemail.com* > > > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Cc: > Subject: [gpfsug-discuss] AFM Licensing > Date: Thu, Nov 10, 2016 8:22 PM > > HI All, > > I have a tantalisingly interesting question about licensing... > > When installing a couple of AFM gateway nodes into a cluster for data > migration, where the AFM filesets will only ever be local-updates, those > nodes should just require a client license, right? No GPFS data will leave > through those nodes, so I can't see any valid argument for them being > server licensed. > > Anyone want to disagree? > > Cheers, > Luke. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > *[attachment > "Image.14787856423282.png" deleted by Glen Corneau/Austin/IBM] [attachment > "Image.14787856423283.png" deleted by Glen Corneau/Austin/IBM] [attachment > "Image.14787856423283.png" deleted by Glen Corneau/Austin/IBM] * > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Thu Nov 10 19:14:57 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Thu, 10 Nov 2016 19:14:57 +0000 Subject: [gpfsug-discuss] Local Read-Only cache: Undocumented Config options Message-ID: Can anyone tell me what the highlighted options are? Some of them look "interesting". lrocChecksum 0 lrocData 1 lrocDataMaxBufferSize 0 ! lrocDataMaxFileSize -1 ! lrocDataStubFileSize -1 lrocDeviceMaxSectorsKB 64 lrocDeviceNrRequests 1024 lrocDeviceQueueDepth 31 lrocDevices 0A1E183A5824A9ED#/dev/sdb; lrocDeviceScheduler deadline lrocDeviceSetParams 1 lrocDirectories 1 lrocInodes 1 Bob Oesterlin Sr Principal Storage Engineer, Nuance 507-269-0413 -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Fri Nov 11 08:50:00 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 11 Nov 2016 08:50:00 +0000 Subject: [gpfsug-discuss] How to clear stale entries in GUI log In-Reply-To: References: , Message-ID: That?s worked, thanks Andreas. Question: when I upgrade to the new PTF when it?s available, can I install it first on just the GUI node (which happens to be the Quorum server for the cluster) and the fixes will go in, or do I need to deploy the new pmsensors packages? From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Andreas Koeninger Sent: 08 November 2016 16:50 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] How to clear stale entries in GUI log Hello Richard, without the PTF (which is not yet available) you will have to manually clear the GUI database as well by running the following command on the GUI node: psql postgres postgres -c "delete from fscc.gss_state where sensor like 'H\_%';" This will remove all events from the GUI database coming from mmhealth. 
To repopulate this table with all the currently reported events from mmhealth please run: /usr/lpp/mmfs/gui/cli/runtask HEALTH_STATES Let me know if that helps, Andreas Koeninger Spectrum Scale GUI Development ----- Original message ----- From: "Sobey, Richard A" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] How to clear stale entries in GUI log Date: Tue, Nov 8, 2016 4:10 PM Thanks. I?ve run that on, I assume, our quorum server where this disk is mounted, but the error is still showing up. The event itself doesn?t say which node is affected. ICSAN_GPFS_FSD_QUORUM nsd 512 103 no no ready up system That looks ok to me. Maybe I misunderstood your line ?This is a per node database, so you need to run this on all the nodes which have stale entries.?. Should I just run it on all the nodes in the cluster instead? there?s not many so won?t take long but wondering if that?s really necessary? Thanks Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Markus Rohwedder Sent: 08 November 2016 14:51 To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] How to clear stale entries in GUI log Hello, you ran into a defect which is fixed with the upcoming 4.2.1.2 PTF Here is a workaround: You can clear the eventlog of the system health component using mmsysmonc clearDB This is a per node database, so you need to run this on all the nodes which have stale entries. It will clear all the events on this node, if you want to save them run: mmhealth node eventlog > log.save On the GUI node, run systemctl restart gpfsgui afterwards. The mmhealth command suppresses events during startup. So in case a bad condition turns OK during a restart phase, the bad event will remain stale. Regards, Markus Rohwedder IBM Spectrum Scale GUI development _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From andy_parker1 at uk.ibm.com Fri Nov 11 16:20:52 2016 From: andy_parker1 at uk.ibm.com (Andy Parker1) Date: Fri, 11 Nov 2016 16:20:52 +0000 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Message-ID: We have setup a small cluster to test, play & learn about the protocol servers. We have setup mmuserauth for AD + RFC2307 and we can share and access data via SMB and access is on windows clients with no issues. The file DAC of a file created via windows looks like this from the SS cesNode: $ ls -l total 0 -rwxr--r-- 1 SPECTRUMSCALE\newmanjo SPECTRUMSCALE\ces-admins 33 Nov 10 17:29 helloworld.txt The NFS protocol is also exported for NFS 3,4 and when mount using NFS version '3' from an AIX 7.1 server I see also OK DAC names uid / group, so the UID mapping is working. The AIX is linked to the AD for LDAP account services and I can query accounts and get shell logon for accounts defined within AD for unix services. # ls -l ( from AIX client NFS V3) total 0 -rwxr--r-- 1 newmanjo ces-admi 33 10 Nov 17:29 helloworld.txt Now the Problem: When I mount the AIX client as NFS4 I do no see the user/group names. I know NFS4 passes names and not UID/GID numbers so I guess this is linked. 
# pwd /mnt/ibm/hurss/share1 # ls -l ( from AIX client NFS V4) total 0 -rwxr--r-- 1 nobody nobody 33 10 Nov 17:29 helloworld.txt On the AIX server I have set NFS domain to virtual1.com # chnfsdom Current local domain: virtual1.com This matches the DOMAIN from the mmnfs config list domain ( not 100% sure this is correct) [root at hurss4 ~]# mmnfs config list NFS Ganesha Configuration: ========================== NFS_PROTOCOLS: 3,4 NFS_PORT: 2049 MNT_PORT: 0 NLM_PORT: 0 RQUOTA_PORT: 0 SHORT_FILE_HANDLE: FALSE LEASE_LIFETIME: 60 DOMAINNAME: VIRTUAL1.COM DELEGATIONS: Disabled Also the 'nfsrgyd' a name translation service for NFS servers and clients is running. lssrc -s nfsrgyd Subsystem Group PID Status nfsrgyd nfs 8585412 active Summary / Question: Can anybody explain why I do not see userID / Group names when viewing via a NFS4 client and ideally how to fix this. Rgds Andy P Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Sat Nov 12 20:23:42 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Sat, 12 Nov 2016 15:23:42 -0500 Subject: [gpfsug-discuss] How to clear stale entries in GUI log In-Reply-To: References: , Message-ID: <157403.1478982222@turing-police.cc.vt.edu> On Fri, 11 Nov 2016 08:50:00 +0000, "Sobey, Richard A" said: > Question: when I upgrade to the new PTF when it???s available, can I install it > first on just the GUI node (which happens to be the Quorum server for the > cluster) *the* quorum server, not "one of the quorum nodes"? Best practice is to have enough nodes designated as quorum nodes so even if one of them is taken down for upgrade or maintenance, the cluster as a whole remains up and serving data. That way, you can do rolling installs of patches without taking an outage. The number to pick depends on your config - we have one cluster with 4 NSD servers, where we've defined all 4 as quorum nodes. That way, as long as 3 of them (half plus 1) are up, the cluster stays up. We have another stretch cluster with 10 servers (5 at each node), and we defined 3 quorum nodes at our main site, and 2 at the remote site, specifically so that if we did lose the 10G link between sites, the main site would retain quorum and stay up. (Losing the remote site is, in our setup, *much* less critical than ensuring the main site stays up. We replicate between the two, and if the remote is down, and thus falls behind, mmrestripefs is available for cleaning up) -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 484 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Sat Nov 12 20:39:07 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Sat, 12 Nov 2016 20:39:07 +0000 Subject: [gpfsug-discuss] How to clear stale entries in GUI log In-Reply-To: <157403.1478982222@turing-police.cc.vt.edu> References: , <157403.1478982222@turing-police.cc.vt.edu> Message-ID: Sorry... one of the quorum nodes. 
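As a generic aside on the quorum-node practice described above, a minimal sketch of how one might review and adjust the quorum designation with standard GPFS administration commands (the node name below is a placeholder, not a node from any cluster in this thread):

# Show which nodes currently carry the quorum designation
mmlscluster | grep -i quorum

# Confirm the cluster still has quorum before taking a quorum node down for patching
mmgetstate -a -L

# Give the quorum designation to an additional node, so that one quorum
# node being down for maintenance no longer threatens cluster quorum
mmchnode --quorum -N nsdserver4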
-----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Valdis.Kletnieks at vt.edu Sent: 12 November 2016 20:24 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] How to clear stale entries in GUI log On Fri, 11 Nov 2016 08:50:00 +0000, "Sobey, Richard A" said: > Question: when I upgrade to the new PTF when it???s available, can I > install it first on just the GUI node (which happens to be the Quorum > server for the > cluster) *the* quorum server, not "one of the quorum nodes"? Best practice is to have enough nodes designated as quorum nodes so even if one of them is taken down for upgrade or maintenance, the cluster as a whole remains up and serving data. That way, you can do rolling installs of patches without taking an outage. The number to pick depends on your config - we have one cluster with 4 NSD servers, where we've defined all 4 as quorum nodes. That way, as long as 3 of them (half plus 1) are up, the cluster stays up. We have another stretch cluster with 10 servers (5 at each node), and we defined 3 quorum nodes at our main site, and 2 at the remote site, specifically so that if we did lose the 10G link between sites, the main site would retain quorum and stay up. (Losing the remote site is, in our setup, *much* less critical than ensuring the main site stays up. We replicate between the two, and if the remote is down, and thus falls behind, mmrestripefs is available for cleaning up) From laurence at qsplace.co.uk Sat Nov 12 20:53:37 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Sat, 12 Nov 2016 20:53:37 +0000 Subject: [gpfsug-discuss] How to clear stale entries in GUI log In-Reply-To: References: <157403.1478982222@turing-police.cc.vt.edu> Message-ID: The Quorum buster node :P -- Lauz On 12/11/2016 20:39, Sobey, Richard A wrote: > Sorry... one of the quorum nodes. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Valdis.Kletnieks at vt.edu > Sent: 12 November 2016 20:24 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] How to clear stale entries in GUI log > > On Fri, 11 Nov 2016 08:50:00 +0000, "Sobey, Richard A" said: >> Question: when I upgrade to the new PTF when it???s available, can I >> install it first on just the GUI node (which happens to be the Quorum >> server for the >> cluster) > *the* quorum server, not "one of the quorum nodes"? > > Best practice is to have enough nodes designated as quorum nodes so even if one of them is taken down for upgrade or maintenance, the cluster as a whole remains up and serving data. That way, you can do rolling installs of patches without taking an outage. > > The number to pick depends on your config - we have one cluster with 4 NSD servers, where we've defined all 4 as quorum nodes. That way, as long as 3 of them (half plus 1) are up, the cluster stays up. We have another stretch cluster with 10 servers (5 at each node), and we defined 3 quorum nodes at our main site, and 2 at the remote site, specifically so that if we did lose the 10G link between sites, the main site would retain quorum and stay up. > (Losing the remote site is, in our setup, *much* less critical than ensuring the main site stays up. 
We replicate between the two, and if the remote is down, and thus falls behind, mmrestripefs is available for cleaning up) > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > --- This email has been checked for viruses by Avast antivirus software. https://www.avast.com/antivirus From jake.carroll at uq.edu.au Sun Nov 13 14:18:38 2016 From: jake.carroll at uq.edu.au (Jake Carroll) Date: Sun, 13 Nov 2016 14:18:38 +0000 Subject: [gpfsug-discuss] Achieving high parallelism with AFM using NFS? Message-ID: <025F8914-F7A0-465F-9B99-961F70DA2B03@uq.edu.au> Hi all. After some help from IBM, we?ve concluded (and been told) that AFM over the NSD protocol when latency is greater than around 50ms on the RTT is effectively unusable. We?ve proven that now, so it is time to move on from the NSD protocol being an effective option in those conditions (unless IBM can consider it something worthy of an RFE and can fix it!). The problem we face now, is one of parallelism and filling that 10GbE/40GbE/100GbE pipe efficiently, when using NFS as the transport provider for AFM. On my test cluster at ?Cache? side I?ve got two or three gateways: [root at mc-5 ~]# mmlscluster GPFS cluster information ======================== GPFS cluster name: sdx-gpfs.xxxxxxxxxxxxxxxx GPFS cluster id: 12880500218013865782 GPFS UID domain: sdx-gpfs. xxxxxxxxxxxxxxxx Remote shell command: /usr/bin/ssh Remote file copy command: /usr/bin/scp Repository type: CCR Node Daemon node name IP address Admin node name Designation --------------------------------------------------------------------------------------- 1 mc-5. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-5.hidden.net quorum-manager 2 mc-6. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-6. hidden.net quorum-manager-gateway 3 mc-7. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-7. hidden.net quorum-manager-gateway 4 mc-8. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-8. hidden.net quorum-manager-gateway The bit I really don?t get is: 1. Why no traffic ever seems to go through mc-6 or mc-8 back to my ?home? directly and 2. Why it only ever lists my AFM-cache fileset being associated with one gateway (mc-7). I can see traffic flowing through mc-6 sometimes?but when it does, it all seems to channel back through mc-7 THEN back to the AFM-home. Am I missing something? This is where I see one of the gateway?s listed (but never the others?). [root at mc-5 ~]# mmafmctl afmcachefs getstate Fileset Name Fileset Target Cache State Gateway Node Queue Length Queue numExec ------------ -------------- ------------- ------------ ------------ ------------- afm-home nfs://omnipath2/gpfs-flash/afm-home Active mc-7 0 746636 I got told I needed to setup ?explicit maps? back to my home cluster to achieve parallelism: [root at mc-5 ~]# mmafmconfig show Map name: omnipath1 Export server map: address.is.hidden.100/mc-6.ip.address.hidden Map name: omnipath2 Export server map: address.is.hidden.101/mc-7.ip.address.hidden But ? I have never seen any traffic come back from mc-6 to omnipath1. What am I missing, and how do I actually achieve significant enough parallelism over an NFS transport to fill my 10GbE pipe? I?ve seen maybe a couple of gigabits per second from the mc-7 host writing back to the omnipath2 host ? 
and that was really trying my level best to put as many files onto the afm-cache at this side and hoping that enough threads pick up enough different files to start transferring files down the AFM simultaneously ? but what I?d really like is those large files (or small, up to the thresholds set) to break into parallel chunks and ALL transfer as fast as possible, utilising as much of the 10GbE as they can. Maybe I am missing fundamental principles in the way AFM works? Thanks. -jc PS: NB The link is easily capable of 10GbE. We?ve tested it all the way up to about 9.67Gbit/sec transferring data from these sets of hosts using other protocols such as fDT and Globus Grid FTP Et al. -------------- next part -------------- An HTML attachment was scrubbed... URL: From kallbac at iu.edu Mon Nov 14 00:49:09 2016 From: kallbac at iu.edu (Kallback-Rose, Kristy A) Date: Sun, 13 Nov 2016 19:49:09 -0500 Subject: [gpfsug-discuss] SC16 SSUG Event Exit Poll Message-ID: <187C0352-CB2D-42AA-A75F-43B1D073D519@iu.edu> If you attended the SC16 meeting today please complete this quick exit poll so we can make improvements and keep what you liked. https://www.surveymonkey.com/r/SSUGSC16 Thanks, Kristy -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 495 bytes Desc: Message signed with OpenPGP using GPGMail URL: From kallbac at iu.edu Mon Nov 14 00:50:05 2016 From: kallbac at iu.edu (Kallback-Rose, Kristy A) Date: Sun, 13 Nov 2016 19:50:05 -0500 Subject: [gpfsug-discuss] Next US In-person Spectrum Scale Users Group meeting Message-ID: <4F396324-FCBC-4DF7-B47B-5F0EA6A42EE7@iu.edu> Hi all, We need to start planning for a spring-time(ish?) in-person meeting in the US. It would be great if we can have the event at a user site, so before I send out a survey about where to have the next meeting, I need to know who would be willing to host. So, please reach out and let us know. If you?re at SC16 feel free to reach out to Bob or me in person. Best, Kristy Kristy Kallback-Rose Manager, Research Storage Indiana University -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 495 bytes Desc: Message signed with OpenPGP using GPGMail URL: From bbanister at jumptrading.com Mon Nov 14 01:32:56 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 14 Nov 2016 01:32:56 +0000 Subject: [gpfsug-discuss] Next US In-person Spectrum Scale Users Group meeting In-Reply-To: <4F396324-FCBC-4DF7-B47B-5F0EA6A42EE7@iu.edu> References: <4F396324-FCBC-4DF7-B47B-5F0EA6A42EE7@iu.edu> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063D7717@CHI-EXCHANGEW1.w2k.jumptrading.com> I nominate LBNL/NERSC... I remember them saying they could host in a previous UG meeting. Only a year open now in their new building and I still need to come check it out. ;o) How about it LBNL/NERSC?? -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Kallback-Rose, Kristy A Sent: Sunday, November 13, 2016 6:50 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Next US In-person Spectrum Scale Users Group meeting Hi all, We need to start planning for a spring-time(ish?) in-person meeting in the US. It would be great if we can have the event at a user site, so before I send out a survey about where to have the next meeting, I need to know who would be willing to host. 
So, please reach out and let us know. If you?re at SC16 feel free to reach out to Bob or me in person. Best, Kristy Kristy Kallback-Rose Manager, Research Storage Indiana University ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From kallbac at iu.edu Mon Nov 14 05:34:22 2016 From: kallbac at iu.edu (Kallback-Rose, Kristy A) Date: Mon, 14 Nov 2016 05:34:22 +0000 Subject: [gpfsug-discuss] Next US In-person Spectrum Scale Users Group meeting Message-ID: <655cafd8-6446-4878-8268-127d23fc2a80@email.android.com> Thanks Brian. It's on the list. Others? On Nov 13, 2016 6:33 PM, Bryan Banister wrote: I nominate LBNL/NERSC... I remember them saying they could host in a previous UG meeting. Only a year open now in their new building and I still need to come check it out. ;o) How about it LBNL/NERSC?? -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Kallback-Rose, Kristy A Sent: Sunday, November 13, 2016 6:50 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Next US In-person Spectrum Scale Users Group meeting Hi all, We need to start planning for a spring-time(ish?) in-person meeting in the US. It would be great if we can have the event at a user site, so before I send out a survey about where to have the next meeting, I need to know who would be willing to host. So, please reach out and let us know. If you?re at SC16 feel free to reach out to Bob or me in person. Best, Kristy Kristy Kallback-Rose Manager, Research Storage Indiana University ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From radhika.p at in.ibm.com Mon Nov 14 09:59:26 2016 From: radhika.p at in.ibm.com (Radhika A Parameswaran) Date: Mon, 14 Nov 2016 15:29:26 +0530 Subject: [gpfsug-discuss] Achieving high parallelism with AFM using NFS? Message-ID: Hello Jake, You will have to set the mapping to include all the GW's that you want to involve in the transfer. Please refer to the example provided in the Knowledge Centre: http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_paralleldatatransfersafm.htm Thanks and Regards Radhika Message: 1 Date: Sun, 13 Nov 2016 14:18:38 +0000 From: Jake Carroll To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Achieving high parallelism with AFM using NFS? Message-ID: <025F8914-F7A0-465F-9B99-961F70DA2B03 at uq.edu.au> Content-Type: text/plain; charset="utf-8" Hi all. After some help from IBM, we?ve concluded (and been told) that AFM over the NSD protocol when latency is greater than around 50ms on the RTT is effectively unusable. We?ve proven that now, so it is time to move on from the NSD protocol being an effective option in those conditions (unless IBM can consider it something worthy of an RFE and can fix it!). The problem we face now, is one of parallelism and filling that 10GbE/40GbE/100GbE pipe efficiently, when using NFS as the transport provider for AFM. On my test cluster at ?Cache? side I?ve got two or three gateways: [root at mc-5 ~]# mmlscluster GPFS cluster information ======================== GPFS cluster name: sdx-gpfs.xxxxxxxxxxxxxxxx GPFS cluster id: 12880500218013865782 GPFS UID domain: sdx-gpfs. xxxxxxxxxxxxxxxx Remote shell command: /usr/bin/ssh Remote file copy command: /usr/bin/scp Repository type: CCR Node Daemon node name IP address Admin node name Designation --------------------------------------------------------------------------------------- 1 mc-5. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-5.hidden.net quorum-manager 2 mc-6. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-6. hidden.net quorum-manager-gateway 3 mc-7. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-7. hidden.net quorum-manager-gateway 4 mc-8. xxxxxxxxxxxxxxxx.net ip.addresses.hidden mc-8. hidden.net quorum-manager-gateway The bit I really don?t get is: 1. Why no traffic ever seems to go through mc-6 or mc-8 back to my ?home? directly and 2. Why it only ever lists my AFM-cache fileset being associated with one gateway (mc-7). I can see traffic flowing through mc-6 sometimes?but when it does, it all seems to channel back through mc-7 THEN back to the AFM-home. Am I missing something? This is where I see one of the gateway?s listed (but never the others?). [root at mc-5 ~]# mmafmctl afmcachefs getstate Fileset Name Fileset Target Cache State Gateway Node Queue Length Queue numExec ------------ -------------- ------------- ------------ ------------ ------------- afm-home nfs://omnipath2/gpfs-flash/afm-home Active mc-7 0 746636 I got told I needed to setup ?explicit maps? back to my home cluster to achieve parallelism: [root at mc-5 ~]# mmafmconfig show Map name: omnipath1 Export server map: address.is.hidden.100/mc-6.ip.address.hidden Map name: omnipath2 Export server map: address.is.hidden.101/mc-7.ip.address.hidden But ? I have never seen any traffic come back from mc-6 to omnipath1. What am I missing, and how do I actually achieve significant enough parallelism over an NFS transport to fill my 10GbE pipe? I?ve seen maybe a couple of gigabits per second from the mc-7 host writing back to the omnipath2 host ? 
and that was really trying my level best to put as many files onto the afm-cache at this side and hoping that enough threads pick up enough different files to start transferring files down the AFM simultaneously ? but what I?d really like is those large files (or small, up to the thresholds set) to break into parallel chunks and ALL transfer as fast as possible, utilising as much of the 10GbE as they can. Maybe I am missing fundamental principles in the way AFM works? Thanks. -jc -------------- next part -------------- An HTML attachment was scrubbed... URL: From mweil at wustl.edu Mon Nov 14 19:04:53 2016 From: mweil at wustl.edu (Matt Weil) Date: Mon, 14 Nov 2016 13:04:53 -0600 Subject: [gpfsug-discuss] dependency problem with python-dnspython and python-dns Message-ID: <790b2761-dfde-3047-9f85-3a38f16335f4@wustl.edu> > #manual install protocal nodes > yum install nfs-ganesha-2.3.2-0.ibm24_2.el7.x86_64 > nfs-ganesha-gpfs-2.3.2-0.ibm24_2.el7.x86_64 > nfs-ganesha-utils-2.3.2-0.ibm24_2.el7.x86_64 > gpfs.smb-4.3.11_gpfs_21-8.el7.x86_64 spectrum-scale-object-4.2.1-1.noarch > > there is a dependancy problem with python-dns > Transaction check error: > file /usr/lib/python2.7/site-packages/dns/__init__.pyc from install > of python-dnspython-1.11.1-1.ibm.noarch conflicts with file from > package python-dns-1.12.0-2.20150617git465785f.el7.noarch > file /usr/lib/python2.7/site-packages/dns/rdtypes/ANY/__init__.pyc > from install of python-dnspython-1.11.1-1.ibm.noarch conflicts with > file from package python-dns-1.12.0-2.20150617git465785f.el7.noarch > file /usr/lib/python2.7/site-packages/dns/rdtypes/IN/__init__.pyc > from install of python-dnspython-1.11.1-1.ibm.noarch conflicts with > file from package python-dns-1.12.0-2.20150617git465785f.el7.noarch > file /usr/lib/python2.7/site-packages/dns/rdtypes/__init__.pyc from > install of python-dnspython-1.11.1-1.ibm.noarch conflicts with file > from package python-dns-1.12.0-2.20150617git465785f.el7.noarch > file /usr/lib/python2.7/site-packages/dns/__init__.pyo from install > of python-dnspython-1.11.1-1.ibm.noarch conflicts with file from > package python-dns-1.12.0-2.20150617git465785f.el7.noarch > > yum remove python-dns-1.12.0-2.20150617git465785f.el7.noarch -y > yum install nfs-ganesha-2.3.2-0.ibm24_2.el7.x86_64 > nfs-ganesha-gpfs-2.3.2-0.ibm24_2.el7.x86_64 > nfs-ganesha-utils-2.3.2-0.ibm24_2.el7.x86_64 > gpfs.smb-4.3.11_gpfs_21-8.el7.x86_64 spectrum-scale-object-4.2.1-1.noarch > #install IPA with nodeps > yum install --downloadonly ipa-client > then > rpm -Uvh --nodeps > /var/cache/yum/x86_64/7Server/rhel-7-server-rpms/packages/*ipa* ________________________________ The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail. From Mark.Bush at siriuscom.com Mon Nov 14 19:48:04 2016 From: Mark.Bush at siriuscom.com (Mark.Bush at siriuscom.com) Date: Mon, 14 Nov 2016 19:48:04 +0000 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB In-Reply-To: References: Message-ID: <87251B6E-99A4-4493-9A28-1F794E624216@siriuscom.com> I don?t have the exact answer to this issue but I had dealt with something similar before. 
I?m thinking this may have something to do with NFSv4 needing to be kerberized to work with AD? Again, not really sure on the SpecScale specifics here but worth seeing if you need Kerberos as well to get this to authenticate properly with AD and NFSv4. From: on behalf of Andy Parker1 Reply-To: gpfsug main discussion list Date: Friday, November 11, 2016 at 10:20 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB We have setup a small cluster to test, play & learn about the protocol servers. We have setup mmuserauth for AD + RFC2307 and we can share and access data via SMB and access is on windows clients with no issues. The file DAC of a file created via windows looks like this from the SS cesNode: $ ls -l total 0 -rwxr--r-- 1 SPECTRUMSCALE\newmanjo SPECTRUMSCALE\ces-admins 33 Nov 10 17:29 helloworld.txt The NFS protocol is also exported for NFS 3,4 and when mount using NFS version '3' from an AIX 7.1 server I see also OK DAC names uid / group, so the UID mapping is working. The AIX is linked to the AD for LDAP account services and I can query accounts and get shell logon for accounts defined within AD for unix services. # ls -l ( from AIX client NFS V3) total 0 -rwxr--r-- 1 newmanjo ces-admi 33 10 Nov 17:29 helloworld.txt Now the Problem: When I mount the AIX client as NFS4 I do no see the user/group names. I know NFS4 passes names and not UID/GID numbers so I guess this is linked. # pwd /mnt/ibm/hurss/share1 # ls -l ( from AIX client NFS V4) total 0 -rwxr--r-- 1 nobody nobody 33 10 Nov 17:29 helloworld.txt On the AIX server I have set NFS domain to virtual1.com # chnfsdom Current local domain: virtual1.com This matches the DOMAIN from the mmnfs config list domain ( not 100% sure this is correct) [root at hurss4 ~]# mmnfs config list NFS Ganesha Configuration: ========================== NFS_PROTOCOLS: 3,4 NFS_PORT: 2049 MNT_PORT: 0 NLM_PORT: 0 RQUOTA_PORT: 0 SHORT_FILE_HANDLE: FALSE LEASE_LIFETIME: 60 DOMAINNAME: VIRTUAL1.COM DELEGATIONS: Disabled Also the 'nfsrgyd' a name translation service for NFS servers and clients is running. lssrc -s nfsrgyd Subsystem Group PID Status nfsrgyd nfs 8585412 active Summary / Question: Can anybody explain why I do not see userID / Group names when viewing via a NFS4 client and ideally how to fix this. Rgds Andy P Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From xhejtman at ics.muni.cz Mon Nov 14 19:57:22 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Mon, 14 Nov 2016 20:57:22 +0100 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB In-Reply-To: References: Message-ID: <20161114195722.v3vmflie7dji7tgk@ics.muni.cz> On Fri, Nov 11, 2016 at 04:20:52PM +0000, Andy Parker1 wrote: > When I mount the AIX client as NFS4 I do no see the user/group names. I > know NFS4 passes names and not UID/GID numbers so I > guess this is linked. > Can anybody explain why I do not see userID / Group names when viewing > via a NFS4 client and ideally how to fix this. Use tcpdump or something similar to catch the NFS traffic. From the client do: 1) mount 2) ls -l nfs_mount_dir See the dumped traffic (Wireshark is your friend) and see what names are in the dump. You can see either nobody at virtualdomain1, which means that the problem is on the server (i.e., the server is not able to translate UID/GID to a name, resulting in nobody/nogroup), or something like SPECTRUMSCALE\newmanjo at virtualdomain1. I would expect the latter case: you see 'nobody' because the client does not understand the SPECTRUMSCALE\newmanjo user. Perhaps set SMB so that the domain is stripped off from names, i.e., you should see only newmanjo instead of SPECTRUMSCALE\newmanjo on the server. -- Lukáš Hejtmánek From YARD at il.ibm.com Mon Nov 14 20:05:11 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Mon, 14 Nov 2016 22:05:11 +0200 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB In-Reply-To: <87251B6E-99A4-4493-9A28-1F794E624216@siriuscom.com> References: <87251B6E-99A4-4493-9A28-1F794E624216@siriuscom.com> Message-ID: Hi The CES protocol nodes are configured to get user data from AD, so all files there show as "DOMAIN\User". Files created over NFSv3 will have the same UID as on the CES nodes - but a different user name - and there is a mismatch when you work with NFSv4. Since NFSv4 checks for the "Domain\user" format, you must have the same username on both the server (CES) and client nodes. Now - if files were created from the CIFS share, I guess you will not have a problem defining inherited ACL permissions so that each file is created with Domain\User ownership, and when you mount it from NFSv3 it will take the UID - and have the right permissions. One more thing - in case you see permissions for files created from CIFS like this: d --- --- --- set the OWNER USER + OWNER GROUP inherited ACL permissions on the CIFS share; this will show you the right permissions when working with NFSv3. Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: "Mark.Bush at siriuscom.com" To: gpfsug main discussion list Date: 11/14/2016 09:48 PM Subject: Re: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org I don't have the exact answer to this issue but I had dealt with something similar before. I'm thinking this may have something to do with NFSv4 needing to be kerberized to work with AD? Again, not really sure on the SpecScale specifics here but worth seeing if you need Kerberos as well to get this to authenticate properly with AD and NFSv4.
From: on behalf of Andy Parker1 Reply-To: gpfsug main discussion list Date: Friday, November 11, 2016 at 10:20 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB We have setup a small cluster to test, play & learn about the protocol servers. We have setup mmuserauth for AD + RFC2307 and we can share and access data via SMB and access is on windows clients with no issues. The file DAC of a file created via windows looks like this from the SS cesNode: $ ls -l total 0 -rwxr--r-- 1 SPECTRUMSCALE\newmanjo SPECTRUMSCALE\ces-admins 33 Nov 10 17:29 helloworld.txt The NFS protocol is also exported for NFS 3,4 and when mount using NFS version '3' from an AIX 7.1 server I see also OK DAC names uid / group, so the UID mapping is working. The AIX is linked to the AD for LDAP account services and I can query accounts and get shell logon for accounts defined within AD for unix services. # ls -l ( from AIX client NFS V3) total 0 -rwxr--r-- 1 newmanjo ces-admi 33 10 Nov 17:29 helloworld.txt Now the Problem: When I mount the AIX client as NFS4 I do no see the user/group names. I know NFS4 passes names and not UID/GID numbers so I guess this is linked. # pwd /mnt/ibm/hurss/share1 # ls -l ( from AIX client NFS V4) total 0 -rwxr--r-- 1 nobody nobody 33 10 Nov 17:29 helloworld.txt On the AIX server I have set NFS domain to virtual1.com # chnfsdom Current local domain: virtual1.com This matches the DOMAIN from the mmnfs config list domain ( not 100% sure this is correct) [root at hurss4 ~]# mmnfs config list NFS Ganesha Configuration: ========================== NFS_PROTOCOLS: 3,4 NFS_PORT: 2049 MNT_PORT: 0 NLM_PORT: 0 RQUOTA_PORT: 0 SHORT_FILE_HANDLE: FALSE LEASE_LIFETIME: 60 DOMAINNAME: VIRTUAL1.COM DELEGATIONS: Disabled Also the 'nfsrgyd' a name translation service for NFS servers and clients is running. lssrc -s nfsrgyd Subsystem Group PID Status nfsrgyd nfs 8585412 active Summary / Question: Can anybody explain why I do not see userID / Group names when viewing via a NFS4 client and ideally how to fix this. Rgds Andy P Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: From chetkulk at in.ibm.com Tue Nov 15 06:00:41 2016 From: chetkulk at in.ibm.com (Chetan R Kulkarni) Date: Tue, 15 Nov 2016 11:30:41 +0530 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Message-ID: >> Summary / Question: >> Can anybody explain why I do not see userID / Group names when viewing >> via a NFS4 client and ideally how to fix this. This is not supported by Spectrum Scale (i.e. NFSv4 mount/access on AIX clients with AD+RFC2307 file authentication). Reason being AIX client integrates with AD like LDAP i.e. AIX client can't resolve the user in format "DOMAIN\user". NFSv4 server returns user in "DOMAIN\user" format and as AIX client doesn't understand "DOMAIN\user"; it translates to "nobody". Hence you see "nobody" under AIX NFSv4 mount. Please note that; with RHEL clients we see correct ownership under NFSv4 mounts. This is because RHEL clients integrate with AD as pure AD client (using winbind or SSSD) i.e. users resolve successfully in "DOMAIN\user" format on RHEL clients. Thanks, Chetan. -------------- next part -------------- An HTML attachment was scrubbed... URL: From michael.holliday at crick.ac.uk Tue Nov 15 09:47:33 2016 From: michael.holliday at crick.ac.uk (Michael Holliday) Date: Tue, 15 Nov 2016 09:47:33 +0000 Subject: [gpfsug-discuss] Quotas on Multiple Filesets Message-ID: Hey Everyone, I have a GPFS system which contain several groups of filesets. Each group has a root fileset, along with a number of other files sets. All of the filesets share the inode space with the root fileset. The file sets are linked to create a tree structure as shown: Fileset Root -> /root Fileset a -> /root/a Fileset B -> /root/b Fileset C -> /root/c I have applied a quota of 5TB to the root fileset. Could someone tell me if the quota will only take into account the files in the root fileset, or if it would include the sub filesets aswell. eg If have 3TB in A and 2TB in B - would that hit the 5TB quota on root? Thanks Michael The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 215 Euston Road, London NW1 2BE. -------------- next part -------------- An HTML attachment was scrubbed... URL: From andy_parker1 at uk.ibm.com Tue Nov 15 15:34:49 2016 From: andy_parker1 at uk.ibm.com (Andy Parker1) Date: Tue, 15 Nov 2016 15:34:49 +0000 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB In-Reply-To: References: Message-ID: Thanks for the responses, using iptrace on AIX I was able to confirm that indeed the following is passed and cannot be matched by the AIX NFSV4 client. SPECTRUMSCALE\testuser1 at virtual1.com . This is in the response packet sent back from the CES server to the AIX NFSV4 client. Sent by Spectrum CES SPECTRUMSCALE\testuser1 at virtual1.com Expected by AIX NFSV4 testuser1 at virtual1.com !!!!!!!! NO MATCH !!!!!!! 00000200 00000180 00000001 00000024 53504543 |...........$SPEC| 00000210 5452554d 5343414c 455c7465 73747573 |TRUMSCALE\testus| 00000220 65723140 76697274 75616c31 2e636f6d |er1 at virtual1.com| 00000230 0000001f 53504543 5452554d 5343414c |....SPECTRUMSCAL| 00000240 455c7465 73744076 69727475 616c312e |E\test at virtual1.| 00000250 636f6d00 00000000 00000000 00000000 |com.............| Out of interest I setup an AIX 7.1 NFSV4 Server and AIX 7.1 NFSV4 client both authenticating against the AD LDAP. This worked fine. 
I suspect this is because the AIX LDAP (Posix) does attribute mapping so we only see the UID not DOMAIN\uid .. vi /etc/security/ldap/ldap.cfg # AIX-LDAP attribute map path. userattrmappath:/etc/security/ldap/sfur2user.map groupattrmappath:/etc/security/ldap/sfur2group.map # grep -i uid sfur2user.map username SEC_CHAR uid s na yes id SEC_INT uidNumber s na yes I wonder if Solaris 10/11 and HP-UX 11 are also not supported using NFSv4. Does anyone know if the SpectrumScale CES (NFS/SMB) has a supported operating systems list published. I checked here but nothing found. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_authenticationlimitations.htm # Going Forward Initially we want to provide only NFS and SMB CesNode services. So we based our decision to use AD + RFC2307 based on this table, believing that it would provide what we need today and future proof us a little by potentially allowing expansion to OBJ in the future. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_authconcept.htm NFSv4 is pretty mandatory in our design, we want to get rid of using Netgroup's and NFS V3 UID/GID mapping which as weak security. Ideally on day one we would want NFSV4 and Kerberos to provide better security for our clients. Its also likely that in the future corporate security policies may ban netgroup's for NFS authorization so using NFSv4 + kerberos would position my department well for future changes. Based on the table I guess I need to setup LDAP / TLS / Kerberos as the authentication service which will cover all bases expect OBJECT. Thanks again for everyone's comments, this was my first post and the responses were all very welcome. Rgds Andy Andy Parker Cloud & Development Platforms (C&DP) Andy_Parker1 at uk.ibm.com Desk: DW1B14 Tel: 37-245326 (01962-815326) Post: MP100, IBM Hursley Park, Winchester, SO21 2JN From: "Chetan R Kulkarni" To: gpfsug-discuss at spectrumscale.org Date: 15/11/2016 06:01 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org >> Summary / Question: >> Can anybody explain why I do not see userID / Group names when viewing >> via a NFS4 client and ideally how to fix this. This is not supported by Spectrum Scale (i.e. NFSv4 mount/access on AIX clients with AD+RFC2307 file authentication). Reason being AIX client integrates with AD like LDAP i.e. AIX client can't resolve the user in format "DOMAIN\user". NFSv4 server returns user in "DOMAIN\user" format and as AIX client doesn't understand "DOMAIN\user"; it translates to "nobody". Hence you see "nobody" under AIX NFSv4 mount. Please note that; with RHEL clients we see correct ownership under NFSv4 mounts. This is because RHEL clients integrate with AD as pure AD client (using winbind or SSSD) i.e. users resolve successfully in "DOMAIN\user" format on RHEL clients. Thanks, Chetan._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jasonbennett at us.ibm.com Tue Nov 15 16:36:51 2016 From: jasonbennett at us.ibm.com (Jason Bennett) Date: Tue, 15 Nov 2016 11:36:51 -0500 Subject: [gpfsug-discuss] multipath.conf for EMC V-max Message-ID: Trying to help a customer resolve a multipath.conf issue.... I am at a major stopping point regarding my deployment on Linux.?? I?m working with our SAN Team to get a good /etc/multipath.conf configuration with a stanza for our EMC V-max SAN presented.????? I have PowerPath installed and the SAN disk can been seen from both linux nodes but I need to set three items in a custom devices stanza to ensure no disks locking.??? The three items required for concurrent disk without locks are:?? feature=0,?? failback=immediate & no_path_retry=fail.?? If you were to reply with an example of the multipath.conf where you?ve used EMC V-max I would just be tickled pink. Thanks. Jason Bennett IBM -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewahl at osc.edu Tue Nov 15 16:59:31 2016 From: ewahl at osc.edu (Wahl, Edward) Date: Tue, 15 Nov 2016 16:59:31 +0000 Subject: [gpfsug-discuss] multipath.conf for EMC V-max In-Reply-To: References: Message-ID: <9DA9EC7A281AC7428A9618AFDC4904995901FBDF@CIO-KRC-D1MBX02.osuad.osu.edu> Hey Jason if you want to get me an lsscsi output I can probably whip up a multi-path.conf block for your customer or talk to them on the phone if you like. Ed ----- Reply message ----- From: "Jason Bennett" To: "gpfsug main discussion list" Subject: [gpfsug-discuss] multipath.conf for EMC V-max Date: Tue, Nov 15, 2016 11:37 AM Trying to help a customer resolve a multipath.conf issue.... I am at a major stopping point regarding my deployment on Linux. I?m working with our SAN Team to get a good /etc/multipath.conf configuration with a stanza for our EMC V-max SAN presented. I have PowerPath installed and the SAN disk can been seen from both linux nodes but I need to set three items in a custom devices stanza to ensure no disks locking. The three items required for concurrent disk without locks are: feature=0, failback=immediate & no_path_retry=fail. If you were to reply with an example of the multipath.conf where you?ve used EMC V-max I would just be tickled pink. Thanks. Jason Bennett IBM -------------- next part -------------- An HTML attachment was scrubbed... URL: From mweil at wustl.edu Tue Nov 15 17:18:50 2016 From: mweil at wustl.edu (Matt Weil) Date: Tue, 15 Nov 2016 11:18:50 -0600 Subject: [gpfsug-discuss] multipath.conf for EMC V-max In-Reply-To: <9DA9EC7A281AC7428A9618AFDC4904995901FBDF@CIO-KRC-D1MBX02.osuad.osu.edu> References: <9DA9EC7A281AC7428A9618AFDC4904995901FBDF@CIO-KRC-D1MBX02.osuad.osu.edu> Message-ID: http://www.emc.com/collateral/TechnicalDocument/docu5128.pdf page 219 this is the default in rhel. device { vendor "EMC" product "SYMMETRIX" path_grouping_policy multibus getuid_callout "/lib/udev/scsi_id --page=pre-spc3-83 --whitelisted --device=/dev/%n" path_selector "round-robin 0" path_checker tur features "0" hardware_handler "0" prio const rr_weight uniform no_path_retry 6 rr_min_io 1000 rr_min_io_rq 1 } my defaults defaults { user_friendly_names yes find_multipaths yes udev_dir /dev polling_interval 60 path_grouping_policy multibus path_checker readsector0 rr_min_io_rq 100 rr_weight priorities failback immediate max_fds max features "0" } run multipathd show config to see the defaults. 
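For the three settings Jason listed, a devices override along these lines is probably what you want - treat it as a sketch built from the stock SYMMETRIX entry above rather than an EMC-blessed configuration, and check it against the EMC host connectivity guide for your array code level:

devices {
    device {
        vendor "EMC"
        product "SYMMETRIX"
        path_grouping_policy multibus
        path_checker tur
        features "0"
        hardware_handler "0"
        prio const
        failback immediate
        no_path_retry fail
        rr_min_io_rq 1
    }
}

After editing, reload with "multipathd reconfigure" and confirm the values took effect with "multipath -ll" and "multipathd show config". It is also worth double checking that PowerPath and dm-multipath are not both trying to manage the same LUNs - normally you run one or the other for a given set of devices.
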
On 11/15/16 10:59 AM, Wahl, Edward wrote: Hey Jason if you want to get me an lsscsi output I can probably whip up a multi-path.conf block for your customer or talk to them on the phone if you like. Ed ----- Reply message ----- From: "Jason Bennett" To: "gpfsug main discussion list" Subject: [gpfsug-discuss] multipath.conf for EMC V-max Date: Tue, Nov 15, 2016 11:37 AM Trying to help a customer resolve a multipath.conf issue.... I am at a major stopping point regarding my deployment on Linux. I?m working with our SAN Team to get a good /etc/multipath.conf configuration with a stanza for our EMC V-max SAN presented. I have PowerPath installed and the SAN disk can been seen from both linux nodes but I need to set three items in a custom devices stanza to ensure no disks locking. The three items required for concurrent disk without locks are: feature=0, failback=immediate & no_path_retry=fail. If you were to reply with an example of the multipath.conf where you?ve used EMC V-max I would just be tickled pink. Thanks. Jason Bennett IBM _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Chris.Schlipalius at pawsey.org.au Thu Nov 17 05:05:11 2016 From: Chris.Schlipalius at pawsey.org.au (Chris Schlipalius) Date: Thu, 17 Nov 2016 13:05:11 +0800 Subject: [gpfsug-discuss] Announcement of the next Australian SpectrumScale User Group - April 2017 (Sydney) Message-ID: Hello please see my announcement: http://www.spectrumscale.org/spectrum-scale-user-group-australia-meeting-syd ney-april-2017/ This is also a call for speakers submissions. Regards, Chris Schlipalius Senior Storage Infrastructure Specialist/Team Leader Pawsey Supercomputing Centre -------------- next part -------------- An HTML attachment was scrubbed... URL: From rkomandu at in.ibm.com Thu Nov 17 11:20:38 2016 From: rkomandu at in.ibm.com (Ravi K Komanduri) Date: Thu, 17 Nov 2016 16:50:38 +0530 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB In-Reply-To: References: Message-ID: Andy >> Does anyone know if the SpectrumScale CES (NFS/SMB) has a supported operating systems list published. I checked here but nothing found. S Scale CES side RHEL and SLES are supported as of date. Refer to the S Scale FAQ link ( http://www.ibm.com/support/knowledgecenter/STXKQY/ibmspectrumscale_welcome.html ) With Regards, Ravi K Komanduri From: Andy Parker1 To: gpfsug main discussion list Date: 11/15/2016 09:05 PM Subject: Re: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks for the responses, using iptrace on AIX I was able to confirm that indeed the following is passed and cannot be matched by the AIX NFSV4 client. SPECTRUMSCALE\testuser1 at virtual1.com . This is in the response packet sent back from the CES server to the AIX NFSV4 client. 
Sent by Spectrum CES SPECTRUMSCALE\testuser1 at virtual1.com Expected by AIX NFSV4 testuser1 at virtual1.com !!!!!!!! NO MATCH !!!!!!! 00000200 00000180 00000001 00000024 53504543 |...........$SPEC| 00000210 5452554d 5343414c 455c7465 73747573 |TRUMSCALE\testus| 00000220 65723140 76697274 75616c31 2e636f6d |er1 at virtual1.com| 00000230 0000001f 53504543 5452554d 5343414c |....SPECTRUMSCAL| 00000240 455c7465 73744076 69727475 616c312e |E\test at virtual1.| 00000250 636f6d00 00000000 00000000 00000000 |com.............| Out of interest I setup an AIX 7.1 NFSV4 Server and AIX 7.1 NFSV4 client both authenticating against the AD LDAP. This worked fine. I suspect this is because the AIX LDAP (Posix) does attribute mapping so we only see the UID not DOMAIN\uid .. vi /etc/security/ldap/ldap.cfg # AIX-LDAP attribute map path. userattrmappath:/etc/security/ldap/sfur2user.map groupattrmappath:/etc/security/ldap/sfur2group.map # grep -i uid sfur2user.map username SEC_CHAR uid s na yes id SEC_INT uidNumber s na yes I wonder if Solaris 10/11 and HP-UX 11 are also not supported using NFSv4. Does anyone know if the SpectrumScale CES (NFS/SMB) has a supported operating systems list published. I checked here but nothing found. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_authenticationlimitations.htm # Going Forward Initially we want to provide only NFS and SMB CesNode services. So we based our decision to use AD + RFC2307 based on this table, believing that it would provide what we need today and future proof us a little by potentially allowing expansion to OBJ in the future. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_authconcept.htm NFSv4 is pretty mandatory in our design, we want to get rid of using Netgroup's and NFS V3 UID/GID mapping which as weak security. Ideally on day one we would want NFSV4 and Kerberos to provide better security for our clients. Its also likely that in the future corporate security policies may ban netgroup's for NFS authorization so using NFSv4 + kerberos would position my department well for future changes. Based on the table I guess I need to setup LDAP / TLS / Kerberos as the authentication service which will cover all bases expect OBJECT. Thanks again for everyone's comments, this was my first post and the responses were all very welcome. Rgds Andy Andy Parker Cloud & Development Platforms (C&DP) Andy_Parker1 at uk.ibm.com Desk: DW1B14 Tel: 37-245326 (01962-815326) Post: MP100, IBM Hursley Park, Winchester, SO21 2JN From: "Chetan R Kulkarni" To: gpfsug-discuss at spectrumscale.org Date: 15/11/2016 06:01 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org >> Summary / Question: >> Can anybody explain why I do not see userID / Group names when viewing >> via a NFS4 client and ideally how to fix this. This is not supported by Spectrum Scale (i.e. NFSv4 mount/access on AIX clients with AD+RFC2307 file authentication). Reason being AIX client integrates with AD like LDAP i.e. AIX client can't resolve the user in format "DOMAIN\user". NFSv4 server returns user in "DOMAIN\user" format and as AIX client doesn't understand "DOMAIN\user"; it translates to "nobody". Hence you see "nobody" under AIX NFSv4 mount. Please note that; with RHEL clients we see correct ownership under NFSv4 mounts. This is because RHEL clients integrate with AD as pure AD client (using winbind or SSSD) i.e. 
users resolve successfully in "DOMAIN\user" format on RHEL clients. Thanks, Chetan._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From andy_parker1 at uk.ibm.com Thu Nov 17 11:56:41 2016 From: andy_parker1 at uk.ibm.com (Andy Parker1) Date: Thu, 17 Nov 2016 11:56:41 +0000 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB In-Reply-To: References: Message-ID: >>S Scale CES side RHEL and SLES are supported as of date. Thanks for the update, the SLES & RHEL are the supported platforms for the SES Servers agreed. My question was possibly not fair / clear, I was trying to establish what NFS clients are supported to connect to the CES devices. I configured 'SS' with mmuserauth for AD and RFC2307 support, making a dangerous assumption that the RFC2307 would mean I would be able to use any RFC2307 compliant client NFS for NFS V3 & V4. This was true for NFS V3 and we connected AIX & Linux with no issues. However our aim is to remove NFSv3 and provide only NFSv4 + kerberos support. With NFS V4 only Linux clients worked OK due to the use of 'SSSD'. So we are broken for NFSV4 in our diverse environment ( AIX*, SOLARIS*, HPUX*) for the ID mapping at NFSV4 becomes broken. Currently I am looking to reconfigure and address the AD server's LDAP and Kerberos components natively and so hopefully remove the need for 'SSSD'. So we plan to configure using mmuserauth -type LDAP and provide all the required parameters in steady of -type AD. Not 100% sure this will work, but this is what we are about to try. Rgds Andy Andy Parker Cloud & Development Platforms (C&DP) Andy_Parker1 at uk.ibm.com Desk: DW1B14 Tel: 37-245326 (01962-815326) Post: MP100, IBM Hursley Park, Winchester, SO21 2JN From: Ravi K Komanduri/India/IBM To: gpfsug main discussion list , Andy Parker1 Date: 17/11/2016 11:20 Subject: Re: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Andy >> Does anyone know if the SpectrumScale CES (NFS/SMB) has a supported operating systems list published. I checked here but nothing found. S Scale CES side RHEL and SLES are supported as of date. Refer to the S Scale FAQ link ( http://www.ibm.com/support/knowledgecenter/STXKQY/ibmspectrumscale_welcome.html ) With Regards, Ravi K Komanduri From: Andy Parker1 To: gpfsug main discussion list Date: 11/15/2016 09:05 PM Subject: Re: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks for the responses, using iptrace on AIX I was able to confirm that indeed the following is passed and cannot be matched by the AIX NFSV4 client. SPECTRUMSCALE\testuser1 at virtual1.com . This is in the response packet sent back from the CES server to the AIX NFSV4 client. Sent by Spectrum CES SPECTRUMSCALE\testuser1 at virtual1.com Expected by AIX NFSV4 testuser1 at virtual1.com !!!!!!!! NO MATCH !!!!!!! 
00000200 00000180 00000001 00000024 53504543 |...........$SPEC| 00000210 5452554d 5343414c 455c7465 73747573 |TRUMSCALE\testus| 00000220 65723140 76697274 75616c31 2e636f6d |er1 at virtual1.com| 00000230 0000001f 53504543 5452554d 5343414c |....SPECTRUMSCAL| 00000240 455c7465 73744076 69727475 616c312e |E\test at virtual1.| 00000250 636f6d00 00000000 00000000 00000000 |com.............| Out of interest I setup an AIX 7.1 NFSV4 Server and AIX 7.1 NFSV4 client both authenticating against the AD LDAP. This worked fine. I suspect this is because the AIX LDAP (Posix) does attribute mapping so we only see the UID not DOMAIN\uid .. vi /etc/security/ldap/ldap.cfg # AIX-LDAP attribute map path. userattrmappath:/etc/security/ldap/sfur2user.map groupattrmappath:/etc/security/ldap/sfur2group.map # grep -i uid sfur2user.map username SEC_CHAR uid s na yes id SEC_INT uidNumber s na yes I wonder if Solaris 10/11 and HP-UX 11 are also not supported using NFSv4. Does anyone know if the SpectrumScale CES (NFS/SMB) has a supported operating systems list published. I checked here but nothing found. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_authenticationlimitations.htm # Going Forward Initially we want to provide only NFS and SMB CesNode services. So we based our decision to use AD + RFC2307 based on this table, believing that it would provide what we need today and future proof us a little by potentially allowing expansion to OBJ in the future. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_authconcept.htm NFSv4 is pretty mandatory in our design, we want to get rid of using Netgroup's and NFS V3 UID/GID mapping which as weak security. Ideally on day one we would want NFSV4 and Kerberos to provide better security for our clients. Its also likely that in the future corporate security policies may ban netgroup's for NFS authorization so using NFSv4 + kerberos would position my department well for future changes. Based on the table I guess I need to setup LDAP / TLS / Kerberos as the authentication service which will cover all bases expect OBJECT. Thanks again for everyone's comments, this was my first post and the responses were all very welcome. Rgds Andy Andy Parker Cloud & Development Platforms (C&DP) Andy_Parker1 at uk.ibm.com Desk: DW1B14 Tel: 37-245326 (01962-815326) Post: MP100, IBM Hursley Park, Winchester, SO21 2JN From: "Chetan R Kulkarni" To: gpfsug-discuss at spectrumscale.org Date: 15/11/2016 06:01 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org >> Summary / Question: >> Can anybody explain why I do not see userID / Group names when viewing >> via a NFS4 client and ideally how to fix this. This is not supported by Spectrum Scale (i.e. NFSv4 mount/access on AIX clients with AD+RFC2307 file authentication). Reason being AIX client integrates with AD like LDAP i.e. AIX client can't resolve the user in format "DOMAIN\user". NFSv4 server returns user in "DOMAIN\user" format and as AIX client doesn't understand "DOMAIN\user"; it translates to "nobody". Hence you see "nobody" under AIX NFSv4 mount. Please note that; with RHEL clients we see correct ownership under NFSv4 mounts. This is because RHEL clients integrate with AD as pure AD client (using winbind or SSSD) i.e. users resolve successfully in "DOMAIN\user" format on RHEL clients. 
Thanks, Chetan._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From andy_parker1 at uk.ibm.com Thu Nov 17 15:17:48 2016 From: andy_parker1 at uk.ibm.com (Andy Parker1) Date: Thu, 17 Nov 2016 15:17:48 +0000 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB In-Reply-To: References: Message-ID: >>Currently I am looking to reconfigure and address the AD server's LDAP and Kerberos components natively and so hopefully remove the need >>for 'SSSD'. So we plan to configure using mmuserauth -type LDAP and provide all the required parameters in steady of -type AD. >>Not 100% sure this will work, but this is what we are about to try. Just to report back, you cannot just use --type ldap and point it at the AD ldap server (389 / 636). Its fails because mmuserauth expects the Samba schema and other pre-reqs to be in place. We do not wish to mess to much with our AD schema so we will drop this approach. Summary: Looks like we have the following options on our 'SS' CES nodes with AD RFC2307 in place: SMB to all windows clients NFS3 access to all RFC2307 clients NFS4 access to Linux clients only Using the OpenLDAP / MIT Kerberos Servers approach would create to much of an over head for our team to manage 1000's of users. Using AD pretty much looks after this for us today and we have tooling in place namely IBM's Identity Manager to automate the user management. Our only change needed on the AD was to enable UNIX Services RFC2307 to allow the ID-MAPPING. Rgds AndyP Andy Parker Cloud & Development Platforms (C&DP) Andy_Parker1 at uk.ibm.com Desk: DW1B14 Tel: 37-245326 (01962-815326) Post: MP100, IBM Hursley Park, Winchester, SO21 2JN From: Andy Parker1/UK/IBM at IBMGB To: gpfsug main discussion list Cc: Jo Woods/UK/IBM at IBMGB Date: 17/11/2016 11:57 Subject: Re: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org >>S Scale CES side RHEL and SLES are supported as of date. Thanks for the update, the SLES & RHEL are the supported platforms for the SES Servers agreed. My question was possibly not fair / clear, I was trying to establish what NFS clients are supported to connect to the CES devices. I configured 'SS' with mmuserauth for AD and RFC2307 support, making a dangerous assumption that the RFC2307 would mean I would be able to use any RFC2307 compliant client NFS for NFS V3 & V4. This was true for NFS V3 and we connected AIX & Linux with no issues. However our aim is to remove NFSv3 and provide only NFSv4 + kerberos support. With NFS V4 only Linux clients worked OK due to the use of 'SSSD'. So we are broken for NFSV4 in our diverse environment ( AIX*, SOLARIS*, HPUX*) for the ID mapping at NFSV4 becomes broken. 
Currently I am looking to reconfigure and address the AD server's LDAP and Kerberos components natively and so hopefully remove the need for 'SSSD'. So we plan to configure using mmuserauth -type LDAP and provide all the required parameters in steady of -type AD. Not 100% sure this will work, but this is what we are about to try. Rgds Andy Andy Parker Cloud & Development Platforms (C&DP) Andy_Parker1 at uk.ibm.com Desk: DW1B14 Tel: 37-245326 (01962-815326) Post: MP100, IBM Hursley Park, Winchester, SO21 2JN From: Ravi K Komanduri/India/IBM To: gpfsug main discussion list , Andy Parker1 Date: 17/11/2016 11:20 Subject: Re: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Andy >> Does anyone know if the SpectrumScale CES (NFS/SMB) has a supported operating systems list published. I checked here but nothing found. S Scale CES side RHEL and SLES are supported as of date. Refer to the S Scale FAQ link ( http://www.ibm.com/support/knowledgecenter/STXKQY/ibmspectrumscale_welcome.html ) With Regards, Ravi K Komanduri From: Andy Parker1 To: gpfsug main discussion list Date: 11/15/2016 09:05 PM Subject: Re: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks for the responses, using iptrace on AIX I was able to confirm that indeed the following is passed and cannot be matched by the AIX NFSV4 client. SPECTRUMSCALE\testuser1 at virtual1.com . This is in the response packet sent back from the CES server to the AIX NFSV4 client. Sent by Spectrum CES SPECTRUMSCALE\testuser1 at virtual1.com Expected by AIX NFSV4 testuser1 at virtual1.com !!!!!!!! NO MATCH !!!!!!! 00000200 00000180 00000001 00000024 53504543 |...........$SPEC| 00000210 5452554d 5343414c 455c7465 73747573 |TRUMSCALE\testus| 00000220 65723140 76697274 75616c31 2e636f6d |er1 at virtual1.com| 00000230 0000001f 53504543 5452554d 5343414c |....SPECTRUMSCAL| 00000240 455c7465 73744076 69727475 616c312e |E\test at virtual1.| 00000250 636f6d00 00000000 00000000 00000000 |com.............| Out of interest I setup an AIX 7.1 NFSV4 Server and AIX 7.1 NFSV4 client both authenticating against the AD LDAP. This worked fine. I suspect this is because the AIX LDAP (Posix) does attribute mapping so we only see the UID not DOMAIN\uid .. vi /etc/security/ldap/ldap.cfg # AIX-LDAP attribute map path. userattrmappath:/etc/security/ldap/sfur2user.map groupattrmappath:/etc/security/ldap/sfur2group.map # grep -i uid sfur2user.map username SEC_CHAR uid s na yes id SEC_INT uidNumber s na yes I wonder if Solaris 10/11 and HP-UX 11 are also not supported using NFSv4. Does anyone know if the SpectrumScale CES (NFS/SMB) has a supported operating systems list published. I checked here but nothing found. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_authenticationlimitations.htm # Going Forward Initially we want to provide only NFS and SMB CesNode services. So we based our decision to use AD + RFC2307 based on this table, believing that it would provide what we need today and future proof us a little by potentially allowing expansion to OBJ in the future. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_authconcept.htm NFSv4 is pretty mandatory in our design, we want to get rid of using Netgroup's and NFS V3 UID/GID mapping which as weak security. Ideally on day one we would want NFSV4 and Kerberos to provide better security for our clients. 
Its also likely that in the future corporate security policies may ban netgroup's for NFS authorization so using NFSv4 + kerberos would position my department well for future changes. Based on the table I guess I need to setup LDAP / TLS / Kerberos as the authentication service which will cover all bases expect OBJECT. Thanks again for everyone's comments, this was my first post and the responses were all very welcome. Rgds Andy Andy Parker Cloud & Development Platforms (C&DP) Andy_Parker1 at uk.ibm.com Desk: DW1B14 Tel: 37-245326 (01962-815326) Post: MP100, IBM Hursley Park, Winchester, SO21 2JN From: "Chetan R Kulkarni" To: gpfsug-discuss at spectrumscale.org Date: 15/11/2016 06:01 Subject: [gpfsug-discuss] SS 4.2.1 + CES NFS / SMB Sent by: gpfsug-discuss-bounces at spectrumscale.org >> Summary / Question: >> Can anybody explain why I do not see userID / Group names when viewing >> via a NFS4 client and ideally how to fix this. This is not supported by Spectrum Scale (i.e. NFSv4 mount/access on AIX clients with AD+RFC2307 file authentication). Reason being AIX client integrates with AD like LDAP i.e. AIX client can't resolve the user in format "DOMAIN\user". NFSv4 server returns user in "DOMAIN\user" format and as AIX client doesn't understand "DOMAIN\user"; it translates to "nobody". Hence you see "nobody" under AIX NFSv4 mount. Please note that; with RHEL clients we see correct ownership under NFSv4 mounts. This is because RHEL clients integrate with AD as pure AD client (using winbind or SSSD) i.e. users resolve successfully in "DOMAIN\user" format on RHEL clients. Thanks, Chetan._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From abeattie at au1.ibm.com Thu Nov 17 22:55:41 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Thu, 17 Nov 2016 22:55:41 +0000 Subject: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data Message-ID: An HTML attachment was scrubbed... URL: From kevindjo at us.ibm.com Thu Nov 17 23:03:35 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Thu, 17 Nov 2016 23:03:35 +0000 Subject: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data In-Reply-To: Message-ID: Take a look at IBM Spectrum LSF or Spectrum Analytics. The raw data can be provided by Scale's perfmon data but the above solutions can graph and report on it. Kevin D. 
Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage IBM Certified Solution Advisor - Spectrum Computing V1 720-349-6199 - kevindjo at us.ibm.com > On Nov 17, 2016, at 5:56 PM, Andrew Beattie wrote: > > Good Morning, > > > I have a large managed services provider in Australia who are looking at the benefits of deploying Scale for a combination of Object storage and basic SMB file access. This data is typically historical data rather than highly accessed production data and the proposed services is designed to be a low cost option - think a private version of Amazon's Glacier type offering. The proposed solution will have Platinum - Flash, Gold - SAS, Silver - NL-SAS and Bronze - Tape tiers with different cost's per Tier > > One of the questions that they have asked is, how can they on a regular basis (6-10min increments), poll the storage array to determine what capacity is stored in what tier of disk, by company / user, and export the results (ideally via an API) into their Reporting and Billing system. They do something similar today for their VMWare farm (6 minute increments), to provide accountability for the number of virtual machines they are providing, and would like to extend this capability to their file storage offering, which today is based on basic virtual windows file servers > > Is anyone doing something similar today? and if so at what granularity? > > Andrew Beattie > Software Defined Storage - IT Specialist > Phone: 614-2133-7927 > E-mail: abeattie at au1.ibm.com > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kevindjo at us.ibm.com Thu Nov 17 23:09:58 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Thu, 17 Nov 2016 23:09:58 +0000 Subject: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data In-Reply-To: Message-ID: LSF RTM, that is. Sorry on the NY Subway. Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage IBM Certified Solution Advisor - Spectrum Computing V1 720-349-6199 - kevindjo at us.ibm.com > On Nov 17, 2016, at 6:03 PM, Kevin D Johnson wrote: > > Take a look at IBM Spectrum LSF or Spectrum Analytics. The raw data can be provided by Scale's perfmon data but the above solutions can graph and report on it. > > Kevin D. Johnson, MBA, MAFM > Spectrum Computing, Senior Managing Consultant > > IBM Certified Deployment Professional - Spectrum Scale V4.1.1 > IBM Certified Deployment Professional - Cloud Object Storage > IBM Certified Solution Advisor - Spectrum Computing V1 > > 720-349-6199 - kevindjo at us.ibm.com > >> On Nov 17, 2016, at 5:56 PM, Andrew Beattie wrote: >> >> Good Morning, >> >> >> I have a large managed services provider in Australia who are looking at the benefits of deploying Scale for a combination of Object storage and basic SMB file access. This data is typically historical data rather than highly accessed production data and the proposed services is designed to be a low cost option - think a private version of Amazon's Glacier type offering. 
The proposed solution will have Platinum - Flash, Gold - SAS, Silver - NL-SAS and Bronze - Tape tiers with different cost's per Tier >> >> One of the questions that they have asked is, how can they on a regular basis (6-10min increments), poll the storage array to determine what capacity is stored in what tier of disk, by company / user, and export the results (ideally via an API) into their Reporting and Billing system. They do something similar today for their VMWare farm (6 minute increments), to provide accountability for the number of virtual machines they are providing, and would like to extend this capability to their file storage offering, which today is based on basic virtual windows file servers >> >> Is anyone doing something similar today? and if so at what granularity? >> >> Andrew Beattie >> Software Defined Storage - IT Specialist >> Phone: 614-2133-7927 >> E-mail: abeattie at au1.ibm.com >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Fri Nov 18 19:05:39 2016 From: Valdis.Kletnieks at vt.edu (Valdis Kletnieks) Date: Fri, 18 Nov 2016 14:05:39 -0500 Subject: [gpfsug-discuss] mmchdisk performance/behavior in a stretch cluster config? Message-ID: <121817.1479495939@turing-police.cc.vt.edu> So as a basis for our archive solution, we're using a GPFS cluster in a stretch configuration, with 2 sites separated by about 20ms worth of 10G link. Each end has 2 protocol servers doing NFS and 3 NSD servers. Identical disk arrays and LTFS/EE at both ends, and all metadata and userdata are replicated to both sites. We had a fiber issue for about 8 hours yesterday, and as expected (since there are only 5 quorum nodes, 3 local and 2 at the far end) the far end fell off the cluster and down'ed all the NSDs on the remote arrays. There's about 123T of data at each end, 6 million files in there so far. So after the fiber came back up after a several-hour downtime, I did the 'mmchdisk archive start -a'. That was at 17:45 yesterday. I'm now 20 hours in, at: 62.15 % complete on Fri Nov 18 13:52:59 2016 ( 4768429 inodes with total 173675926 MB data processed) 62.17 % complete on Fri Nov 18 13:53:20 2016 ( 4769416 inodes with total 173710731 MB data processed) 62.18 % complete on Fri Nov 18 13:53:40 2016 ( 4772481 inodes with total 173762456 MB data processed) network statistics indicate that the 3 local NSDs are all tossing out packets at about 400Mbytes/second, which means the 10G pipe is pretty damned close to totally packed full, and the 3 remotes are sending back ACKs of all the data. Rough back-of-envelop calculations indicate that (a) if I'm at 62% after 20 hours, it will take 30 hours to finish and (b) a 10G link takes about 29 hours at full blast to move 123T of data. So it certainly *looks* like it's resending everything. And that's even though at least 100T of that 123T is test data that was written by one of our users back on Nov 12/13, and thus theoretically *should* already have been at the remote site. Any ideas what's going on here? From aaron.s.knister at nasa.gov Fri Nov 18 19:21:54 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. 
(GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Fri, 18 Nov 2016 19:21:54 +0000 Subject: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data References: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data Message-ID: <5F910253243E6A47B81A9A2EB424BBA101DF8795@NDMSMBX404.ndc.nasa.gov> I believe ARCAStream has a product that could facilitate this also. I also believe their engineers are on the list. From: Andrew Beattie Sent: 11/17/16, 3:56 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data Good Morning, I have a large managed services provider in Australia who are looking at the benefits of deploying Scale for a combination of Object storage and basic SMB file access. This data is typically historical data rather than highly accessed production data and the proposed services is designed to be a low cost option - think a private version of Amazon's Glacier type offering. The proposed solution will have Platinum - Flash, Gold - SAS, Silver - NL-SAS and Bronze - Tape tiers with different cost's per Tier One of the questions that they have asked is, how can they on a regular basis (6-10min increments), poll the storage array to determine what capacity is stored in what tier of disk, by company / user, and export the results (ideally via an API) into their Reporting and Billing system. They do something similar today for their VMWare farm (6 minute increments), to provide accountability for the number of virtual machines they are providing, and would like to extend this capability to their file storage offering, which today is based on basic virtual windows file servers Is anyone doing something similar today? and if so at what granularity? Andrew Beattie Software Defined Storage - IT Specialist Phone: 614-2133-7927 E-mail: abeattie at au1.ibm.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From bevans at pixitmedia.com Fri Nov 18 19:58:26 2016 From: bevans at pixitmedia.com (Barry Evans) Date: Fri, 18 Nov 2016 19:58:26 +0000 Subject: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101DF8795@NDMSMBX404.ndc.nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101DF8795@NDMSMBX404.ndc.nasa.gov> Message-ID: Thanks Aaron, We do indeed have some analytics tools based on our python API that can extract much of this info in a nice, easy to work with format. 6-10 minute increments might be slightly aggressive depending on the metadata spec and the number of object on the file system, but it's certainly doable. Andrew, feel free to contact us at info at arcastream.com if we can help - Sounds like a great use case. -- Barry Evans CTO & Co-Founder Pixit Media/ArcaStream Mobile: +44 (0)7950 666 248 http://www.pixitmedia.com http://www.arcastream.com On 18/11/2016 19:21, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP] wrote: > I believe ARCAStream has a product that could facilitate this also. I > also believe their engineers are on the list. 
> > > > *From:*Andrew Beattie > *Sent:* 11/17/16, 3:56 PM > *To:* gpfsug main discussion list > *Subject:* [gpfsug-discuss] Is anyone performing any kind of Charge > back / Show back on Scale today and how do you collect the data > > Good Morning, > I have a large managed services provider in Australia who are looking > at the benefits of deploying Scale for a combination of Object storage > and basic SMB file access. This data is typically historical data > rather than highly accessed production data and the proposed services > is designed to be a low cost option - think a private version of > Amazon's Glacier type offering. The proposed solution will have > Platinum - Flash, Gold - SAS, Silver - NL-SAS and Bronze - Tape > tiers with different cost's per Tier > One of the questions that they have asked is, how can they on a > regular basis (6-10min increments), poll the storage array to > determine what capacity is stored in what tier of disk, by company / > user, and export the results (ideally via an API) into their > Reporting and Billing system. They do something similar today for > their VMWare farm (6 minute increments), to provide accountability for > the number of virtual machines they are providing, and would like to > extend this capability to their file storage offering, which today is > based on basic virtual windows file servers > Is anyone doing something similar today? and if so at what granularity? > Andrew Beattie > Software Defined Storage - IT Specialist > Phone: 614-2133-7927 > E-mail: abeattie at au1.ibm.com > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From chekh at stanford.edu Sat Nov 19 00:00:04 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Fri, 18 Nov 2016 16:00:04 -0800 Subject: [gpfsug-discuss] Is anyone performing any kind of Charge back / Show back on Scale today and how do you collect the data In-Reply-To: References: Message-ID: On 11/17/2016 02:55 PM, Andrew Beattie wrote: > Good Morning, > > > I have a large managed services provider in Australia who are looking at > the benefits of deploying Scale for a combination of Object storage and > basic SMB file access. This data is typically historical data rather > than highly accessed production data and the proposed services is > designed to be a low cost option - think a private version of Amazon's > Glacier type offering. The proposed solution will have Platinum - > Flash, Gold - SAS, Silver - NL-SAS and Bronze - Tape tiers with > different cost's per Tier > ... > > Is anyone doing something similar today? and if so at what granularity? We put different customers into their own filesets, and we put hard quotas on the filesets and we charge by the allocated quota and not by the current "usage". I.e. 
their quota is their "usage". IIRC billing is monthy and quota adjustments are manual and infrequent, and I'm guessing the adjustments are pro-rated. -- Alex Chekholko chekh at stanford.edu From olaf.weiser at de.ibm.com Sat Nov 19 07:39:56 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Sat, 19 Nov 2016 08:39:56 +0100 Subject: [gpfsug-discuss] mmchdisk performance/behavior in a stretch cluster config? In-Reply-To: <121817.1479495939@turing-police.cc.vt.edu> References: <121817.1479495939@turing-police.cc.vt.edu> Message-ID: An HTML attachment was scrubbed... URL: From SAnderson at convergeone.com Wed Nov 23 01:57:51 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Wed, 23 Nov 2016 01:57:51 +0000 Subject: [gpfsug-discuss] SS and TCT setup Message-ID: <6cbcd68ae8f74a0980b0d3a1cb84699c@NACR502.nacr.com> I have a lab environment/sandbox I'm trying to setup TCT. I am getting the error: [root at gpfs42-2 gpfs_rpms]# mmchnode --cloud-gateway-enable -N gpfs42-2 mmchnode: [E] To enable Transparent Cloud Tiering nodes, you must first enable the Transparent Cloud Tiering feature. This feature provides a new level of storage tiering capability to the IBM Spectrum Scale customer. Please contact your IBM Client Technical Specialist (or send an email to scale at us.ibm.com) to review your use case of the Transparent Cloud Tiering feature and to obtain the instructions to enable the feature in your environment. mmchnode: Command failed. Examine previous error messages to determine cause. [root at gpfs42-2 gpfs_rpms]# Does anybody know what the magic is to get this enabled? I'm finding all references point to email scale at us.ibm.com and haven't received a reply. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... URL: From kenh at us.ibm.com Wed Nov 23 03:48:17 2016 From: kenh at us.ibm.com (Ken Hill) Date: Tue, 22 Nov 2016 22:48:17 -0500 Subject: [gpfsug-discuss] SS and TCT setup In-Reply-To: <6cbcd68ae8f74a0980b0d3a1cb84699c@NACR502.nacr.com> References: <6cbcd68ae8f74a0980b0d3a1cb84699c@NACR502.nacr.com> Message-ID: Shaun, mmchconfig tctEnable=yes Ken Hill Software Defined Solutions IBM Systems Phone: 1-540-207-7270 E-mail: kenh at us.ibm.com 2300 Dulles Station Blvd Herndon, VA 20171-6133 United States From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 11/22/2016 08:58 PM Subject: [gpfsug-discuss] SS and TCT setup Sent by: gpfsug-discuss-bounces at spectrumscale.org I have a lab environment/sandbox I'm trying to setup TCT. I am getting the error: [root at gpfs42-2 gpfs_rpms]# mmchnode --cloud-gateway-enable -N gpfs42-2 mmchnode: [E] To enable Transparent Cloud Tiering nodes, you must first enable the Transparent Cloud Tiering feature. This feature provides a new level of storage tiering capability to the IBM Spectrum Scale customer. Please contact your IBM Client Technical Specialist (or send an email to scale at us.ibm.com) to review your use case of the Transparent Cloud Tiering feature and to obtain the instructions to enable the feature in your environment. mmchnode: Command failed. Examine previous error messages to determine cause. 
[root at gpfs42-2 gpfs_rpms]# Does anybody know what the magic is to get this enabled? I'm finding all references point to email scale at us.ibm.com and haven't received a reply. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1620 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1596 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1071 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 978 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1563 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1312 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1167 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1425 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1368 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 1243 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 4453 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Tue Nov 29 09:59:38 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 29 Nov 2016 09:59:38 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL Message-ID: All, As a general rule, when updating GPFS to a newer release, would you perform a full OS update at the same time, and/or update the kernel too? Just trying to gauge what other people do in this respect. Personally I've always upgraded everything at once - including kernel. Am I looking for trouble? Cheers Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From luis.bolinches at fi.ibm.com Tue Nov 29 10:19:59 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Tue, 29 Nov 2016 10:19:59 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... 
URL: From janfrode at tanso.net Tue Nov 29 10:22:41 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Tue, 29 Nov 2016 11:22:41 +0100 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: Message-ID: I think GPFS upgrades are a fine opportunity to check the FAQ and update to latest tested/supported OS versions. But please remember to check all components in the "Functional Support Matrices", and latest kernel tested. -jf On Tue, Nov 29, 2016 at 10:59 AM, Sobey, Richard A wrote: > All, > > > > As a general rule, when updating GPFS to a newer release, would you > perform a full OS update at the same time, and/or update the kernel too? > > > > Just trying to gauge what other people do in this respect. Personally I?ve > always upgraded everything at once ? including kernel. Am I looking for > trouble? > > > > Cheers > > Richard > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Tue Nov 29 14:25:03 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 29 Nov 2016 14:25:03 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: Message-ID: Thank you both. The FAQ simply suggests ?keep your OS up to date? and the referenced minimum kernel version is the one we?re already running so I?ll stick with that for now. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jan-Frode Myklebust Sent: 29 November 2016 10:23 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL I think GPFS upgrades are a fine opportunity to check the FAQ and update to latest tested/supported OS versions. But please remember to check all components in the "Functional Support Matrices", and latest kernel tested. -jf On Tue, Nov 29, 2016 at 10:59 AM, Sobey, Richard A > wrote: All, As a general rule, when updating GPFS to a newer release, would you perform a full OS update at the same time, and/or update the kernel too? Just trying to gauge what other people do in this respect. Personally I?ve always upgraded everything at once ? including kernel. Am I looking for trouble? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Tue Nov 29 14:33:14 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Tue, 29 Nov 2016 14:33:14 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: Message-ID: Hi Richard, I would echo the previous comment about having a test cluster where you at least do some basic functionality testing. Also, as I?m sure you?re well aware, a kernel upgrade - whether or not you?re upgrading GPFS versions - is an especially good idea on RHEL systems right now thanks to ?Dirty Cow?. We do not generally install every GPFS PTF as it comes out ? mainly we install PTF?s that fix problems we are encountering ? but when we are upgrading GPFS we also generally take the opportunity to do a full yum update as well. HTHAL? Kevin On Nov 29, 2016, at 8:25 AM, Sobey, Richard A > wrote: Thank you both. 
The FAQ simply suggests "keep your OS up to date" and the referenced minimum kernel version is the one we're already running so I'll stick with that for now. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jan-Frode Myklebust Sent: 29 November 2016 10:23 To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL I think GPFS upgrades are a fine opportunity to check the FAQ and update to latest tested/supported OS versions. But please remember to check all components in the "Functional Support Matrices", and latest kernel tested. -jf On Tue, Nov 29, 2016 at 10:59 AM, Sobey, Richard A > wrote: All, As a general rule, when updating GPFS to a newer release, would you perform a full OS update at the same time, and/or update the kernel too? Just trying to gauge what other people do in this respect. Personally I've always upgraded everything at once - including kernel. Am I looking for trouble? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Tue Nov 29 18:27:49 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Tue, 29 Nov 2016 18:27:49 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: , Message-ID: We typically have rolling updates to oses, with the exception of a couple of packages, like the kernel. Partly because we have to keep Scale in line and partly because we use Mellanox OFED and have had issues getting openibd working properly with updates, so we tend to push a kernel, ofed, gpfs update as an os reinstall. We test that on a subset of systems before rolling up. Where we push protocol updates, we have to time a service outage to ensure we can upgrade all smb packages at the same time. We do minor gpfs point releases in planned at risk windows. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 29 November 2016 14:25 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL Thank you both. The FAQ simply suggests "keep your OS up to date" and the referenced minimum kernel version is the one we're already running so I'll stick with that for now. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jan-Frode Myklebust Sent: 29 November 2016 10:23 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL I think GPFS upgrades are a fine opportunity to check the FAQ and update to latest tested/supported OS versions. But please remember to check all components in the "Functional Support Matrices", and latest kernel tested. -jf On Tue, Nov 29, 2016 at 10:59 AM, Sobey, Richard A > wrote: All, As a general rule, when updating GPFS to a newer release, would you perform a full OS update at the same time, and/or update the kernel too? Just trying to gauge what other people do in this respect. Personally I've always upgraded everything at once - including kernel. Am I looking for trouble? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss
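A practical note on the kernel piece of the upgrades discussed above: on RHEL the GPFS portability layer has to be rebuilt against the new kernel (or a matching prebuilt gplbin package installed) before GPFS will start again, which is why the kernel update and the Scale update usually land in the same maintenance window. A minimal per-node sketch, assuming a Scale level recent enough to include mmbuildgpl (on older levels the equivalent is the make Autoconfig / World / InstallImages steps under /usr/lpp/mmfs/src) and that the node can be taken out of service; everything else here is illustrative:

/usr/lpp/mmfs/bin/mmshutdown                    # stop GPFS on this node
yum update kernel kernel-devel kernel-headers   # new kernel plus matching devel/headers
reboot
# after the reboot, rebuild the portability layer against the running kernel
/usr/lpp/mmfs/bin/mmbuildgpl
/usr/lpp/mmfs/bin/mmstartup                     # bring GPFS back up
/usr/lpp/mmfs/bin/mmgetstate -a                 # confirm the node rejoins the cluster

Checking uname -r against the kernel levels listed in the Scale FAQ before the update, as suggested earlier in the thread, avoids landing on a kernel that has not been tested with the release being installed.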
From kevindjo at us.ibm.com Tue Nov 29 18:47:37 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 29 Nov 2016 18:47:37 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: From luis.bolinches at fi.ibm.com Tue Nov 29 19:08:43 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Tue, 29 Nov 2016 19:08:43 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: , , Message-ID: An HTML attachment was scrubbed... URL: From nathan.harper at cfms.org.uk Tue Nov 29 20:44:17 2016 From: nathan.harper at cfms.org.uk (Nathan Harper) Date: Tue, 29 Nov 2016 20:44:17 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: Message-ID: <904EEBB5-E1DD-4606-993F-7E91ADA1FC37@cfms.org.uk> This is the first I've heard of this max_sectors_kb issue, has it already been discussed on the list? Can you point me to any more info? > On 29 Nov 2016, at 19:08, Luis Bolinches wrote: > > Seen that one on 6.8 too > > the 4096 does NOT work; if storage is XIV then it is 1024 > > > -- > Ystävällisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: "Kevin D Johnson" > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug-discuss at spectrumscale.org > Cc: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL > Date: Tue, Nov 29, 2016 8:48 PM > > I have run into the max_sectors_kb issue and creating a file system when moving beyond 3.10.0-327 on RH 7.2 as well. You either have to reinstall the OS or walk the kernel back to 327 via: > > https://access.redhat.com/solutions/186763 > > Kevin D. Johnson, MBA, MAFM > Spectrum Computing, Senior Managing Consultant > > IBM Certified Deployment Professional - Spectrum Scale V4.1.1 > IBM Certified Deployment Professional - Cloud Object Storage V3.8 > IBM Certified Solution Advisor - Spectrum Computing V1 > > 720.349.6199 - kevindjo at us.ibm.com > > > > ----- Original message ----- > From: "Luis Bolinches" > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug-discuss at spectrumscale.org > Cc: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL > Date: Tue, Nov 29, 2016 5:20 AM > > My 2 cents > > And I am sure different people have different opinions. > > New kernels might be problematic. > > Now got my fun with RHEL 7.3 kernel and max_sectors_kb for new FS. It is something that will come to the FAQ soon. It is already in draft, not public. > > I guess whatever you do .... get a TEST cluster and do it there first, that is the best advice I could give. > > > -- > Ystävällisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have."
Anonymous > > > ----- Original message ----- > From: "Sobey, Richard A" > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: "'gpfsug-discuss at spectrumscale.org'" > Cc: > Subject: [gpfsug-discuss] Upgrading kernel on RHEL > Date: Tue, Nov 29, 2016 11:59 AM > > All, > > > > As a general rule, when updating GPFS to a newer release, would you perform a full OS update at the same time, and/or update the kernel too? > > > > Just trying to gauge what other people do in this respect. Personally I've always upgraded everything at once - including kernel. Am I looking for trouble? > > > > Cheers > > Richard > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > Ellei edellä ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From luis.bolinches at fi.ibm.com Tue Nov 29 20:56:25 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Tue, 29 Nov 2016 20:56:25 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: <904EEBB5-E1DD-4606-993F-7E91ADA1FC37@cfms.org.uk> References: <904EEBB5-E1DD-4606-993F-7E91ADA1FC37@cfms.org.uk>, Message-ID: An HTML attachment was scrubbed... URL: From nathan.harper at cfms.org.uk Tue Nov 29 20:59:45 2016 From: nathan.harper at cfms.org.uk (Nathan Harper) Date: Tue, 29 Nov 2016 20:59:45 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: <904EEBB5-E1DD-4606-993F-7E91ADA1FC37@cfms.org.uk> Message-ID: Ah, so an issue with the NSD nodes, as opposed to the clients? > On 29 Nov 2016, at 20:56, Luis Bolinches wrote: > > It's been around in certain cases, some kernel <-> storage combinations get hit, some not > > Scott referenced it here https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General+Parallel+File+System+%28GPFS%29/page/Storage+with+GPFS+on+Linux > > https://access.redhat.com/solutions/2437991 > > It happens also on 7.2 and 7.3 ppc64 (not yet on the list of "supported"); it does not on 7.1. I can confirm this at least for XIV storage, that it can go up to 1024 only. > > I know the FAQ will get updated about this, at least there is a CMVC that states so. > > Long story short, you create a FS, and you see all your paths die and recover and die and recover and ..., one after another. And it never really gets done. Also if you boot from SAN ... well you can figure it out ;)
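For reference, the value being discussed is exposed per block device under sysfs, so an affected kernel/storage combination can be spotted before it starts taking paths down during file system creation. A rough illustration only; the device name, the 1024 cap and the rule file name are examples, not a general recommendation, so check what the storage vendor states for the array:

cat /sys/block/sdX/queue/max_sectors_kb       # current per-device I/O size limit
cat /sys/block/sdX/queue/max_hw_sectors_kb    # hardware ceiling reported for the device
# a udev rule such as /etc/udev/rules.d/99-max-sectors.rules can pin the value at boot,
# e.g. capping SCSI disks at 1024 KB as described above for XIV:
ACTION=="add|change", SUBSYSTEM=="block", KERNEL=="sd*", ATTR{queue/max_sectors_kb}="1024"
udevadm control --reload-rules && udevadm trigger --subsystem-match=block   # apply without a reboot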
> > > -- > Ystävällisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Nathan Harper > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL > Date: Tue, Nov 29, 2016 10:44 PM > > This is the first I've heard of this max_sectors_kb issue, has it already been discussed on the list? Can you point me to any more info? > > > >> On 29 Nov 2016, at 19:08, Luis Bolinches wrote: >> >> Seen that one on 6.8 too >> >> the 4096 does NOT work; if storage is XIV then it is 1024 >> >> >> -- >> Ystävällisin terveisin / Kind regards / Saludos cordiales / Salutations >> >> Luis Bolinches >> Lab Services >> http://www-03.ibm.com/systems/services/labservices/ >> >> IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland >> Phone: +358 503112585 >> >> "If you continually give you will continually have." Anonymous >> >> >> ----- Original message ----- >> From: "Kevin D Johnson" >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> To: gpfsug-discuss at spectrumscale.org >> Cc: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL >> Date: Tue, Nov 29, 2016 8:48 PM >> >> I have run into the max_sectors_kb issue and creating a file system when moving beyond 3.10.0-327 on RH 7.2 as well. You either have to reinstall the OS or walk the kernel back to 327 via: >> >> https://access.redhat.com/solutions/186763 >> >> Kevin D. Johnson, MBA, MAFM >> Spectrum Computing, Senior Managing Consultant >> >> IBM Certified Deployment Professional - Spectrum Scale V4.1.1 >> IBM Certified Deployment Professional - Cloud Object Storage V3.8 >> IBM Certified Solution Advisor - Spectrum Computing V1 >> >> 720.349.6199 - kevindjo at us.ibm.com >> >> >> >> ----- Original message ----- >> From: "Luis Bolinches" >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> To: gpfsug-discuss at spectrumscale.org >> Cc: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] Upgrading kernel on RHEL >> Date: Tue, Nov 29, 2016 5:20 AM >> >> My 2 cents >> >> And I am sure different people have different opinions. >> >> New kernels might be problematic. >> >> Now got my fun with RHEL 7.3 kernel and max_sectors_kb for new FS. It is something that will come to the FAQ soon. It is already in draft, not public. >> >> I guess whatever you do .... get a TEST cluster and do it there first, that is the best advice I could give. >> >> >> -- >> Ystävällisin terveisin / Kind regards / Saludos cordiales / Salutations >> >> Luis Bolinches >> Lab Services >> http://www-03.ibm.com/systems/services/labservices/ >> >> IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland >> Phone: +358 503112585 >> >> "If you continually give you will continually have."
Anonymous >> >> >> ----- Original message ----- >> From: "Sobey, Richard A" >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> To: "'gpfsug-discuss at spectrumscale.org'" >> Cc: >> Subject: [gpfsug-discuss] Upgrading kernel on RHEL >> Date: Tue, Nov 29, 2016 11:59 AM >> >> All, >> >> >> >> As a general rule, when updating GPFS to a newer release, would you perform a full OS update at the same time, and/or update the kernel too? >> >> >> >> Just trying to gauge what other people do in this respect. Personally I've always upgraded everything at once - including kernel. Am I looking for trouble? >> >> >> >> Cheers >> >> Richard >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> Ellei edellä ole toisin mainittu: / Unless stated otherwise above: >> Oy IBM Finland Ab >> PL 265, 00101 Helsinki, Finland >> Business ID, Y-tunnus: 0195876-3 >> Registered in Finland >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> Ellei edellä ole toisin mainittu: / Unless stated otherwise above: >> Oy IBM Finland Ab >> PL 265, 00101 Helsinki, Finland >> Business ID, Y-tunnus: 0195876-3 >> Registered in Finland >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > Ellei edellä ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From luis.bolinches at fi.ibm.com Tue Nov 29 21:01:39 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Tue, 29 Nov 2016 21:01:39 +0000 Subject: [gpfsug-discuss] Upgrading kernel on RHEL In-Reply-To: References: , <904EEBB5-E1DD-4606-993F-7E91ADA1FC37@cfms.org.uk>, Message-ID: An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Wed Nov 30 14:34:07 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 30 Nov 2016 14:34:07 +0000 Subject: [gpfsug-discuss] Strategies - servers with local SAS disks Message-ID: <528C481B-632B-4ED9-BA4A-8595FC069DAB@nuance.com> Looking for feedback/strategies in setting up several GPFS servers with local SAS. They would all be part of the same file system. The systems are all similar in configuration - 70 4TB drives. Options I'm considering: - Create RAID arrays of the disks on each server (worried about the RAID rebuild time when a drive fails with 4, 6, 8TB drives) - No RAID with 2 replicas, single drive per NSD. When a drive fails, recreate the NSD - but then I need to fix up the data replication via restripe - FPO -
with multiple failure groups - letting the system manage replica placement and then have GPFS do the restripe on disk failure automatically Comments or other ideas welcome. Bob Oesterlin Sr Principal Storage Engineer, Nuance 507-269-0413 -------------- next part -------------- An HTML attachment was scrubbed... URL: From UWEFALKE at de.ibm.com Wed Nov 30 16:28:24 2016 From: UWEFALKE at de.ibm.com (Uwe Falke) Date: Wed, 30 Nov 2016 17:28:24 +0100 Subject: [gpfsug-discuss] Strategies - servers with local SAS disks In-Reply-To: <528C481B-632B-4ED9-BA4A-8595FC069DAB@nuance.com> References: <528C481B-632B-4ED9-BA4A-8595FC069DAB@nuance.com> Message-ID: I have once set up a small system with just a few SSDs in two NSD servers, providing a scratch file system in a computing cluster. No RAID, two replicas. It works, as long as the admins do not do silly things (like rebooting servers in sequence without checking for disks being up in between). Going for RAIDs without GPFS replication protects you against single disk failures, but you're lost if just one of your NSD servers goes off. FPO only makes sense IMHO if your NSD servers are also processing the data (and then you need to control that somehow). Other ideas? What else can you do with GPFS and local disks than what you considered? I suppose nothing reasonable ... Mit freundlichen Grüßen / Kind regards Dr. Uwe Falke IT Specialist High Performance Computing Services / Integrated Technology Services / Data Center Services ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Rathausstr. 7 09111 Chemnitz Phone: +49 371 6978 2165 Mobile: +49 175 575 2877 E-Mail: uwefalke at de.ibm.com ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Business & Technology Services GmbH / Geschäftsführung: Frank Hammer, Thorsten Moehring Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, HRB 17122 From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 11/30/2016 03:34 PM Subject: [gpfsug-discuss] Strategies - servers with local SAS disks Sent by: gpfsug-discuss-bounces at spectrumscale.org Looking for feedback/strategies in setting up several GPFS servers with local SAS. They would all be part of the same file system. The systems are all similar in configuration - 70 4TB drives. Options I'm considering: - Create RAID arrays of the disks on each server (worried about the RAID rebuild time when a drive fails with 4, 6, 8TB drives) - No RAID with 2 replicas, single drive per NSD. When a drive fails, recreate the NSD - but then I need to fix up the data replication via restripe - FPO - with multiple failure groups - letting the system manage replica placement and then have GPFS do the restripe on disk failure automatically Comments or other ideas welcome.
Bob Oesterlin Sr Principal Storage Engineer, Nuance 507-269-0413 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From abeattie at au1.ibm.com Wed Nov 30 20:51:12 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Wed, 30 Nov 2016 20:51:12 +0000 Subject: [gpfsug-discuss] Strategies - servers with local SAS disks In-Reply-To: <528C481B-632B-4ED9-BA4A-8595FC069DAB@nuance.com> References: <528C481B-632B-4ED9-BA4A-8595FC069DAB@nuance.com> Message-ID: An HTML attachment was scrubbed... URL:
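Tying the options in this thread to concrete commands: the "no RAID, two replicas" and FPO variants both come down to how the NSD stanza file spreads failure groups across servers, so that GPFS never keeps both copies of a block on the same box. A hedged sketch only; the server names, device paths and file system name below are invented, and a real FPO layout would additionally use a %pool line with allowWriteAffinity=yes:

# disks.stanza - one NSD per local drive, one failure group per server
%nsd: device=/dev/sdb nsd=srv1_sdb servers=gpfs-srv1 usage=dataAndMetadata failureGroup=1 pool=system
%nsd: device=/dev/sdc nsd=srv1_sdc servers=gpfs-srv1 usage=dataAndMetadata failureGroup=1 pool=system
%nsd: device=/dev/sdb nsd=srv2_sdb servers=gpfs-srv2 usage=dataAndMetadata failureGroup=2 pool=system
mmcrnsd -F disks.stanza
# default and maximum replication of 2 for both data and metadata
mmcrfs gpfs01 -F disks.stanza -m 2 -M 2 -r 2 -R 2
# after a failed drive is replaced and its NSD recreated, repair the replication level
mmrestripefs gpfs01 -r

The mmrestripefs run is the cost mentioned for the single-drive-per-NSD option; FPO hands replica placement to GPFS, but the restripe after a disk failure still has to happen.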