From oehmes at gmail.com Fri Mar 1 01:33:58 2019 From: oehmes at gmail.com (Sven Oehme) Date: Thu, 28 Feb 2019 17:33:58 -0800 Subject: [gpfsug-discuss] Clarification of mmdiag --iohist output In-Reply-To: References: <9338621C-3F85-48DF-AE42-64998680E14C@vanderbilt.edu> Message-ID: Hi, using nsdSmallThreadRatio 1 is not necessarily correct, as it 'significant depends' (most used word combination of performance engineers) on your workload. to give some more background - on reads you need much more threads for small i/os than for large i/os to get maximum performance, the reason is a small i/o usually only reads one strip of data (sitting on one physical device) while a large i/o reads an entire stripe (which typically spans multiple devices). as a more concrete example, in a 8+2p raid setup a single full stripe read will trigger internal reads in parallel to 8 different targets at the same time, so for small i/os you would need 8 times as many small read requests (and therefore threads) to keep the drives busy at the same level. on writes its even more complex, a large full stripe write usually just writes to all target disks, while a tiny small write in the middle might force a read / modify / write which can have a huge write amplification and cause more work than a large full track i/o. raid controller caches also play a significant role here and make this especially hard to optimize as you need to know exactly what and where to measure when you tune to get improvements for real world workload and not just improve your synthetic test but actually hurt your real application performance. i should write a book about this some day ;-) hope that helps. Sven On Thu, Feb 21, 2019 at 4:23 AM Frederick Stock wrote: > Kevin I'm assuming you have seen the article on IBM developerWorks about > the GPFS NSD queues. It provides useful background for analyzing the dump > nsd information. Here I'll list some thoughts for items that you can > investigate/consider. > > If your NSD servers are doing both large (greater than 64K) and small (64K > or less) IOs then you want to have the nsdSmallThreadRatio set to 1 as it > seems you do for the NSD servers. This provides an equal number of SMALL > and LARGE NSD queues. You can also increase the total number of queues > (currently 256) but I cannot determine if that is necessary from the data > you provided. Only on rare occasions have I seen a need to increase the > number of queues. > > The fact that you have 71 highest pending on your LARGE queues and 73 > highest pending on your SMALL queues would imply your IOs are queueing for > a good while either waiting for resources in GPFS or waiting for IOs to > complete. Your maximum buffer size is 16M which is defined to be the > largest IO that can be requested by GPFS. This is the buffer size that > GPFS will use for LARGE IOs. You indicated you had sufficient memory on > the NSD servers but what is the value for the pagepool on those servers, > and what is the value of the nsdBufSpace parameter? If the NSD server is > just that then usually nsdBufSpace is set to 70. The IO buffers used by > the NSD server come from the pagepool so you need sufficient space there > for the maximum number of LARGE IO buffers that would be used concurrently > by GPFS or threads will need to wait for those buffers to become > available. Essentially you want to ensure you have sufficient memory for > the maximum number of IOs all doing a large IO and that value being less > than 70% of the pagepool size. 
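A rough back-of-the-envelope check of the sizing rule described above (a sketch only, not an official formula; the figures are the ones quoted elsewhere in this thread and should be replaced with your own):

# show the relevant settings in effect on an NSD server
mmdiag --config | grep -E 'pagepool|nsdBufSpace|nsdMaxWorkerThreads'

# worst case: every NSD worker thread holds one 16M LARGE buffer at once
#   1024 threads x 16 MiB = 16 GiB of buffer space
# with nsdBufSpace at 70, the pagepool would need to be at least
#   16 GiB / 0.70 = roughly 23 GiB
# for that worst case to fit without threads waiting for buffers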
> > You could look at the settings for the FC cards to ensure they are > configured to do the largest IOs possible. I forget the actual values > (have not done this for awhile) but there are settings for the adapters > that control the maximum IO size that will be sent. I think you want this > to be as large as the adapter can handle to reduce the number of messages > needed to complete the large IOs done by GPFS. > > > Fred > __________________________________________________ > Fred Stock | IBM Pittsburgh Lab | 720-430-8821 <(720)%20430-8821> > stockf at us.ibm.com > > > > ----- Original message ----- > From: "Buterbaugh, Kevin L" > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Clarification of mmdiag --iohist output > Date: Thu, Feb 21, 2019 6:39 AM > > Hi All, > > My thanks to Aaron, Sven, Steve, and whoever responded for the GPFS team. > You confirmed what I suspected ? my example 10 second I/O was _from an NSD > server_ ? and since we?re in a 8 Gb FC SAN environment, it therefore means > - correct me if I?m wrong about this someone - that I?ve got a problem > somewhere in one (or more) of the following 3 components: > > 1) the NSD servers > 2) the SAN fabric > 3) the storage arrays > > I?ve been looking at all of the above and none of them are showing any > obvious problems. I?ve actually got a techie from the storage array vendor > stopping by on Thursday, so I?ll see if he can spot anything there. Our FC > switches are QLogic?s, so I?m kinda screwed there in terms of getting any > help. But I don?t see any errors in the switch logs and ?show perf? on the > switches is showing I/O rates of 50-100 MB/sec on the in use ports, so I > don?t _think_ that?s the issue. > > And this is the GPFS mailing list, after all ? so let?s talk about the NSD > servers. Neither memory (64 GB) nor CPU (2 x quad-core Intel Xeon E5620?s) > appear to be an issue. But I have been looking at the output of ?mmfsadm > saferdump nsd? based on what Aaron and then Steve said. Here?s some fairly > typical output from one of the SMALL queues (I?ve checked several of my 8 > NSD servers and they?re all showing similar output): > > Queue NSD type NsdQueueTraditional [244]: SMALL, threads started 12, > active 3, highest 12, deferred 0, chgSize 0, draining 0, is_chg 0 > requests pending 0, highest pending 73, total processed 4859732 > mutex 0x7F3E449B8F10, reqCond 0x7F3E449B8F58, thCond 0x7F3E449B8F98, > queue 0x7F3E449B8EF0, nFreeNsdRequests 29 > > And for a LARGE queue: > > Queue NSD type NsdQueueTraditional [8]: LARGE, threads started 12, > active 1, highest 12, deferred 0, chgSize 0, draining 0, is_chg 0 > requests pending 0, highest pending 71, total processed 2332966 > mutex 0x7F3E441F3890, reqCond 0x7F3E441F38D8, thCond 0x7F3E441F3918, > queue 0x7F3E441F3870, nFreeNsdRequests 31 > > So my large queues seem to be slightly less utilized than my small queues > overall ? i.e. I see more inactive large queues and they generally have a > smaller ?highest pending? value. > > Question: are those non-zero ?highest pending? values something to be > concerned about? > > I have the following thread-related parameters set: > > [common] > maxReceiverThreads 12 > nsdMaxWorkerThreads 640 > nsdThreadsPerQueue 4 > nsdSmallThreadRatio 3 > workerThreads 128 > > [serverLicense] > nsdMaxWorkerThreads 1024 > nsdThreadsPerQueue 12 > nsdSmallThreadRatio 1 > pitWorkerThreadsPerNode 3 > workerThreads 1024 > > Also, at the top of the ?mmfsadm saferdump nsd? 
output I see: > > Total server worker threads: running 1008, desired 147, forNSD 147, forGNR > 0, nsdBigBufferSize 16777216 > nsdMultiQueue: 256, nsdMultiQueueType: 1, nsdMinWorkerThreads: 16, > nsdMaxWorkerThreads: 1024 > > Question: is the fact that 1008 is pretty close to 1024 a concern? > > Anything jump out at anybody? I don?t mind sharing full output, but it is > rather lengthy. Is this worthy of a PMR? > > Thanks! > > -- > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and > Education > Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 <(615)%20875-9633> > > > On Feb 17, 2019, at 1:01 PM, IBM Spectrum Scale wrote: > > Hi Kevin, > > The I/O hist shown by the command mmdiag --iohist actually depends on the > node on which you are running this command from. > If you are running this on a NSD server node then it will show the time > taken to complete/serve the read or write I/O operation sent from the > client node. > And if you are running this on a client (or non NSD server) node then it > will show the complete time taken by the read or write I/O operation > requested by the client node to complete. > So in a nut shell for the NSD server case it is just the latency of the > I/O done on disk by the server whereas for the NSD client case it also the > latency of send and receive of I/O request to the NSD server along with the > latency of I/O done on disk by the NSD server. > I hope this answers your query. > > > Regards, The Spectrum Scale (GPFS) team > > > ------------------------------------------------------------------------------------------------------------------ > If you feel that your question can benefit other users of Spectrum Scale > (GPFS), then please post it to the public IBM developerWroks Forum at > https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479 > > . > > If your query concerns a potential software error in Spectrum Scale (GPFS) > and you have an IBM software maintenance contract please contact > 1-800-237-5511 <(800)%20237-5511> in the United States or your local IBM > Service Center in other countries. > > The forum is informally monitored as time permits and should not be used > for priority messages to the Spectrum Scale (GPFS) team. > > > > From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list > Date: 02/16/2019 08:18 PM > Subject: [gpfsug-discuss] Clarification of mmdiag --iohist output > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------ > > > > Hi All, > > Been reading man pages, docs, and Googling, and haven?t found a definitive > answer to this question, so I knew exactly where to turn? ;-) > > I?m dealing with some slow I/O?s to certain storage arrays in our > environments ? like really, really slow I/O?s ? here?s just one example > from one of my NSD servers of a 10 second I/O: > > 08:49:34.943186 W data 30:41615622144 2048 10115.192 srv > dm-92 > > So here?s my question ? when mmdiag ?iohist tells me that that I/O took > slightly over 10 seconds, is that: > > 1. The time from when the NSD server received the I/O request from the > client until it shipped the data back onto the wire towards the client? > 2. The time from when the client issued the I/O request until it received > the data back from the NSD server? > 3. Something else? > > I?m thinking it?s #1, but want to confirm. Which one it is has very > obvious implications for our troubleshooting steps. Thanks in advance? > > Kevin > ? 
> Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and > Education > *Kevin.Buterbaugh at vanderbilt.edu* - > (615)875-9633 <(615)%20875-9633> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > > https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C2bfb2e8e30e64fa06c0f08d6959b2d38%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C0%7C636860891056297114&sdata=5pL67mhVyScJovkRHRqZog9bM5BZG8F2q972czIYAbA%3D&reserved=0 > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jdratlif at iu.edu Tue Mar 5 16:21:18 2019 From: jdratlif at iu.edu (Ratliff, John) Date: Tue, 5 Mar 2019 16:21:18 +0000 Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another Message-ID: <827394bcbb794a0d9bd5bd8341fc1593@IN-CCI-D1S14.ads.iu.edu> We use a GPFS file system for our computing clusters and we're working on moving to a new SAN. We originally tried AFM, but it didn't seem to work very well. We tried to do a prefetch on a test policy scan of 100 million files, and after 24 hours it hadn't pre-fetched anything. It wasn't clear what was happening. Some smaller tests succeeded, but the NFSv4 ACLs did not seem to be transferred. Since then we started using rsync with the GPFS attrs patch. We have over 600 million files and 700 TB. I split up the rsync tasks with lists of files generated by the policy engine and we transferred the original data in about 2 weeks. Now we're working on final synchronization. I'd like to use one of the delete options to remove files that were sync'd earlier and then deleted. This can't be combined with the files-from option, so it's harder to break up the rsync tasks. Some of the directories I'm running this against have 30-150 million files each. This can take quite some time with a single rsync process. I'm also wondering if any of my rsync options are unnecessary. I was using avHAXS and numeric-ids. I'm thinking the A (acls) and X (xatttrs) might be unnecessary with GPFS->GPFS. We're only using NFSv4 GPFS ACLs. I don't know if GPFS uses any xattrs that rsync would sync or not. Removing those two options removed several system calls, which should make it much faster, but I want to make sure I'm syncing correctly. Also, it seems there is a problem with the GPFS patch on rsync where it will always give an error trying to get GPFS attributes on a symlink, which means it doesn't sync any symlinks when using that option. So you can rsync symlinks or GPFS attrs, but not both at the same time. This has lead to me running two rsyncs, one to get all files and one to get all attributes. Thanks for any ideas or suggestions. 
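One way to split up the --delete pass, since --delete cannot be combined with --files-from, is to run one rsync per top-level directory and drive a handful of them in parallel. A sketch (paths and the parallel job count are placeholders, and the GPFS-attribute option from the patched rsync is omitted; whether that option also carries the NFSv4 ACLs is exactly the open question above, so verify on a small test tree first):

# one rsync job per top-level directory, at most 8 running at a time
cd /gpfs/oldfs/projects
ls -d */ | xargs -P 8 -I{} \
  rsync -aHS --numeric-ids --delete ./{} /gpfs/newfs/projects/{}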
John Ratliff | Pervasive Technology Institute | UITS | Research Storage - Indiana University | http://pti.iu.edu -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 5670 bytes Desc: not available URL: From S.J.Thompson at bham.ac.uk Tue Mar 5 16:38:31 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Tue, 5 Mar 2019 16:38:31 +0000 Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another In-Reply-To: <827394bcbb794a0d9bd5bd8341fc1593@IN-CCI-D1S14.ads.iu.edu> References: <827394bcbb794a0d9bd5bd8341fc1593@IN-CCI-D1S14.ads.iu.edu> Message-ID: I wrote a patch to mpifileutils which will copy gpfs attributes, but when we played with it with rsync, something was obviously still different about the attrs from each, so use with care. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Ratliff, John [jdratlif at iu.edu] Sent: 05 March 2019 16:21 To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another We use a GPFS file system for our computing clusters and we?re working on moving to a new SAN. We originally tried AFM, but it didn?t seem to work very well. We tried to do a prefetch on a test policy scan of 100 million files, and after 24 hours it hadn?t pre-fetched anything. It wasn?t clear what was happening. Some smaller tests succeeded, but the NFSv4 ACLs did not seem to be transferred. Since then we started using rsync with the GPFS attrs patch. We have over 600 million files and 700 TB. I split up the rsync tasks with lists of files generated by the policy engine and we transferred the original data in about 2 weeks. Now we?re working on final synchronization. I?d like to use one of the delete options to remove files that were sync?d earlier and then deleted. This can?t be combined with the files-from option, so it?s harder to break up the rsync tasks. Some of the directories I?m running this against have 30-150 million files each. This can take quite some time with a single rsync process. I?m also wondering if any of my rsync options are unnecessary. I was using avHAXS and numeric-ids. I?m thinking the A (acls) and X (xatttrs) might be unnecessary with GPFS->GPFS. We?re only using NFSv4 GPFS ACLs. I don?t know if GPFS uses any xattrs that rsync would sync or not. Removing those two options removed several system calls, which should make it much faster, but I want to make sure I?m syncing correctly. Also, it seems there is a problem with the GPFS patch on rsync where it will always give an error trying to get GPFS attributes on a symlink, which means it doesn?t sync any symlinks when using that option. So you can rsync symlinks or GPFS attrs, but not both at the same time. This has lead to me running two rsyncs, one to get all files and one to get all attributes. Thanks for any ideas or suggestions. John Ratliff | Pervasive Technology Institute | UITS | Research Storage ? 
Indiana University | http://pti.iu.edu From Robert.Oesterlin at nuance.com Tue Mar 5 19:57:45 2019 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 5 Mar 2019 19:57:45 +0000 Subject: [gpfsug-discuss] Reminder - Registration now open - US User Group Meeting, April 16-17th, NCAR Boulder Message-ID: Registration is now open: https://www.eventbrite.com/e/spectrum-scale-gpfs-user-group-us-spring-2019-meeting-tickets-57035376346 Please note that agenda details are not set yet but these will be finalized in the next few weeks - when they are I will post to the registration page and the mailing list. - April 15th: Informal social gather on Monday for those arriving early (location TBD) - April 16th: Full day of talks from IBM and the user community, Social and Networking Event (details TBD) - April 17th: Talks and breakout sessions (If you have any topics for the breakout sessions, let us know) Looking forward to seeing everyone in Boulder! Bob Oesterlin/Kristy Kallback-Rose -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Tue Mar 5 21:38:52 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Tue, 5 Mar 2019 21:38:52 +0000 Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another In-Reply-To: References: <827394bcbb794a0d9bd5bd8341fc1593@IN-CCI-D1S14.ads.iu.edu>, Message-ID: DDN also have a paid for product for doing moving of data (data flow) We found out about it after we did a massive data migration... I can't comment on it other than being aware of it. Sure your local DDN sales person can help. But if only IBM supported some sort of restripe to new block size, we wouldn't have to do this mass migration :-P Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Simon Thompson [S.J.Thompson at bham.ac.uk] Sent: 05 March 2019 16:38 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] suggestions forwar copying one GPFS file system into another I wrote a patch to mpifileutils which will copy gpfs attributes, but when we played with it with rsync, something was obviously still different about the attrs from each, so use with care. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Ratliff, John [jdratlif at iu.edu] Sent: 05 March 2019 16:21 To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another We use a GPFS file system for our computing clusters and we?re working on moving to a new SAN. We originally tried AFM, but it didn?t seem to work very well. We tried to do a prefetch on a test policy scan of 100 million files, and after 24 hours it hadn?t pre-fetched anything. It wasn?t clear what was happening. Some smaller tests succeeded, but the NFSv4 ACLs did not seem to be transferred. Since then we started using rsync with the GPFS attrs patch. We have over 600 million files and 700 TB. I split up the rsync tasks with lists of files generated by the policy engine and we transferred the original data in about 2 weeks. Now we?re working on final synchronization. I?d like to use one of the delete options to remove files that were sync?d earlier and then deleted. This can?t be combined with the files-from option, so it?s harder to break up the rsync tasks. 
Some of the directories I?m running this against have 30-150 million files each. This can take quite some time with a single rsync process. I?m also wondering if any of my rsync options are unnecessary. I was using avHAXS and numeric-ids. I?m thinking the A (acls) and X (xatttrs) might be unnecessary with GPFS->GPFS. We?re only using NFSv4 GPFS ACLs. I don?t know if GPFS uses any xattrs that rsync would sync or not. Removing those two options removed several system calls, which should make it much faster, but I want to make sure I?m syncing correctly. Also, it seems there is a problem with the GPFS patch on rsync where it will always give an error trying to get GPFS attributes on a symlink, which means it doesn?t sync any symlinks when using that option. So you can rsync symlinks or GPFS attrs, but not both at the same time. This has lead to me running two rsyncs, one to get all files and one to get all attributes. Thanks for any ideas or suggestions. John Ratliff | Pervasive Technology Institute | UITS | Research Storage ? Indiana University | http://pti.iu.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From Robert.Oesterlin at nuance.com Tue Mar 5 21:56:54 2019 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 5 Mar 2019 21:56:54 +0000 Subject: [gpfsug-discuss] Migrating billions of files? Message-ID: <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com> I?m looking at migration 3-4 Billion files, maybe 3PB of data between GPFS clusters. Most of the files are small - 60% 8K or less. Ideally I?d like to copy at least 15-20M files per day - ideally 50M. Any thoughts on how achievable this is? Or what to use? Either with AFM, mpifileutils, rsync.. other? Many of these files would be in 4k inodes. Destination is ESS. Bob Oesterlin Sr Principal Storage Engineer, Nuance -------------- next part -------------- An HTML attachment was scrubbed... URL: From YARD at il.ibm.com Wed Mar 6 09:01:16 2019 From: YARD at il.ibm.com (Yaron Daniel) Date: Wed, 6 Mar 2019 11:01:16 +0200 Subject: [gpfsug-discuss] Migrating billions of files? In-Reply-To: <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com> References: <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com> Message-ID: Hi What permissions you have ? Do u have only Posix , or also SMB attributes ? If only posix attributes you can do the following: - rsync (which will work on different filesets/directories in parallel. - AFM (but in case you need rollback - it will be problematic) Regards Yaron Daniel 94 Em Ha'Moshavot Rd Storage Architect ? IL Lab Services (Storage) Petach Tiqva, 49527 IBM Global Markets, Systems HW Sales Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 03/05/2019 11:57 PM Subject: [gpfsug-discuss] Migrating billions of files? Sent by: gpfsug-discuss-bounces at spectrumscale.org I?m looking at migration 3-4 Billion files, maybe 3PB of data between GPFS clusters. Most of the files are small - 60% 8K or less. Ideally I?d like to copy at least 15-20M files per day - ideally 50M. Any thoughts on how achievable this is? Or what to use? Either with AFM, mpifileutils, rsync.. other? Many of these files would be in 4k inodes. Destination is ESS. 
Bob Oesterlin Sr Principal Storage Engineer, Nuance _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=Bn1XE9uK2a9CZQ8qKnJE3Q&m=uXadyLeBnskK8mq-S8OjwY-ESxuNxXme9Akj9QaQBiE&s=UdKoJNySkr8itrQaRD9XMkVjBGnVaU8XnyxuKCldX-8&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4376 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5093 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4746 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4557 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5093 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4786 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5054 bytes Desc: not available URL: From S.J.Thompson at bham.ac.uk Wed Mar 6 09:08:21 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Wed, 6 Mar 2019 09:08:21 +0000 Subject: [gpfsug-discuss] Migrating billions of files? In-Reply-To: References: <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com> Message-ID: <011E924E-9FE6-4049-94B5-2D7EEB659D86@bham.ac.uk> AFM doesn?t work well if you have dependent filesets though .. which we did for quota purposes. Simon From: on behalf of "YARD at il.ibm.com" Reply-To: "gpfsug-discuss at spectrumscale.org" Date: Wednesday, 6 March 2019 at 09:01 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Migrating billions of files? Hi What permissions you have ? Do u have only Posix , or also SMB attributes ? If only posix attributes you can do the following: - rsync (which will work on different filesets/directories in parallel. - AFM (but in case you need rollback - it will be problematic) Regards ________________________________ Yaron Daniel 94 Em Ha'Moshavot Rd [cid:_1_0FC36C500FC3669C00318CDBC22583B5] Storage Architect ? IL Lab Services (Storage) Petach Tiqva, 49527 IBM Global Markets, Systems HW Sales Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel [IBM Storage Strategy and Solutions v1][IBM Storage Management and Data Protection v1][cid:_1_0FA0428C0FA03A6C00318CDBC22583B5][cid:_1_0FA044940FA03A6C00318CDBC22583B5] [https://acclaim-production-app.s3.amazonaws.com/images/6c2c3858-6df8-45be-ac2b-f93b8da74e20/Data%2BDriven%2BMulti%2BCloud%2BStrategy%2BV1%2Bver%2B4.png] [FlashSystem A9000/R Foundations] [All Flash Storage Foundations V2] From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 03/05/2019 11:57 PM Subject: [gpfsug-discuss] Migrating billions of files? 
Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ I?m looking at migration 3-4 Billion files, maybe 3PB of data between GPFS clusters. Most of the files are small - 60% 8K or less. Ideally I?d like to copy at least 15-20M files per day - ideally 50M. Any thoughts on how achievable this is? Or what to use? Either with AFM, mpifileutils, rsync.. other? Many of these files would be in 4k inodes. Destination is ESS. Bob Oesterlin Sr Principal Storage Engineer, Nuance _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 1852 bytes Desc: image001.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image002.gif Type: image/gif Size: 4377 bytes Desc: image002.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image003.gif Type: image/gif Size: 5094 bytes Desc: image003.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image004.gif Type: image/gif Size: 4747 bytes Desc: image004.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image005.gif Type: image/gif Size: 4558 bytes Desc: image005.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image006.gif Type: image/gif Size: 5094 bytes Desc: image006.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image007.gif Type: image/gif Size: 4787 bytes Desc: image007.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image008.gif Type: image/gif Size: 5055 bytes Desc: image008.gif URL: From YARD at il.ibm.com Wed Mar 6 09:13:18 2019 From: YARD at il.ibm.com (Yaron Daniel) Date: Wed, 6 Mar 2019 11:13:18 +0200 Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another In-Reply-To: References: <827394bcbb794a0d9bd5bd8341fc1593@IN-CCI-D1S14.ads.iu.edu>, Message-ID: Hi U can also use today Aspera - which will replicate gpfs extended attr. Integration of IBM Aspera Sync with IBM Spectrum Scale: Protecting and Sharing Files Globally http://www.redbooks.ibm.com/redpieces/abstracts/redp5527.html?Open I used in the past the arsync - used for Sonas - i think this is now the Regards Yaron Daniel 94 Em Ha'Moshavot Rd Storage Architect ? IL Lab Services (Storage) Petach Tiqva, 49527 IBM Global Markets, Systems HW Sales Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Simon Thompson To: gpfsug main discussion list Date: 03/05/2019 11:39 PM Subject: Re: [gpfsug-discuss] suggestions for copying one GPFS file system into another Sent by: gpfsug-discuss-bounces at spectrumscale.org DDN also have a paid for product for doing moving of data (data flow) We found out about it after we did a massive data migration... I can't comment on it other than being aware of it. Sure your local DDN sales person can help. 
But if only IBM supported some sort of restripe to new block size, we wouldn't have to do this mass migration :-P Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Simon Thompson [S.J.Thompson at bham.ac.uk] Sent: 05 March 2019 16:38 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] suggestions forwar copying one GPFS file system into another I wrote a patch to mpifileutils which will copy gpfs attributes, but when we played with it with rsync, something was obviously still different about the attrs from each, so use with care. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Ratliff, John [jdratlif at iu.edu] Sent: 05 March 2019 16:21 To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another We use a GPFS file system for our computing clusters and we?re working on moving to a new SAN. We originally tried AFM, but it didn?t seem to work very well. We tried to do a prefetch on a test policy scan of 100 million files, and after 24 hours it hadn?t pre-fetched anything. It wasn?t clear what was happening. Some smaller tests succeeded, but the NFSv4 ACLs did not seem to be transferred. Since then we started using rsync with the GPFS attrs patch. We have over 600 million files and 700 TB. I split up the rsync tasks with lists of files generated by the policy engine and we transferred the original data in about 2 weeks. Now we?re working on final synchronization. I?d like to use one of the delete options to remove files that were sync?d earlier and then deleted. This can?t be combined with the files-from option, so it?s harder to break up the rsync tasks. Some of the directories I?m running this against have 30-150 million files each. This can take quite some time with a single rsync process. I?m also wondering if any of my rsync options are unnecessary. I was using avHAXS and numeric-ids. I?m thinking the A (acls) and X (xatttrs) might be unnecessary with GPFS->GPFS. We?re only using NFSv4 GPFS ACLs. I don?t know if GPFS uses any xattrs that rsync would sync or not. Removing those two options removed several system calls, which should make it much faster, but I want to make sure I?m syncing correctly. Also, it seems there is a problem with the GPFS patch on rsync where it will always give an error trying to get GPFS attributes on a symlink, which means it doesn?t sync any symlinks when using that option. So you can rsync symlinks or GPFS attrs, but not both at the same time. This has lead to me running two rsyncs, one to get all files and one to get all attributes. Thanks for any ideas or suggestions. John Ratliff | Pervasive Technology Institute | UITS | Research Storage ? 
Indiana University | https://urldefense.proofpoint.com/v2/url?u=http-3A__pti.iu.edu&d=DwIF-g&c=jf_iaSHvJObTbx-siA1ZOg&r=Bn1XE9uK2a9CZQ8qKnJE3Q&m=Yz-c0LCo_QGBe4pgbJEr_zzSX4Q1ttDOaHYmcfLln5U&s=gNzUpbvNUfVteTqZ3zpzpbC4M1lQiopyrIfr46h4Okc&e= _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwIF-g&c=jf_iaSHvJObTbx-siA1ZOg&r=Bn1XE9uK2a9CZQ8qKnJE3Q&m=Yz-c0LCo_QGBe4pgbJEr_zzSX4Q1ttDOaHYmcfLln5U&s=pG-g3zRAtaMwcmwoabY4dvuI1j3jbLk-uGHZ6nz6TlU&e= _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwIF-g&c=jf_iaSHvJObTbx-siA1ZOg&r=Bn1XE9uK2a9CZQ8qKnJE3Q&m=Yz-c0LCo_QGBe4pgbJEr_zzSX4Q1ttDOaHYmcfLln5U&s=pG-g3zRAtaMwcmwoabY4dvuI1j3jbLk-uGHZ6nz6TlU&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4376 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5093 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4746 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4557 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5093 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4786 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5054 bytes Desc: not available URL: From YARD at il.ibm.com Wed Mar 6 09:17:59 2019 From: YARD at il.ibm.com (Yaron Daniel) Date: Wed, 6 Mar 2019 11:17:59 +0200 Subject: [gpfsug-discuss] Migrating billions of files? In-Reply-To: <011E924E-9FE6-4049-94B5-2D7EEB659D86@bham.ac.uk> References: <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com> <011E924E-9FE6-4049-94B5-2D7EEB659D86@bham.ac.uk> Message-ID: Hi U can also use today Aspera - which will replicate gpfs extended attr. Integration of IBM Aspera Sync with IBM Spectrum Scale: Protecting and Sharing Files Globally http://www.redbooks.ibm.com/redpieces/abstracts/redp5527.html?Open Regards Yaron Daniel 94 Em Ha'Moshavot Rd Storage Architect ? IL Lab Services (Storage) Petach Tiqva, 49527 IBM Global Markets, Systems HW Sales Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Simon Thompson To: gpfsug main discussion list Date: 03/06/2019 11:08 AM Subject: Re: [gpfsug-discuss] Migrating billions of files? Sent by: gpfsug-discuss-bounces at spectrumscale.org AFM doesn?t work well if you have dependent filesets though .. which we did for quota purposes. 
Simon From: on behalf of "YARD at il.ibm.com" Reply-To: "gpfsug-discuss at spectrumscale.org" Date: Wednesday, 6 March 2019 at 09:01 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Migrating billions of files? Hi What permissions you have ? Do u have only Posix , or also SMB attributes ? If only posix attributes you can do the following: - rsync (which will work on different filesets/directories in parallel. - AFM (but in case you need rollback - it will be problematic) Regards Yaron Daniel 94 Em Ha'Moshavot Rd Storage Architect ? IL Lab Services (Storage) Petach Tiqva, 49527 IBM Global Markets, Systems HW Sales Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 03/05/2019 11:57 PM Subject: [gpfsug-discuss] Migrating billions of files? Sent by: gpfsug-discuss-bounces at spectrumscale.org I?m looking at migration 3-4 Billion files, maybe 3PB of data between GPFS clusters. Most of the files are small - 60% 8K or less. Ideally I?d like to copy at least 15-20M files per day - ideally 50M. Any thoughts on how achievable this is? Or what to use? Either with AFM, mpifileutils, rsync.. other? Many of these files would be in 4k inodes. Destination is ESS. Bob Oesterlin Sr Principal Storage Engineer, Nuance _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=Bn1XE9uK2a9CZQ8qKnJE3Q&m=B2e9s5aGSXZvMOkd4ZPk_EIjfTloX7O_ExWsyR0RGP8&s=wwIfs_8RrX5Z7mGp2Mehj5z7z2yUhr0r-vO7TMyNUeE&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4376 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5093 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4746 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4557 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5093 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 4786 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 5054 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1852 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 4377 bytes Desc: not available URL: 

From alvise.dorigo at psi.ch Wed Mar 6 09:24:08 2019
From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI))
Date: Wed, 6 Mar 2019 09:24:08 +0000
Subject: [gpfsug-discuss] Memory accounting for processes writing to GPFS
Message-ID: <83A6EEB0EC738F459A39439733AE80452682711C@MBX214.d.ethz.ch>

Hello to everyone,
Here at PSI we're observing something that in principle seems strange (at least to me). We run a Java application writing to disk by means of a standard AsynchronousFileChannel, whose internals I do not know. There are two instances of this application: one runs on a node writing to a local drive, the other writes to a GPFS-mounted filesystem (this node is part of the cluster, no remote-mounting).

What we see is that in the former case the application has a lower VIRT+RES memory sum and the OS shows very heavy cache usage; in the latter, the OS cache is negligible while VIRT+RES is very (even too) high, with VIRT especially high. So I wonder what the difference is... Writing into a GPFS-mounted filesystem, as far as I understand, means "talking" to the local mmfsd daemon, which fills up its own pagepool, and the system then asynchronously writes those pages out to the real pdisks. But why does the Linux kernel account so much memory to the process itself? And why is so much of that memory VIRT rather than RES?

thanks in advance,
Alvise
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From mladen.portak at hr.ibm.com Wed Mar 6 09:49:13 2019
From: mladen.portak at hr.ibm.com (Mladen Portak)
Date: Wed, 6 Mar 2019 10:49:13 +0100
Subject: [gpfsug-discuss] Question about inodes incrise
Message-ID: 

Dear all, is the process of increasing inodes disruptive?

Thank you

Mladen Portak
Lab Service SEE Storage Consultant
mladen.portak at hr.ibm.com
+385 91 6308 293
IBM Hrvatska d.o.o. za proizvodnju i trgovinu
Miramarska 23, 10 000 Zagreb, Hrvatska
Registered with the Commercial Court in Zagreb under no. 080011422; share capital HRK 788,000.00, paid in full; Director: Željka Tišić
Bank account at RAIFFEISENBANK AUSTRIA d.d. Zagreb, Magazinska cesta 69, 10000 Zagreb, Hrvatska
IBAN: HR5424840081100396574 (SWIFT RZBHHR2X); OIB 43331467622
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From chris.schlipalius at pawsey.org.au Wed Mar 6 09:56:31 2019
From: chris.schlipalius at pawsey.org.au (Chris Schlipalius)
Date: Wed, 06 Mar 2019 17:56:31 +0800
Subject: [gpfsug-discuss] Migrating billions of files?
Message-ID: <8D4D1060-A52E-4A7B-AE2F-25AD44FF141A@pawsey.org.au> Hi Bob, so Simon has hit the nail on the head. So it?s a challenge, we used dcp with multiple parallel threads per nsd with mmdsh - 2PB and millions of files, it?s worth a test as it does look after xattribs, but test it. See https://github.com/hpc/dcp Test the preserve: -p, --preserve Preserve the original files' owner, group, permissions (including the setuid and setgid bits), time of last modification and time of last access. In case duplication of owner or group fails, the setuid and setgid bits are cleared. ------- We migrated between 12K storage FS a few years back. My colleague also has tested https://www.nersc.gov/users/storage-and-file-systems/transferring-data/bbcp/ or http://www.slac.stanford.edu/~abh/bbcp/ It?s excellent I hear with xattribs and recursive small files copy. I steer clear of rsync, different versions do not preserve xattribs and this is a bit of an issue some have found Regards, Chris Schlipalius Team Lead, Data Storage Infrastructure, Data & Visualisation, Pawsey Supercomputing Centre (CSIRO) 13 Burvill Court Kensington WA 6151 Australia From jake.carroll at uq.edu.au Wed Mar 6 11:06:49 2019 From: jake.carroll at uq.edu.au (Jake Carroll) Date: Wed, 6 Mar 2019 11:06:49 +0000 Subject: [gpfsug-discuss] SLURM scripts/policy for data movement into a flash pool? In-Reply-To: References: Message-ID: Hi Scale-folk. I have an IBM ESS GH14S building block currently configured for my HPC workloads. I've got about 1PB of /scratch filesystem configured in mechanical spindles via GNR and about 20TB of SSD/flash sitting in another GNR filesystem at the moment. My intention is to destroy that stand-alone flash filesystem eventually and use storage pools coupled with GPFS policy to warm up workloads into that flash storage: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.3/com.ibm.spectrum.scale.v4r23.doc/bl1adv_storagepool.htm A little dated, but that kind of thing. Does anyone have any experience in this space in using flash storage inside a pool with pre/post flight SLURM scripts to puppeteer GPFS policy to warm data up? I had a few ideas for policy construction around file size, file count, file access intensity. Someone mentioned heat map construction and mmdiag --iohist to me the other day. Could use some background there. If anyone has any SLURM specific integration tips for the scheduler or pre/post flight bits for SBATCH, it'd be really very much appreciated. This array really does fly along and surpassed my expectations - but, I want to get the most out of it that I can for my users - and I think storage pool automation and good file placement management is going to be an important part of that. Thank you. -jc -------------- next part -------------- An HTML attachment was scrubbed... URL: From stockf at us.ibm.com Wed Mar 6 11:13:50 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Wed, 6 Mar 2019 11:13:50 +0000 Subject: [gpfsug-discuss] Question about inodes incrise In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... URL: From stockf at us.ibm.com Wed Mar 6 11:20:11 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Wed, 6 Mar 2019 11:20:11 +0000 Subject: [gpfsug-discuss] Migrating billions of files? In-Reply-To: References: , <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com><011E924E-9FE6-4049-94B5-2D7EEB659D86@bham.ac.uk> Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: Image._1_0D9491980D948BE4003315D3C22583B5.gif Type: image/gif Size: 1851 bytes Desc: not available URL: 

From abeattie at au1.ibm.com Wed Mar 6 11:22:53 2019
From: abeattie at au1.ibm.com (Andrew Beattie)
Date: Wed, 6 Mar 2019 11:22:53 +0000
Subject: [gpfsug-discuss] Migrating billions of files?
In-Reply-To: 
References: , , <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com><011E924E-9FE6-4049-94B5-2D7EEB659D86@bham.ac.uk>
Message-ID: 

An HTML attachment was scrubbed...
URL: 

From Robert.Oesterlin at nuance.com Wed Mar 6 12:44:24 2019
From: Robert.Oesterlin at nuance.com (Oesterlin, Robert)
Date: Wed, 6 Mar 2019 12:44:24 +0000
Subject: [gpfsug-discuss] Follow-up: migrating billions of files
Message-ID: 

Some of you had questions to my original post.
More information:

Source:
- Files are straight GPFS/Posix - no extended NFSv4 ACLs
- A solution that requires $'s to be spent on software (i.e., Aspera) isn't a very viable option
- Both source and target clusters are in the same DC
- Source is stand-alone NSD servers (bonded 10g-E) and 8 Gb FC SAN storage
- Approx 40 file systems, a few large ones with 300M-400M files each, others smaller - no independent filesets
- Migration must pose minimal disruption to existing users

Target architecture is a small number of file systems (2-3) on ESS with independent filesets
- Target (ESS) will have multiple 40gb-E links on each NSD server (GS4)

My current thinking is AFM with a pre-populate of the file space and switch the clients over to have them pull data they need (most of the data is older and less active) and then let AFM populate the rest in the background.

Bob Oesterlin
Sr Principal Storage Engineer, Nuance

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From jjdoherty at yahoo.com Wed Mar 6 12:59:23 2019
From: jjdoherty at yahoo.com (Jim Doherty)
Date: Wed, 6 Mar 2019 12:59:23 +0000 (UTC)
Subject: [gpfsug-discuss] Memory accounting for processes writing to GPFS
In-Reply-To: <83A6EEB0EC738F459A39439733AE80452682711C@MBX214.d.ethz.ch>
References: <83A6EEB0EC738F459A39439733AE80452682711C@MBX214.d.ethz.ch>
Message-ID: <410609032.929267.1551877163983@mail.yahoo.com>

For any process with a large number of threads the VMM size has become an imaginary number ever since the glibc change to allocate a heap per thread. I look to /proc/$pid/status to find the memory used by a proc: RSS + Swap + kernel page tables.

Jim

On Wednesday, March 6, 2019, 4:25:48 AM EST, Dorigo Alvise (PSI) wrote:

Hello to everyone, Here at PSI we're observing something that in principle seems strange (at least to me). We run a Java application writing to disk by means of a standard AsynchronousFileChannel, whose internals I do not know. There are two instances of this application: one runs on a node writing to a local drive, the other writes to a GPFS-mounted filesystem (this node is part of the cluster, no remote-mounting). What we see is that in the former case the application has a lower VIRT+RES memory sum and the OS shows very heavy cache usage; in the latter, the OS cache is negligible while VIRT+RES is very (even too) high, with VIRT especially high. So I wonder what the difference is... Writing into a GPFS-mounted filesystem, as far as I understand, means "talking" to the local mmfsd daemon, which fills up its own pagepool, and the system then asynchronously writes those pages out to the real pdisks. But why does the Linux kernel account so much memory to the process itself? And why is so much of that memory VIRT rather than RES?

thanks in advance,
Alvise
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From UWEFALKE at de.ibm.com Wed Mar 6 13:13:16 2019 From: UWEFALKE at de.ibm.com (Uwe Falke) Date: Wed, 6 Mar 2019 14:13:16 +0100 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: References: Message-ID: Hi, in that case I'd open several tar pipes in parallel, maybe using directories carefully selected, like tar -c | ssh "tar -x" I am not quite sure whether "-C /" for tar works here ("tar -C / -x"), but along these lines might be a good efficient method. target_hosts should be all nodes haveing the target file system mounted, and you should start those pipes on the nodes with the source file system. It is best to start with the largest directories, and use some masterscript to start the tar pipes controlled by semaphores to not overload anything. Mit freundlichen Gr??en / Kind regards Dr. Uwe Falke IT Specialist High Performance Computing Services / Integrated Technology Services / Data Center Services ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Rathausstr. 7 09111 Chemnitz Phone: +49 371 6978 2165 Mobile: +49 175 575 2877 E-Mail: uwefalke at de.ibm.com ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: Thomas Wolter, Sven Schoo? Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, HRB 17122 From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 06/03/2019 13:44 Subject: [gpfsug-discuss] Follow-up: migrating billions of files Sent by: gpfsug-discuss-bounces at spectrumscale.org Some of you had questions to my original post. More information: Source: - Files are straight GPFS/Posix - no extended NFSV4 ACLs - A solution that requires $?s to be spent on software (ie, Aspera) isn?t a very viable option - Both source and target clusters are in the same DC - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN storage - Approx 40 file systems, a few large ones with 300M-400M files each, others smaller - no independent file sets - migration must pose minimal disruption to existing users Target architecture is a small number of file systems (2-3) on ESS with independent filesets - Target (ESS) will have multiple 40gb-E links on each NSD server (GS4) My current thinking is AFM with a pre-populate of the file space and switch the clients over to have them pull data they need (most of the data is older and less active) and them let AFM populate the rest in the background. 
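For the AFM route described in the quoted plan, the cache-side commands on the new cluster look roughly like the sketch below. This is only an outline: the device, fileset and target names are invented, the gpfs:// target assumes the old file system is remote-mounted on the ESS cluster, and the --directory form of prefetch is the newer (5.0.2-era) variant, so check the mmafmctl options for your release.

    # Read-only AFM cache fileset on the new file system, backed by the old one.
    mmcrfileset essfs labdata -p afmTarget=gpfs:///gpfs/oldfs/labdata \
        -p afmMode=ro --inode-space new
    mmlinkfileset essfs labdata -J /gpfs/essfs/labdata

    # Pre-populate (warm) the cache in the background.
    mmafmctl essfs prefetch -j labdata --directory /gpfs/essfs/labdata

    # Once most data is cached, switch the mode before cutting users over
    # (mode conversion may require unlinking/relinking the fileset).
    mmchfileset essfs labdata -p afmMode=independent-writer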
Bob Oesterlin Sr Principal Storage Engineer, Nuance _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=fTuVGtgq6A14KiNeaGfNZzOOgtHW5Lm4crZU6lJxtB8&m=J5RpIj-EzFyU_dM9I4P8SrpHMikte_pn9sbllFcOvyM&s=fEwDQyDSL7hvOVPbg_n8o_LDz-cLqSI6lQtSzmhaSoI&e= From TOMP at il.ibm.com Wed Mar 6 13:14:47 2019 From: TOMP at il.ibm.com (Tomer Perry) Date: Wed, 6 Mar 2019 07:14:47 -0600 Subject: [gpfsug-discuss] Memory accounting for processes writing to GPFS In-Reply-To: <410609032.929267.1551877163983@mail.yahoo.com> References: <83A6EEB0EC738F459A39439733AE80452682711C@MBX214.d.ethz.ch> <410609032.929267.1551877163983@mail.yahoo.com> Message-ID: It might be the case that AsynchronousFileChannel is actually doing mmap access to the files. Thus, the memory management will be completely different with GPFS in compare to local fs. Regards, Tomer Perry Scalable I/O Development (Spectrum Scale) email: tomp at il.ibm.com 1 Azrieli Center, Tel Aviv 67021, Israel Global Tel: +1 720 3422758 Israel Tel: +972 3 9188625 Mobile: +972 52 2554625 From: Jim Doherty To: gpfsug main discussion list Date: 06/03/2019 06:59 Subject: Re: [gpfsug-discuss] Memory accounting for processes writing to GPFS Sent by: gpfsug-discuss-bounces at spectrumscale.org For any process with a large number of threads the VMM size has become an imaginary number ever since the glibc change to allocate a heap per thread. I look to /proc/$pid/status to find the memory used by a proc RSS + Swap + kernel page tables. Jim On Wednesday, March 6, 2019, 4:25:48 AM EST, Dorigo Alvise (PSI) wrote: Hello to everyone, Here a PSI we're observing something that in principle seems strange (at least to me). We run a Java application writing into disk by mean of a standard AsynchronousFileChannel, whose I do not the details. There are two instances of this application: one runs on a node writing on a local drive, the other one runs writing on a GPFS mounted filesystem (this node is part of the cluster, no remote-mounting). What we do see is that in the former the application has a lower sum VIRT+RES memory and the OS shows a really big cache usage; in the latter, OS's cache is negligible while VIRT+RES is very (even too) high (with VIRT very high). So I wonder what is the difference... Writing into a GPFS mounted filesystem, as far as I understand, implies "talking" to the local mmfsd daemon which fills up its own pagepool... and then the system will asynchronously handle these pages to be written on real pdisk. But why the Linux kernel accounts so much memory to the process itself ? And why this large amount of memory is much more VIRT than RES ? thanks in advance, Alvise _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=mLPyKeOa1gNDrORvEXBgMw&m=cm3DTOcac__Y20DdtIZcwEXYG9GqlDxlHFTLeSAUOdE&s=hxak8mqRwAQuN7BaF-B9gvTQu1PGnCFF8am1GvMu3bI&e= -------------- next part -------------- An HTML attachment was scrubbed... 
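If you want to test the mmap hypothesis on a running JVM, the kernel's per-process maps are enough to see it (a quick sketch; /gpfs01 stands in for the real mount point):

    # File-backed mappings that live on the GPFS mount; many large entries
    # here would explain a huge VIRT with a comparatively small RES.
    pid=$1
    grep ' /gpfs01/' /proc/"$pid"/maps | awk '{ print $1, $NF }'
    # /proc/<pid>/smaps shows per-mapping Size vs. Rss if more detail is needed.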
URL: From makaplan at us.ibm.com Wed Mar 6 15:01:57 2019 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 6 Mar 2019 10:01:57 -0500 Subject: [gpfsug-discuss] Migrating billions of files? mmfind ... mmxcp In-Reply-To: References: , , <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com><011E924E-9FE6-4049-94B5-2D7EEB659D86@bham.ac.uk> Message-ID: mmxcp may be in samples/ilm if not, perhaps we can put it on an approved file sharing service ... + mmxcp script, for use with mmfind ... -xargs mmxcp ... Which makes parallelized file copy relatively easy and super fast! Usage: /gh/bin/mmxcp -t target -p strip_count source_pathname1 source_pathname2 ... Run "cp" in a mmfind ... -xarg ... pipeline, e.g. mmfind -polFlags '-N all -g /gpfs/tmp' /gpfs/source -gpfsWeight DIRECTORY_HASH -xargs mmxcp -t /target -p 2 Options: -t target_path : Copy files to this path. -p strip_count : Remove this many directory names from the pathnames of the source files. -a : pass -a to cp -v : pass -v to cp -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 21994 bytes Desc: not available URL: From S.J.Thompson at bham.ac.uk Wed Mar 6 15:07:09 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Wed, 6 Mar 2019 15:07:09 +0000 Subject: [gpfsug-discuss] Migrating billions of files? mmfind ... mmxcp In-Reply-To: References: , , <4D433B18-3B14-4DFB-8954-868E67DA566D@nuance.com><011E924E-9FE6-4049-94B5-2D7EEB659D86@bham.ac.uk> , Message-ID: Last time this was mentioned, it doesn't do ACLs? Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of makaplan at us.ibm.com [makaplan at us.ibm.com] Sent: 06 March 2019 15:01 To: gpfsug main discussion list Cc: gpfsug-discuss-bounces at spectrumscale.org Subject: Re: [gpfsug-discuss] Migrating billions of files? mmfind ... mmxcp mmxcp may be in samples/ilm if not, perhaps we can put it on an approved file sharing service ... + mmxcp script, for use with mmfind ... -xargs mmxcp ... Which makes parallelized file copy relatively easy and super fast! Usage: /gh/bin/mmxcp -t target -p strip_count source_pathname1 source_pathname2 ... Run "cp" in a mmfind ... -xarg ... pipeline, e.g. mmfind -polFlags '-N all -g /gpfs/tmp' /gpfs/source -gpfsWeight DIRECTORY_HASH -xargs mmxcp -t /target -p 2 Options: -t target_path : Copy files to this path. -p strip_count : Remove this many directory names from the pathnames of the source files. -a : pass -a to cp -v : pass -v to cp [Marc A Kaplan] -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00001.gif Type: image/gif Size: 21994 bytes Desc: ATT00001.gif URL: From oehmes at gmail.com Wed Mar 6 15:30:31 2019 From: oehmes at gmail.com (Sven Oehme) Date: Wed, 06 Mar 2019 07:30:31 -0800 Subject: [gpfsug-discuss] Question about inodes incrise In-Reply-To: References: Message-ID: <8140D183-2D50-4FF9-8BEA-329F4C9A5977@gmail.com> While Fred is right, in most cases you shouldn?t see this, under heavy burst create workloads before 5.0.2 you can even trigger out of space errors even you have plenty of space in the filesystem (very hard to reproduce so unlikely to hit for a normal enduser). to address the issues there have been significant enhancements in this area in 5.0.2. 
prior the changes expansions under heavy load many times happened in the foreground (means the application waits for the expansion to finish before it proceeds) especially if many nodes create lots of files in parallel. Since the changes you now see messages on the filesystem manager in its mmfs log when a expansion happens with details including if somebody had to wait for it or not. Sven From: on behalf of Mladen Portak Reply-To: gpfsug main discussion list Date: Wednesday, March 6, 2019 at 1:49 AM To: Subject: [gpfsug-discuss] Question about inodes incrise Dear. is it process of increasing inodes disruptiv? Thank You Mladen Portak Lab Service SEE Storage Consultant mladen.portak at hr.ibm.com +385 91 6308 293 IBM Hrvatska d.o.o. za proizvodnju i trgovinu Miramarska 23, 10 000 Zagreb, Hrvatska Upisan kod Trgova?kog suda u Zagrebu pod br. 080011422 Temeljni kapital: 788,000.00 kuna - upla?en u cijelosti Direktor: ?eljka Ti?i? ?iro ra?un kod: RAIFFEISENBANK AUSTRIA d.d. Zagreb, Magazinska cesta 69, 10000 Zagreb, Hrvatska IBAN: HR5424840081100396574 (SWIFT RZBHHR2X); OIB 43331467622 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From eboyd at us.ibm.com Wed Mar 6 15:41:54 2019 From: eboyd at us.ibm.com (Edward Boyd) Date: Wed, 6 Mar 2019 15:41:54 +0000 Subject: [gpfsug-discuss] gpfsug-discuss mmxcp In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Wed Mar 6 15:49:32 2019 From: ulmer at ulmer.org (Stephen Ulmer) Date: Wed, 6 Mar 2019 10:49:32 -0500 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: References: Message-ID: <66B1E0A0-723A-4D08-B6D1-D99392E3DE71@ulmer.org> In the case where tar -C doesn?t work, you can always use a subshell (I do this regularly): tar -cf . | ssh someguy at otherhost "(cd targetdir; tar -xvf - )" Only use -v on one end. :) Also, for parallel work that?s not designed that way, don't underestimate the -P option to GNU and BSD xargs! With the amount of stuff to be copied, making sure a subjob doesn?t finish right after you go home leaving a slot idle for several hours is a medium deal. In Bob?s case, however, treating it like a DR exercise where users "restore" their own files by accessing them (using AFM instead of HSM) is probably the most convenient. -- Stephen > On Mar 6, 2019, at 8:13 AM, Uwe Falke > wrote: > > Hi, in that case I'd open several tar pipes in parallel, maybe using > directories carefully selected, like > > tar -c | ssh "tar -x" > > I am not quite sure whether "-C /" for tar works here ("tar -C / -x"), but > along these lines might be a good efficient method. target_hosts should be > all nodes haveing the target file system mounted, and you should start > those pipes on the nodes with the source file system. > It is best to start with the largest directories, and use some > masterscript to start the tar pipes controlled by semaphores to not > overload anything. > > > > Mit freundlichen Gr??en / Kind regards > > > Dr. Uwe Falke > > IT Specialist > High Performance Computing Services / Integrated Technology Services / > Data Center Services > ------------------------------------------------------------------------------------------------------------------------------------------- > IBM Deutschland > Rathausstr. 
7 > 09111 Chemnitz > Phone: +49 371 6978 2165 > Mobile: +49 175 575 2877 > E-Mail: uwefalke at de.ibm.com > ------------------------------------------------------------------------------------------------------------------------------------------- > IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: > Thomas Wolter, Sven Schoo? > Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, > HRB 17122 > > > > > From: "Oesterlin, Robert" > > To: gpfsug main discussion list > > Date: 06/03/2019 13:44 > Subject: [gpfsug-discuss] Follow-up: migrating billions of files > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Some of you had questions to my original post. More information: > > Source: > - Files are straight GPFS/Posix - no extended NFSV4 ACLs > - A solution that requires $?s to be spent on software (ie, Aspera) isn?t > a very viable option > - Both source and target clusters are in the same DC > - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN storage > - Approx 40 file systems, a few large ones with 300M-400M files each, > others smaller > - no independent file sets > - migration must pose minimal disruption to existing users > > Target architecture is a small number of file systems (2-3) on ESS with > independent filesets > - Target (ESS) will have multiple 40gb-E links on each NSD server (GS4) > > My current thinking is AFM with a pre-populate of the file space and > switch the clients over to have them pull data they need (most of the data > is older and less active) and them let AFM populate the rest in the > background. > > > Bob Oesterlin > Sr Principal Storage Engineer, Nuance > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=fTuVGtgq6A14KiNeaGfNZzOOgtHW5Lm4crZU6lJxtB8&m=J5RpIj-EzFyU_dM9I4P8SrpHMikte_pn9sbllFcOvyM&s=fEwDQyDSL7hvOVPbg_n8o_LDz-cLqSI6lQtSzmhaSoI&e= > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Mar 6 15:59:55 2019 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 6 Mar 2019 10:59:55 -0500 Subject: [gpfsug-discuss] gpfsug-discuss mmxcp In-Reply-To: References: Message-ID: Basically yes. If you can't find the scripts in 4.2 samples... You can copy them over from 5.x to the 4.2 system... Should work except perhaps for some of the more esoteric find conditionals... From: "Edward Boyd" To: gpfsug-discuss at spectrumscale.org Date: 03/06/2019 10:42 AM Subject: Re: [gpfsug-discuss] gpfsug-discuss mmxcp Sent by: gpfsug-discuss-bounces at spectrumscale.org Curious if this command would be suitable for migration from Scale 4.2 file system to 5.x file system? What is lost or left behind? Edward L. 
Boyd ( Ed ), Client Technical Specialist IBM Systems Storage Solutions US Federal 407-271-9210 Office / Cell / Office / Text eboyd at us.ibm.com email -----gpfsug-discuss-bounces at spectrumscale.org wrote: ----- To: gpfsug-discuss at spectrumscale.org From: gpfsug-discuss-request at spectrumscale.org Sent by: gpfsug-discuss-bounces at spectrumscale.org Date: 03/06/2019 10:03AM Subject: gpfsug-discuss Digest, Vol 86, Issue 11 Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Migrating billions of files? mmfind ... mmxcp (Marc A Kaplan) ---------------------------------------------------------------------- Message: 1 Date: Wed, 6 Mar 2019 10:01:57 -0500 From: "Marc A Kaplan" To: gpfsug main discussion list Cc: gpfsug-discuss-bounces at spectrumscale.org Subject: Re: [gpfsug-discuss] Migrating billions of files? mmfind ... mmxcp Message-ID: < OF18FDF6D8.C850134F-ON852583B5.005243D0-852583B5.0052961B at notes.na.collabserv.com > Content-Type: text/plain; charset="us-ascii" mmxcp may be in samples/ilm if not, perhaps we can put it on an approved file sharing service ... + mmxcp script, for use with mmfind ... -xargs mmxcp ... Which makes parallelized file copy relatively easy and super fast! Usage: /gh/bin/mmxcp -t target -p strip_count source_pathname1 source_pathname2 ... Run "cp" in a mmfind ... -xarg ... pipeline, e.g. mmfind -polFlags '-N all -g /gpfs/tmp' /gpfs/source -gpfsWeight DIRECTORY_HASH -xargs mmxcp -t /target -p 2 Options: -t target_path : Copy files to this path. -p strip_count : Remove this many directory names from the pathnames of the source files. -a : pass -a to cp -v : pass -v to cp -------------- next part -------------- An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20190306/0361a3dd/attachment.html > -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 21994 bytes Desc: not available URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20190306/0361a3dd/attachment.gif > ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 86, Issue 11 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=cvpnBBH0j41aQy0RPiG2xRL_M8mTc1izuQD3_PmtjZ8&m=UpQuMLyiY5RYAlgIz4tU_Ou1f0vzJQeW3YhaTsUNNjg&s=UG74CyaXta-G7ib_KTNz0_ypCbmqWveCUFnV-oPaDYY&e= -------------- next part -------------- An HTML attachment was scrubbed... 
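On Simon's earlier point that mmxcp only drives cp and so does not carry GPFS ACLs: one hedged workaround is to copy the ACLs in a second pass with mmgetacl/mmputacl, roughly as below. This is a sketch only; it assumes the same relative paths exist on both sides, has no error handling, and at "billions of files" scale you would drive it from the same policy/mmfind scan rather than a plain find.

    #!/bin/sh
    # Copy the ACL of every file under $SRC onto its already-copied twin under $TGT.
    SRC=/gpfs/source
    TGT=/target
    find "$SRC" | while read -r f; do
        [ "$f" = "$SRC" ] && continue
        rel=${f#$SRC/}
        mmgetacl -o /tmp/acl.$$ "$f" && mmputacl -i /tmp/acl.$$ "$TGT/$rel"
    done
    rm -f /tmp/acl.$$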
URL: From jmsing at us.ibm.com Wed Mar 6 16:37:01 2019 From: jmsing at us.ibm.com (John M Sing) Date: Wed, 6 Mar 2019 11:37:01 -0500 Subject: [gpfsug-discuss] Question about inodes increase - how to increase non-disruptively - orig question by Mladen Portak on 3/6/19 - 09:46 GMT In-Reply-To: References: Message-ID: Upon further thought, it occurs to me that Spectrum Scale V5's introduction of variable sub-blocks must by necessity have changed the inode calculation that I describe below. I would be interested to know how exactly in Spectrum Scale V5 formatted file systems, how one may need to change the information I document below. I would imagine the pre-V5 file system format probably still uses the inode allocation schema that I document below. John Sing IBM Offering Evangelist, Spectrum Scale, ESS Venice FL From: John M Sing/Tampa/IBM To: gpfsug-discuss at spectrumscale.org Date: 03/06/2019 11:23 AM Subject: Question about inodes increase - how to increase non-disruptively - orig question by Mladen Portak on 3/6/19 - 09:46 GMT Hi, all, Mladen, (This is my first post to the GPFSug-discuss list. I am IBMer, am the IBM worldwide technical support Evangelist on Spectrum Scale/ESS. I am based in Florida. Apologies if my attachment is not permitted or if I did not reply properly to tie my reply to the original poster - pls let me know if there are more instructions or rules for using GPFSug-discuss (I could not find any such guidelines)). ------------- Mladen, Increasing or changing inodes in a GPFS/Spectrum Scale file system can be done non-disruptively, within the boundaries of how GPFS / Spectrum Scale works. I wrote and delivered the following presentation on this topic back in 2013 in the GPFS V4.1 timeframe. While older IBM technologies SONAS/V7000 Unified are the reason the preso was written, and the commands shown are from those now-withdrawn products, the GPFS concepts involved as far as I know have not changed, and you can simply use the GPFS/Spectrum Scale equivalent commands such as mmcrfs, mmcrfileset, mmchfileset, etc to allocate, add, or change inodes non-disruptively, within the boundaries of how GPFS / Spectrum Scale works. There's lots of diagrams. [attachment "sDS05_John_Sing_SONAS_V7000_GPFS_Unified_Independent_Filesets_Inode_Planning.ppt" deleted by John M Sing/Tampa/IBM] The PPT is handy because there is animation in Slideshow mode to better explain (at least in my mind) how GPFS allocates inodes, and how you extend or under what circumstances you can change the number of inodes in either a file system or an independent file set. Here is a Box link to download this 8.7MB preso, should the attachment not come thru or be too big for the list. https://ibm.box.com/shared/static/phn9dypcdbzyn2ei6hy2hc79lgmch904.ppt This Box link, which anyone who has the link can use to download, will expire on Dec 31, 2019. If you are reading this post past that date, just email me and I will be happy to reshare the preso with you. I wrote this up because I myself needed to remember inode allocation especially in light of how GPFS independent filesets works, should I ever need to refer back to it. Happy to hear feedback on the above preso from all of you out there. Corrections/comments/update suggestions welcome. Regards, John M. 
Sing Offering Evangelist, IBM Spectrum Scale, Elastic Storage Server, Spectrum NAS Venice, Florida https://www.linkedin.com/in/johnsing/ jmsing at us.ibm.com office: 941-492-2998 ------------------------------------------------------------------------------------------------------------------------------------------------------------- Mladen Portak?mladen.portak at hr.ibm.com? wrote on Wed Mar 6 09:49:13 GMT 2019 Dear. is it process of increasing inodes disruptive? Thank You Mladen Portak Lab Service SEE Storage Consultant mladen.portak at hr.ibm.com +385 91 6308 293 IBM Hrvatska d.o.o. za proizvodnju i trgovinu Miramarska 23, 10 000 Zagreb, Hrvatska Upisan kod Trgova?kog suda u Zagrebu pod br. 080011422 Temeljni kapital: 788,000.00 kuna - upla?en u cijelosti Direktor: ?eljka Ti?i? ?iro ra?un kod: RAIFFEISENBANK AUSTRIA d.d. Zagreb, Magazinska cesta 69, 10000 Zagreb, Hrvatska IBAN: HR5424840081100396574 (SWIFT RZBHHR2X); OIB 43331467622 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jmsing at us.ibm.com Wed Mar 6 16:42:11 2019 From: jmsing at us.ibm.com (John M Sing) Date: Wed, 6 Mar 2019 11:42:11 -0500 Subject: [gpfsug-discuss] Fw: Question about inodes increase - how to increase non-disruptively - orig question by Mladen Portak on 3/6/19 - 09:46 GMT Message-ID: Hi, all, Mladen, (This is my first post to the GPFSug-discuss list. I am IBMer, am the IBM worldwide technical support Evangelist on Spectrum Scale/ESS. I am based in Florida. Apologies if my attachment URL is not permitted or if I did not reply properly to tie my reply to the original poster - pls let me know if there are more instructions or rules for using GPFSug-discuss (I could not find any such guidelines)). ------------- Mladen, Increasing or changing inodes in a GPFS/Spectrum Scale file system can be done non-disruptively, within the boundaries of how GPFS / Spectrum Scale works. I wrote and delivered the following presentation on this topic back in 2013 in the GPFS V4.1 timeframe. While older IBM technologies SONAS/V7000 Unified are the reason the preso was written, and the commands shown are from those now-withdrawn products, the GPFS concepts involved as far as I know have not changed, and you can simply use the GPFS/Spectrum Scale equivalent commands such as mmcrfs, mmcrfileset, mmchfileset, etc to allocate, add, or change inodes non-disruptively, within the boundaries of how GPFS / Spectrum Scale works. There's lots of diagrams. Here is a Box link to download this 8.7MB preso which anyone who has the link can use to download : https://ibm.box.com/shared/static/phn9dypcdbzyn2ei6hy2hc79lgmch904.ppt This should apply to any Spectrum Scale / GPFS file system this is the the Spectrum Scale V4.x or older format. I would imagine a file system with the newer Scale V5 variable sub-blocks has a modification to the above schema. I'd be interested to know what that is and how V5 users should modify the above diagrams/information. The PPT is handy because there is animation in Slideshow mode to better explain (at least in my mind) how GPFS / Spectrum Scale V4.x and older allocates inodes, and how you extend or under what circumstances you can change the number of inodes in either a file system or an independent file set. This Box link, will expire on Dec 31, 2019. If you are reading this post past that date, just email me and I will be happy to reshare the preso with you. 
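For reference, the pre-V5 mechanics described above come down to a handful of commands, all of which can be run with the file system online (a sketch; the device, fileset name and numbers are examples, and the value after the colon is an optional preallocation):

    # Current usage and limits.
    mmdf gpfs01 -F            # free/allocated inodes per inode space
    mmlsfileset gpfs01 -L     # per-fileset maximum and allocated inodes

    # Raise the limit of an independent fileset.
    mmchfileset gpfs01 labdata --inode-limit 4000000:1000000

    # Raise the file-system-wide (root fileset) limit.
    mmchfs gpfs01 --inode-limit 20000000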
I wrote this up because I myself needed to remember inode allocation especially in light of how GPFS independent filesets works, should I ever need to refer back to it. Happy to hear feedback on the above preso from all of you out there. Corrections/comments/update suggestions welcome. Regards, John M. Sing Offering Evangelist, IBM Spectrum Scale, Elastic Storage Server, Spectrum NAS Venice, Florida https://www.linkedin.com/in/johnsing/ jmsing at us.ibm.com office: 941-492-2998 ------------------------------------------------------------------------------------------------------------------------------------------------------------- Mladen Portak?mladen.portak at hr.ibm.com? wrote on Wed Mar 6 09:49:13 GMT 2019 Dear. is it process of increasing inodes disruptive? Thank You Mladen Portak Lab Service SEE Storage Consultant mladen.portak at hr.ibm.com +385 91 6308 293 IBM Hrvatska d.o.o. za proizvodnju i trgovinu Miramarska 23, 10 000 Zagreb, Hrvatska Upisan kod Trgova?kog suda u Zagrebu pod br. 080011422 Temeljni kapital: 788,000.00 kuna - upla?en u cijelosti Direktor: ?eljka Ti?i? ?iro ra?un kod: RAIFFEISENBANK AUSTRIA d.d. Zagreb, Magazinska cesta 69, 10000 Zagreb, Hrvatska IBAN: HR5424840081100396574 (SWIFT RZBHHR2X); OIB 43331467622 -------------- next part -------------- An HTML attachment was scrubbed... URL: From alex at calicolabs.com Wed Mar 6 17:13:18 2019 From: alex at calicolabs.com (Alex Chekholko) Date: Wed, 6 Mar 2019 09:13:18 -0800 Subject: [gpfsug-discuss] SLURM scripts/policy for data movement into a flash pool? In-Reply-To: References: Message-ID: Hi, I have tried this before and I would like to temper your expectations. If you use a placement policy to allow users to write any files into your "small" pool (e.g. by directory), they will get E_NOSPC when your small pool fills up. And they will be confused because they can't see the pool configuration, they just see a large filesystem with lots of space. I think there may now be an "overflow" policy but it will only work for new files, not if someone keeps writing into an existing file in an existing pool. If you use a migration policy (even based on heat map) it is still a periodic scheduled data movement and not anything that happens "on the fly". Also, "fileheat" only gets updated at some interval anyway. If you use a migration policy to move data between pools, you may starve users of I/O which will confuse your users because suddenly things are slow. I think there is now a QOS way to throttle your data migration. I guess it depends on how much of your disk I/O throughput is not used; if your disks are already churning, migrations will just slow everything down. Think of it less like a cache layer and more like two separate storage locations. If a bunch of jobs want to read the same files from your big pool, it's probably faster to just have them read from the big pool directly rather than have some kind of prologue job to read the data from the big pool, write it into the small poool, then have the jobs read from the small pool. Also, my experience was with pool ratios of like 10%/90%, yours is more like 2%/98%. However, mine were with write-heavy workloads (typical university environment with quickly growing capacity utilization). Hope these anecdotes help. Also, it could be that things work a bit differently now in new versions. Regards, Alex On Wed, Mar 6, 2019 at 3:13 AM Jake Carroll wrote: > Hi Scale-folk. > > I have an IBM ESS GH14S building block currently configured for my HPC > workloads. 
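To make the placement-versus-migration distinction in Alex's reply concrete, a hedged sketch of the two policy pieces follows. Pool, fileset and path names are invented, FILE_HEAT only works once fileHeatPeriodMinutes is configured, and the mmapplypolicy invocation is what a SLURM prolog or a cron job would call; treat the rule details as a starting point rather than a recipe.

    # Placement policy: evaluated at create time, installed with mmchpolicy.
    cat > /tmp/placement.pol <<'EOF'
    RULE 'hot' SET POOL 'flash' FOR FILESET ('hotdata')
    RULE 'default' SET POOL 'data'
    EOF
    mmchpolicy gpfs01 /tmp/placement.pol

    # Migration policy: run periodically, e.g. from a SLURM prolog or cron.
    cat > /tmp/migrate.pol <<'EOF'
    RULE 'warm_up' MIGRATE FROM POOL 'data' WEIGHT(FILE_HEAT)
         TO POOL 'flash' LIMIT(90) WHERE KB_ALLOCATED < 1048576
    RULE 'cool_off' MIGRATE FROM POOL 'flash' THRESHOLD(80,60)
         WEIGHT(CURRENT_TIMESTAMP - ACCESS_TIME) TO POOL 'data'
    EOF
    mmapplypolicy gpfs01 -P /tmp/migrate.pol -N nsdNodes -g /gpfs01/tmp \
        --qos maintenance -I yes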
> > I've got about 1PB of /scratch filesystem configured in mechanical > spindles via GNR and about 20TB of SSD/flash sitting in another GNR > filesystem at the moment. My intention is to destroy that stand-alone flash > filesystem eventually and use storage pools coupled with GPFS policy to > warm up workloads into that flash storage: > > > https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.3/com.ibm.spectrum.scale.v4r23.doc/bl1adv_storagepool.htm > > A little dated, but that kind of thing. > > Does anyone have any experience in this space in using flash storage > inside a pool with pre/post flight SLURM scripts to puppeteer GPFS policy > to warm data up? > > I had a few ideas for policy construction around file size, file count, > file access intensity. Someone mentioned heat map construction and mmdiag > --iohist to me the other day. Could use some background there. > > If anyone has any SLURM specific integration tips for the scheduler or > pre/post flight bits for SBATCH, it'd be really very much appreciated. > > This array really does fly along and surpassed my expectations - but, I > want to get the most out of it that I can for my users - and I think > storage pool automation and good file placement management is going to be > an important part of that. > > Thank you. > > -jc > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From novosirj at rutgers.edu Wed Mar 6 12:55:15 2019 From: novosirj at rutgers.edu (Ryan Novosielski) Date: Wed, 6 Mar 2019 12:55:15 +0000 Subject: [gpfsug-discuss] Question about inodes incrise In-Reply-To: References: , Message-ID: <75613CE6-602B-4792-9F01-E736E7AFF0EA@rutgers.edu> They hadn?t asked, but neither is the process of raising the maximum, which could be what they?re asking about (might be some momentary performance hit ? can?t recall, but I don?t believe it?s significant if so). -- ____ || \\UTGERS, |---------------------------*O*--------------------------- ||_// the State | Ryan Novosielski - novosirj at rutgers.edu || \\ University | Sr. Technologist - 973/972.0922 (2x0922) ~*~ RBHS Campus || \\ of NJ | Office of Advanced Research Computing - MSB C630, Newark `' On Mar 6, 2019, at 06:14, Frederick Stock > wrote: No. It happens automatically and generally without notice to end users, that is they do not see any noticeable pause in operations. If you are asking the question because you are considering pre-allocating all of your inodes I would advise you not take that option. Fred __________________________________________________ Fred Stock | IBM Pittsburgh Lab | 720-430-8821 stockf at us.ibm.com ----- Original message ----- From: "Mladen Portak" > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: [gpfsug-discuss] Question about inodes incrise Date: Wed, Mar 6, 2019 4:49 AM Dear. is it process of increasing inodes disruptiv? Thank You Mladen Portak Lab Service SEE Storage Consultant mladen.portak at hr.ibm.com +385 91 6308 293 IBM Hrvatska d.o.o. za proizvodnju i trgovinu Miramarska 23, 10 000 Zagreb, Hrvatska Upisan kod Trgova?kog suda u Zagrebu pod br. 080011422 Temeljni kapital: 788,000.00 kuna - upla?en u cijelosti Direktor: ?eljka Ti?i? ?iro ra?un kod: RAIFFEISENBANK AUSTRIA d.d. 
Zagreb, Magazinska cesta 69, 10000 Zagreb, Hrvatska IBAN: HR5424840081100396574 (SWIFT RZBHHR2X); OIB 43331467622 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From alvise.dorigo at psi.ch Thu Mar 7 10:15:16 2019 From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI)) Date: Thu, 7 Mar 2019 10:15:16 +0000 Subject: [gpfsug-discuss] Memory accounting for processes writing to GPFS In-Reply-To: References: <83A6EEB0EC738F459A39439733AE80452682711C@MBX214.d.ethz.ch> <410609032.929267.1551877163983@mail.yahoo.com>, Message-ID: <83A6EEB0EC738F459A39439733AE80452682C54A@MBX214.d.ethz.ch> Thanks to all for clarification. A ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Tomer Perry [TOMP at il.ibm.com] Sent: Wednesday, March 06, 2019 2:14 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Memory accounting for processes writing to GPFS It might be the case that AsynchronousFileChannelis actually doing mmap access to the files. Thus, the memory management will be completely different with GPFS in compare to local fs. Regards, Tomer Perry Scalable I/O Development (Spectrum Scale) email: tomp at il.ibm.com 1 Azrieli Center, Tel Aviv 67021, Israel Global Tel: +1 720 3422758 Israel Tel: +972 3 9188625 Mobile: +972 52 2554625 From: Jim Doherty To: gpfsug main discussion list Date: 06/03/2019 06:59 Subject: Re: [gpfsug-discuss] Memory accounting for processes writing to GPFS Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ For any process with a large number of threads the VMM size has become an imaginary number ever since the glibc change to allocate a heap per thread. I look to /proc/$pid/status to find the memory used by a proc RSS + Swap + kernel page tables. Jim On Wednesday, March 6, 2019, 4:25:48 AM EST, Dorigo Alvise (PSI) wrote: Hello to everyone, Here a PSI we're observing something that in principle seems strange (at least to me). We run a Java application writing into disk by mean of a standard AsynchronousFileChannel, whose I do not the details. There are two instances of this application: one runs on a node writing on a local drive, the other one runs writing on a GPFS mounted filesystem (this node is part of the cluster, no remote-mounting). What we do see is that in the former the application has a lower sum VIRT+RES memory and the OS shows a really big cache usage; in the latter, OS's cache is negligible while VIRT+RES is very (even too) high (with VIRT very high). So I wonder what is the difference... Writing into a GPFS mounted filesystem, as far as I understand, implies "talking" to the local mmfsd daemon which fills up its own pagepool... and then the system will asynchronously handle these pages to be written on real pdisk. But why the Linux kernel accounts so much memory to the process itself ? And why this large amount of memory is much more VIRT than RES ? 
thanks in advance, Alvise _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From UWEFALKE at de.ibm.com Thu Mar 7 11:41:18 2019 From: UWEFALKE at de.ibm.com (Uwe Falke) Date: Thu, 7 Mar 2019 12:41:18 +0100 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: <66B1E0A0-723A-4D08-B6D1-D99392E3DE71@ulmer.org> References: <66B1E0A0-723A-4D08-B6D1-D99392E3DE71@ulmer.org> Message-ID: As for "making sure a subjob doesn't finish right after you go home leaving a slot idle for several hours ". That's the reason for the masterscript / control script / whatever. There would be a list of directories sorted to decreasing size, the master script would have a counter for each participating source host (a semaphore) and start as many parallel copy jobs, each with the currently topmost directory in the list, removing that directory (best possibly to an intermediary "in-work" list), counting down the semaphore on each start , unless 0. As soon as a job returns successfully, count up the semaphore, and if >0, start the next job, and so on. I suppose you can easily run about 8 to 12 such jobs per server (maybe best to use dedicated source server - dest server pairs). So, no worries about leaving at any time WRT jobs ending and idle job slots . of course, some precautions should be taken to ensure each job succeeds and gets repeated if not , and a lot of logging should take place to be sure you would know what's happened. Mit freundlichen Gr??en / Kind regards Dr. Uwe Falke IT Specialist High Performance Computing Services / Integrated Technology Services / Data Center Services ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Rathausstr. 7 09111 Chemnitz Phone: +49 371 6978 2165 Mobile: +49 175 575 2877 E-Mail: uwefalke at de.ibm.com ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: Thomas Wolter, Sven Schoo? Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, HRB 17122 From: Stephen Ulmer To: gpfsug main discussion list Date: 06/03/2019 16:55 Subject: Re: [gpfsug-discuss] Follow-up: migrating billions of files Sent by: gpfsug-discuss-bounces at spectrumscale.org In the case where tar -C doesn?t work, you can always use a subshell (I do this regularly): tar -cf . | ssh someguy at otherhost "(cd targetdir; tar -xvf - )" Only use -v on one end. :) Also, for parallel work that?s not designed that way, don't underestimate the -P option to GNU and BSD xargs! With the amount of stuff to be copied, making sure a subjob doesn?t finish right after you go home leaving a slot idle for several hours is a medium deal. In Bob?s case, however, treating it like a DR exercise where users "restore" their own files by accessing them (using AFM instead of HSM) is probably the most convenient. 
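Putting the semaphore idea and the xargs -P tip together, a minimal driver could look like the sketch below. Assumptions: GNU xargs, password-less ssh to a node that has the target file system mounted, and dirs.txt containing one absolute source directory per line sorted largest-first. Note that the pipe needs tar to write to stdout ("tar -cf - ..."), which the quoted one-liner above leaves out.

    #!/bin/sh
    # copydir.sh -- copy one directory tree to the target cluster.
    d="$1"                       # absolute source directory
    TARGET=essio1                # node with the new file system mounted
    TGTROOT=/gpfs/essfs
    tar -C "$(dirname "$d")" -cf - "$(basename "$d")" \
        | ssh "$TARGET" "tar -C $TGTROOT -xf -" \
        && echo "$(date) done $d" || echo "$(date) FAIL $d"

    # Driver: keep 8 tar pipes busy, refilling a slot as soon as one finishes.
    xargs -a dirs.txt -d '\n' -n 1 -P 8 ./copydir.sh >> /var/tmp/copy.log 2>&1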
-- Stephen On Mar 6, 2019, at 8:13 AM, Uwe Falke wrote: Hi, in that case I'd open several tar pipes in parallel, maybe using directories carefully selected, like tar -c | ssh "tar -x" I am not quite sure whether "-C /" for tar works here ("tar -C / -x"), but along these lines might be a good efficient method. target_hosts should be all nodes haveing the target file system mounted, and you should start those pipes on the nodes with the source file system. It is best to start with the largest directories, and use some masterscript to start the tar pipes controlled by semaphores to not overload anything. Mit freundlichen Gr??en / Kind regards Dr. Uwe Falke IT Specialist High Performance Computing Services / Integrated Technology Services / Data Center Services ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Rathausstr. 7 09111 Chemnitz Phone: +49 371 6978 2165 Mobile: +49 175 575 2877 E-Mail: uwefalke at de.ibm.com ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: Thomas Wolter, Sven Schoo? Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, HRB 17122 From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 06/03/2019 13:44 Subject: [gpfsug-discuss] Follow-up: migrating billions of files Sent by: gpfsug-discuss-bounces at spectrumscale.org Some of you had questions to my original post. More information: Source: - Files are straight GPFS/Posix - no extended NFSV4 ACLs - A solution that requires $?s to be spent on software (ie, Aspera) isn?t a very viable option - Both source and target clusters are in the same DC - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN storage - Approx 40 file systems, a few large ones with 300M-400M files each, others smaller - no independent file sets - migration must pose minimal disruption to existing users Target architecture is a small number of file systems (2-3) on ESS with independent filesets - Target (ESS) will have multiple 40gb-E links on each NSD server (GS4) My current thinking is AFM with a pre-populate of the file space and switch the clients over to have them pull data they need (most of the data is older and less active) and them let AFM populate the rest in the background. 
Bob Oesterlin Sr Principal Storage Engineer, Nuance _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=fTuVGtgq6A14KiNeaGfNZzOOgtHW5Lm4crZU6lJxtB8&m=J5RpIj-EzFyU_dM9I4P8SrpHMikte_pn9sbllFcOvyM&s=fEwDQyDSL7hvOVPbg_n8o_LDz-cLqSI6lQtSzmhaSoI&e= _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=fTuVGtgq6A14KiNeaGfNZzOOgtHW5Lm4crZU6lJxtB8&m=4gYLFpEqhJ4XD4RdqwClWf14hrSb2JKrH_EirNxZtuY&s=InZvoRUosC8y-cfwNsRiXvN3fujTLLf4U_uDvPGupoc&e= From jonathan.buzzard at strath.ac.uk Thu Mar 7 11:39:45 2019 From: jonathan.buzzard at strath.ac.uk (Jonathan Buzzard) Date: Thu, 07 Mar 2019 11:39:45 +0000 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: References: Message-ID: On Wed, 2019-03-06 at 12:44 +0000, Oesterlin, Robert wrote: > Some of you had questions to my original post. More information: > > Source: > - Files are straight GPFS/Posix - no extended NFSV4 ACLs > - A solution that requires $?s to be spent on software (ie, Aspera) > isn?t a very viable option > - Both source and target clusters are in the same DC > - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN > storage > - Approx 40 file systems, a few large ones with 300M-400M files each, > others smaller > - no independent file sets > - migration must pose minimal disruption to existing users > > Target architecture is a small number of file systems (2-3) on ESS > with independent filesets > - Target (ESS) will have multiple 40gb-E links on each NSD server > (GS4) > > My current thinking is AFM with a pre-populate of the file space and > switch the clients over to have them pull data they need (most of the > data is older and less active) and them let AFM populate the rest in > the background. > As it's not been mentioned yet "dsmc restore" or equivalent depending on your backup solution. JAB. -- Jonathan A. Buzzard Tel: +44141-5483420 HPC System Administrator, ARCHIE-WeSt. University of Strathclyde, John Anderson Building, Glasgow. G4 0NG From stefan.dietrich at desy.de Thu Mar 7 12:05:13 2019 From: stefan.dietrich at desy.de (Dietrich, Stefan) Date: Thu, 7 Mar 2019 13:05:13 +0100 (CET) Subject: [gpfsug-discuss] CES Ganesha netgroup caching? In-Reply-To: References: <2121724779.6221169.1551340616921.JavaMail.zimbra@desy.de> Message-ID: <1829516345.7374115.1551960313390.JavaMail.zimbra@desy.de> Hi Malhal, thanks for the quick answer! Regards, Stefan ----- Original Message ----- > From: "Malahal R Naineni" > To: gpfsug-discuss at spectrumscale.org > Cc: gpfsug-discuss at spectrumscale.org > Sent: Thursday, February 28, 2019 1:33:50 PM > Subject: Re: [gpfsug-discuss] CES Ganesha netgroup caching? > Ganesha maintains negative and positive cache. Maybe, we should remove negative > cache. A cache entry (either negative or positive) auto expires after 30 > minutes. "ganesha_mgr purge netgroup" removes the entire netgroup cache. > So, if you add a host to the netgroup, it should be able to access exports > immediately provided the host never tried to access in the past. 
If it did, > then it would have been part of negative cache entry and you may need to wait > for 30 minutes. If you remove a host from a netgroups, it may take about 30 > minutes to revoke the access. > Added, "ganesha_mgr purge netgroup" to purge the cache to make the cache > consistent with the actual configuration. It needs to be run on each node. > Regards, Malahal. > > > ----- Original message ----- > From: "Dietrich, Stefan" > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug-discuss at spectrumscale.org > Cc: > Subject: [gpfsug-discuss] CES Ganesha netgroup caching? > Date: Thu, Feb 28, 2019 1:36 PM > Hi, > > I am currently playing around with LDAP netgroups for NFS exports via CES. > However, I could not figure out how long Ganesha is caching the netgroup > entries? > > There is definitely some caching, as adding a host to the netgroup does not > immediately grant access to the share. > A "getent netgroup " on the CES node returns the correct result, so > this is not some other caching effect. > > Resetting the cache via "ganesha_mgr purge netgroup" works, but is probably not > officially supported. > > The CES nodes are running with GPFS 5.0.2.3 and > gpfs.nfs-ganesha-2.5.3-ibm030.01.el7. > CES authentication is set to user-defined, the nodes just use SSSD with a > rfc2307bis LDAP server. > > Regards, > Stefan > > -- > ------------------------------------------------------------------------ > Stefan Dietrich Deutsches Elektronen-Synchrotron (IT-Systems) > Ein Forschungszentrum der Helmholtz-Gemeinschaft > Notkestr. 85 > phone: +49-40-8998-4696 22607 Hamburg > e-mail: stefan.dietrich at desy.de Germany > ------------------------------------------------------------------------ > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > [ http://gpfsug.org/mailman/listinfo/gpfsug-discuss | > http://gpfsug.org/mailman/listinfo/gpfsug-discuss ] > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From vpuvvada at in.ibm.com Thu Mar 7 13:52:12 2019 From: vpuvvada at in.ibm.com (Venkateswara R Puvvada) Date: Thu, 7 Mar 2019 19:22:12 +0530 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: References: Message-ID: AFM based migration provides near-zero downtime and supports migrating EAs/ACLs including immutability attributes (if home is Scale/ESS). I would recommend starting migration in read-only mode, prefetch most of the data and convert the fileset to local-updates (if backup is not needed during the migration) or independent-writer mode before moving the applications to the AFM cache filesets. AFM now supports (from 5.0.2) directory level prefetch with many performance improvements and does not require list-files to be specified. ~Venkat (vpuvvada at in.ibm.com) From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 03/06/2019 06:14 PM Subject: [gpfsug-discuss] Follow-up: migrating billions of files Sent by: gpfsug-discuss-bounces at spectrumscale.org Some of you had questions to my original post. 
More information: Source: - Files are straight GPFS/Posix - no extended NFSV4 ACLs - A solution that requires $?s to be spent on software (ie, Aspera) isn?t a very viable option - Both source and target clusters are in the same DC - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN storage - Approx 40 file systems, a few large ones with 300M-400M files each, others smaller - no independent file sets - migration must pose minimal disruption to existing users Target architecture is a small number of file systems (2-3) on ESS with independent filesets - Target (ESS) will have multiple 40gb-E links on each NSD server (GS4) My current thinking is AFM with a pre-populate of the file space and switch the clients over to have them pull data they need (most of the data is older and less active) and them let AFM populate the rest in the background. Bob Oesterlin Sr Principal Storage Engineer, Nuance _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=92LOlNh2yLzrrGTDA7HnfF8LFr55zGxghLZtvZcZD7A&m=YkRmc5bZTZ4O8u_y9PwCjhzuvVXZmhm-_SNQzKhDt0g&s=DUBqVmYz6ycQjkr-PZk4r5hndMIB1-FVzan1CCzlxRg&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: From valleru at cbio.mskcc.org Thu Mar 7 18:59:03 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Thu, 7 Mar 2019 12:59:03 -0600 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share Message-ID: <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> Hello All, We are thinking of exporting ?remote" GPFS mounts on a remote GPFS 5.0 cluster through a SMB share. I have heard in a previous thread that it is not a good idea to export NFS/SMB share on a remote GPFS mount, and make it writable. The issue that could be caused by making it writable would be metanode swapping between the GPFS clusters. May i understand this better and the seriousness of this issue? The possibility of a single file being written at the same time from a GPFS node and NFS/SMB node is minimum - however it is possible that a file is written at the same time from multiple protocols by mistake and we cannot prevent it. This is the setup: GPFS storage cluster: /gpfs01 GPFS CES cluster ( does not have any storage) : /gpfs01 -> mounted remotely . NFS export /gpfs01 as part of CES cluster GPFS client for CES cluster -> Acts as SMB server and exports /gpfs01 over SMB Are there any other limitations that i need to know for the above setup? We cannot use GPFS CES SMB as of now for few other reasons such as LDAP/AD id mapping and authentication complications. Regards, Lohit -------------- next part -------------- An HTML attachment was scrubbed... URL: From abeattie at au1.ibm.com Thu Mar 7 20:44:59 2019 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Thu, 7 Mar 2019 20:44:59 +0000 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> References: <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> Message-ID: An HTML attachment was scrubbed... 
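For reference, the supported pattern for protocols on a remotely mounted file system is to remote-mount the storage cluster's file system on the CES cluster and export it from there, rather than from a plain client node. Very roughly (cluster, device, path and client names are invented, and the mmauth key exchange between the two clusters is omitted):

    # On the CES (protocol) cluster: remote-mount the storage cluster's fs.
    mmremotecluster add storage.example.org -n nsd1,nsd2 -k storage_key.pub
    mmremotefs add gpfs01 -f gpfs01 -C storage.example.org -T /gpfs01
    mmmount gpfs01 -a

    # Then export through CES instead of a hand-rolled Samba on a client.
    mmnfs export add /gpfs01/projects --client "10.0.0.0/24(Access_Type=RW)"
    mmsmb export add projects /gpfs01/projects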
URL: From valleru at cbio.mskcc.org Thu Mar 7 21:02:54 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Thu, 7 Mar 2019 15:02:54 -0600 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: References: <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> Message-ID: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark> Thank you Andrew. However, we are not using SMB from the CES cluster but instead running a Redhat based SMB on a GPFS client of the CES cluster and exporting it from the GPFS client. Is the above supported, and not known to cause any issues? Regards, Lohit On Mar 7, 2019, 2:45 PM -0600, Andrew Beattie , wrote: > > https://www.ibm.com/support/knowledgecenter/en/STXKQY_5.0.2/com.ibm.spectrum.scale.v5r02.doc/bl1adv_configprotocolsonremotefs.htm -------------- next part -------------- An HTML attachment was scrubbed... URL: From abeattie at au1.ibm.com Thu Mar 7 21:12:31 2019 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Thu, 7 Mar 2019 21:12:31 +0000 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark> References: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark>, <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> Message-ID: An HTML attachment was scrubbed... URL: From valleru at cbio.mskcc.org Thu Mar 7 22:10:25 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Thu, 7 Mar 2019 16:10:25 -0600 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: References: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark> <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> Message-ID: We have many current usernames from LDAP that do not exactly match with the usernames from AD. Unfortunately, i guess CES SMB will need us to use either AD or LDAP or use the same usernames in both AD and LDAP. I have been looking for a solution where could map the different usernames from LDAP and AD but have not found a solution. So exploring ways to do this from RHEL SMB. I would appreciate if you have any solution to this issue. As of now we use LDAP uids/gids and SSH keys for authentication to the HPC cluster. We want to use CES SMB to export the same mounts which have LDAP usernames/uids/gids however because of different usernames in AD - it has become a challenge. Even if we do find a solution to this, i want to be able to use AD authentication for SMB and ssh key authentication for NFS. The above are the reasons we are just using CES with NFS and user defined authentication for users to have access with login through ssh keys. Regards, Lohit On Mar 7, 2019, 3:12 PM -0600, Andrew Beattie , wrote: > That would not be supported > > You shouldn't publish a remote mount Protocol cluster , and then connect a native client to that cluster and create a non CES protocol export > if you are going to use a Protocol cluster that's how you present your protocols. > otherwise don't set up the remote mount cluster. > > Why are you trying to publish a non HA RHEL SMB share instead of using the HA CES protocols? 
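For the LDAP-versus-AD username mismatch driving the stand-alone RHEL Samba approach, plain Samba's username map parameter may already be enough: it rewrites the incoming AD logon name to the local (LDAP) account at session setup, so files keep their LDAP UIDs on disk. A hedged fragment (domain, share and user names are invented, and this helps SMB only, not NFS or ssh):

    # /etc/samba/smb.conf (fragment)
    [global]
        security     = ads
        realm        = AD.EXAMPLE.ORG
        workgroup    = ADDOM
        username map = /etc/samba/username.map

    [gpfs01]
        path      = /gpfs01
        read only = no

    # /etc/samba/username.map  --  "unix (LDAP) user = AD user"
    alice = ADDOM\alice.smith
    bob   = ADDOM\robert.jones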
> Andrew Beattie > File and Object Storage Technical Specialist - A/NZ > IBM Systems - Storage > Phone: 614-2133-7927 > E-mail: abeattie at au1.ibm.com > > > > ----- Original message ----- > > From: valleru at cbio.mskcc.org > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > To: gpfsug-discuss at spectrumscale.org, gpfsug main discussion list > > Cc: > > Subject: Re: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share > > Date: Fri, Mar 8, 2019 7:05 AM > > > > Thank you Andrew. > > > > However, we are not using SMB from the CES cluster but instead running a Redhat based SMB on a GPFS client of the CES cluster and exporting it from the GPFS client. > > Is the above supported, and not known to cause any issues? > > > > Regards, > > Lohit > > > > On Mar 7, 2019, 2:45 PM -0600, Andrew Beattie , wrote: > > > > > > https://www.ibm.com/support/knowledgecenter/en/STXKQY_5.0.2/com.ibm.spectrum.scale.v5r02.doc/bl1adv_configprotocolsonremotefs.htm > > _______________________________________________ > > gpfsug-discuss mailing list > > gpfsug-discuss at spectrumscale.org > > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From abeattie at au1.ibm.com Thu Mar 7 22:52:28 2019 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Thu, 7 Mar 2019 22:52:28 +0000 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: References: , <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark><9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> Message-ID: An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Thu Mar 7 23:00:46 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Thu, 7 Mar 2019 23:00:46 +0000 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: References: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark> <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> , Message-ID: There is a custom Auth mode I think that allows you to use ad for Auth and LDAP for identity. You'd could do what you wanted but you'd need another LDAP instance that mapped the ad usernames to the UID that is only used by SMB. Hack yes. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of valleru at cbio.mskcc.org [valleru at cbio.mskcc.org] Sent: 07 March 2019 22:10 To: gpfsug-discuss at spectrumscale.org; gpfsug main discussion list Subject: Re: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share We have many current usernames from LDAP that do not exactly match with the usernames from AD. Unfortunately, i guess CES SMB will need us to use either AD or LDAP or use the same usernames in both AD and LDAP. I have been looking for a solution where could map the different usernames from LDAP and AD but have not found a solution. So exploring ways to do this from RHEL SMB. I would appreciate if you have any solution to this issue. As of now we use LDAP uids/gids and SSH keys for authentication to the HPC cluster. We want to use CES SMB to export the same mounts which have LDAP usernames/uids/gids however because of different usernames in AD - it has become a challenge. 
Even if we do find a solution to this, i want to be able to use AD authentication for SMB and ssh key authentication for NFS. The above are the reasons we are just using CES with NFS and user defined authentication for users to have access with login through ssh keys. Regards, Lohit On Mar 7, 2019, 3:12 PM -0600, Andrew Beattie , wrote: That would not be supported You shouldn't publish a remote mount Protocol cluster , and then connect a native client to that cluster and create a non CES protocol export if you are going to use a Protocol cluster that's how you present your protocols. otherwise don't set up the remote mount cluster. Why are you trying to publish a non HA RHEL SMB share instead of using the HA CES protocols? Andrew Beattie File and Object Storage Technical Specialist - A/NZ IBM Systems - Storage Phone: 614-2133-7927 E-mail: abeattie at au1.ibm.com ----- Original message ----- From: valleru at cbio.mskcc.org Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org, gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share Date: Fri, Mar 8, 2019 7:05 AM Thank you Andrew. However, we are not using SMB from the CES cluster but instead running a Redhat based SMB on a GPFS client of the CES cluster and exporting it from the GPFS client. Is the above supported, and not known to cause any issues? Regards, Lohit On Mar 7, 2019, 2:45 PM -0600, Andrew Beattie , wrote: https://www.ibm.com/support/knowledgecenter/en/STXKQY_5.0.2/com.ibm.spectrum.scale.v5r02.doc/bl1adv_configprotocolsonremotefs.htm _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From valleru at cbio.mskcc.org Thu Mar 7 23:29:49 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Thu, 7 Mar 2019 17:29:49 -0600 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: References: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark> <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> Message-ID: <0736385e-0371-4295-b665-a745af85ab29@Spark> Thanks a lot Andrew. It does look promising but It does not strike me immediately on how this could solve the SMB export where user authenticates with an AD username but the gpfs files that are present are owned by LDAP username. May be you are saying that if i enable GPFS to use these scripts - then GPFS will map the AD username to the LDAP username? I found this url too.. https://www.ibm.com/support/knowledgecenter/en/SSFKCN/com.ibm.cluster.gpfs.doc/gpfs_uid/uid_gpfs.html I will give it a read, try to understand how to implement it and get back if i have any more questions. If this works, it should help me configure and use the CES SMB. (Hopefully, CES file based authentication will allow both ssh key authentication for NFS and AD for SMB in same CES cluster). Regards, Lohit On Mar 7, 2019, 4:52 PM -0600, Andrew Beattie , wrote: > Lohit > > Have you looked at mmUIDtoName mmNametoUID > > Yes it will require some custom scripting on your behalf but it would be a far more elegant solution and not run the risk of data corruption issues. 
> > There is at least one university on this mailing list that is doing exactly what you are talking about, and they successfully use > mmUIDtoName / mmNametoUID? to provide the relevant mapping between different authentication environments - both internally in the university and externally from other institutions. > > They use AFM to move data between different storage clusters, and mmUIDtoName / mmNametoUID, to manage the ACL and permissions, they then move the data from the AFM filesystem to the HPC scratch filesystem for processing by the HPC (different filesystems within the same cluster) > > > Regards, > Andrew Beattie > File and Object Storage Technical Specialist - A/NZ > IBM Systems - Storage > Phone: 614-2133-7927 > E-mail: abeattie at au1.ibm.com > > > > ----- Original message ----- > > From: valleru at cbio.mskcc.org > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > To: gpfsug-discuss at spectrumscale.org, gpfsug main discussion list > > Cc: > > Subject: Re: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share > > Date: Fri, Mar 8, 2019 8:21 AM > > > > We have many current usernames from LDAP that do not exactly match with the usernames from AD. > > Unfortunately, i guess CES SMB will need us to use either AD or LDAP or use the same usernames in both AD and LDAP. > > I have been looking for a solution where could map the different usernames from LDAP and AD but have not found a solution. So exploring ways to do this from RHEL SMB. > > I would appreciate if you have any solution to this issue. > > > > As of now we use LDAP uids/gids and SSH keys for authentication to the HPC cluster. > > We want to use CES SMB to export the same mounts which have LDAP usernames/uids/gids however because of different usernames in AD - it has become a challenge. > > Even if we do find a solution to this, i want to be able to use AD authentication for SMB and ssh key authentication for NFS. > > > > The above are the reasons we are just using CES with NFS and user defined authentication for users to have access with login through ssh keys. > > > > Regards, > > Lohit > > > > On Mar 7, 2019, 3:12 PM -0600, Andrew Beattie , wrote: > > > That would not be supported > > > > > > You shouldn't publish a remote mount Protocol cluster , and then connect a native client to that cluster and create a non CES protocol export > > > if you are going to use a Protocol cluster that's how you present your protocols. > > > otherwise don't set up the remote mount cluster. > > > > > > Why are you trying to publish a non HA RHEL SMB share instead of using the HA CES protocols? > > > Andrew Beattie > > > File and Object Storage Technical Specialist - A/NZ > > > IBM Systems - Storage > > > Phone: 614-2133-7927 > > > E-mail: abeattie at au1.ibm.com > > > > > > > > > > ----- Original message ----- > > > > From: valleru at cbio.mskcc.org > > > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > To: gpfsug-discuss at spectrumscale.org, gpfsug main discussion list > > > > Cc: > > > > Subject: Re: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share > > > > Date: Fri, Mar 8, 2019 7:05 AM > > > > > > > > Thank you Andrew. > > > > > > > > However, we are not using SMB from the CES cluster but instead running a Redhat based SMB on a GPFS client of the CES cluster and exporting it from the GPFS client. > > > > Is the above supported, and not known to cause any issues? 
> > > > > > > > Regards, > > > > Lohit > > > > > > > > On Mar 7, 2019, 2:45 PM -0600, Andrew Beattie , wrote: > > > > > > > > > > https://www.ibm.com/support/knowledgecenter/en/STXKQY_5.0.2/com.ibm.spectrum.scale.v5r02.doc/bl1adv_configprotocolsonremotefs.htm > > > > _______________________________________________ > > > > gpfsug-discuss mailing list > > > > gpfsug-discuss at spectrumscale.org > > > > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > > > _______________________________________________ > > > gpfsug-discuss mailing list > > > gpfsug-discuss at spectrumscale.org > > > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > > gpfsug-discuss mailing list > > gpfsug-discuss at spectrumscale.org > > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Fri Mar 8 13:05:13 2019 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Fri, 8 Mar 2019 13:05:13 +0000 Subject: [gpfsug-discuss] US Spring User Group Meeting update - April 16-17th, NCAR Boulder Co Message-ID: <76731BC4-8C08-4D18-A052-1E3D34F6111A@nuance.com> Less than 6 weeks until the US Spring user group meeting! Thanks to the team at NCAR and IBM, we have an excellent facility and we?ll be able to offer breakfast, lunch, and evening social event on site. All at no charge to attendees. Detailed agenda coming soon. Register here: https://www.eventbrite.com/e/spectrum-scale-gpfs-user-group-us-spring-2019-meeting-tickets-57035376346 (directions, locations, and suggested hotels) Topics will include: - User Talks - Breakout sessions - Spectrum Scale: The past, the present, the future - Accelerating AI workloads with IBM Spectrum Scale - AI ecosystem and solutions with IBM Spectrum Scale - Spectrum Scale Update - ESS Update - Support Update - Container & Cloud Update - AFM Update - High Performance Tier - Memory Consumption in Spectrum Scale - Spectrum Scale Use Cases - New storage options for Spectrum Scale - Overview - Introduction to Spectrum Scale (For Beginners) Bob Oesterlin/Kristy Kallback-Rose -------------- next part -------------- An HTML attachment was scrubbed... URL: From babbott at rutgers.edu Wed Mar 6 15:11:15 2019 From: babbott at rutgers.edu (William Abbott) Date: Wed, 6 Mar 2019 15:11:15 +0000 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: References: Message-ID: <54153a80-efef-4757-df89-69df1751648e@rutgers.edu> We had a similar situation and ended up using parsyncfp, which generates multiple parallel rsyncs based on file lists. If they're on the same IB fabric (as ours were) you can use that instead of ethernet, and it worked pretty well. One caveat is that you need to follow the parallel transfers with a final single rsync, so you can use --delete. For the initial transfer you can also use bbcp. It can get very good performance but isn't nearly as convenient as rsync for subsequent transfers. The performance isn't good with small files but you can use tar on both ends to deal with that, in a similar way to what Uwe suggests below. The bbcp documentation outlines how to do that. 
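As a rough, untested sketch of that tar-on-both-ends idea over plain ssh, one pipe per top-level directory with a crude throttle (the host name, paths and concurrency figure below are placeholders, not details from this thread; the same tar wrapping applies around bbcp):

#!/bin/bash
# one tar | ssh | tar pipe per top-level source directory; tar on both
# ends avoids the per-file overhead that hurts small-file transfers
SRC=/gpfs/oldfs          # placeholder source mount
DST=/gpfs/newfs          # placeholder target mount on the remote side
TARGET=newnsd01          # placeholder node with the target file system mounted

cd "$SRC" || exit 1
for d in */ ; do
    tar -C "$SRC" -cf - "$d" | ssh "$TARGET" "tar -C $DST -xf -" &
    # crude throttle: never more than 4 pipes in flight at once
    while [ "$(jobs -rp | wc -l)" -ge 4 ]; do sleep 10; done
done
wait

Starting with the largest directories first, as Uwe suggests below, keeps the last few pipes from running long on their own.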
Bill On 3/6/19 8:13 AM, Uwe Falke wrote: > Hi, in that case I'd open several tar pipes in parallel, maybe using > directories carefully selected, like > > tar -c | ssh "tar -x" > > I am not quite sure whether "-C /" for tar works here ("tar -C / -x"), but > along these lines might be a good efficient method. target_hosts should be > all nodes haveing the target file system mounted, and you should start > those pipes on the nodes with the source file system. > It is best to start with the largest directories, and use some > masterscript to start the tar pipes controlled by semaphores to not > overload anything. > > > > Mit freundlichen Gr??en / Kind regards > > > Dr. Uwe Falke > > IT Specialist > High Performance Computing Services / Integrated Technology Services / > Data Center Services > ------------------------------------------------------------------------------------------------------------------------------------------- > IBM Deutschland > Rathausstr. 7 > 09111 Chemnitz > Phone: +49 371 6978 2165 > Mobile: +49 175 575 2877 > E-Mail: uwefalke at de.ibm.com > ------------------------------------------------------------------------------------------------------------------------------------------- > IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: > Thomas Wolter, Sven Schoo? > Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, > HRB 17122 > > > > > From: "Oesterlin, Robert" > To: gpfsug main discussion list > Date: 06/03/2019 13:44 > Subject: [gpfsug-discuss] Follow-up: migrating billions of files > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Some of you had questions to my original post. More information: > > Source: > - Files are straight GPFS/Posix - no extended NFSV4 ACLs > - A solution that requires $?s to be spent on software (ie, Aspera) isn?t > a very viable option > - Both source and target clusters are in the same DC > - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN storage > - Approx 40 file systems, a few large ones with 300M-400M files each, > others smaller > - no independent file sets > - migration must pose minimal disruption to existing users > > Target architecture is a small number of file systems (2-3) on ESS with > independent filesets > - Target (ESS) will have multiple 40gb-E links on each NSD server (GS4) > > My current thinking is AFM with a pre-populate of the file space and > switch the clients over to have them pull data they need (most of the data > is older and less active) and them let AFM populate the rest in the > background. 
> > > Bob Oesterlin > Sr Principal Storage Engineer, Nuance > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > https://nam02.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.proofpoint.com%2Fv2%2Furl%3Fu%3Dhttp-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss%26d%3DDwICAg%26c%3Djf_iaSHvJObTbx-siA1ZOg%26r%3DfTuVGtgq6A14KiNeaGfNZzOOgtHW5Lm4crZU6lJxtB8%26m%3DJ5RpIj-EzFyU_dM9I4P8SrpHMikte_pn9sbllFcOvyM%26s%3DfEwDQyDSL7hvOVPbg_n8o_LDz-cLqSI6lQtSzmhaSoI%26e&data=02%7C01%7Cbabbott%40rutgers.edu%7C8cbda3d651584119393808d6a2358544%7Cb92d2b234d35447093ff69aca6632ffe%7C1%7C0%7C636874748092821399&sdata=W06i8IWqrxgEmdp3htxad0euiRhA6%2Bexd3YAziSrUhg%3D&reserved=0= > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > https://nam02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7Cbabbott%40rutgers.edu%7C8cbda3d651584119393808d6a2358544%7Cb92d2b234d35447093ff69aca6632ffe%7C1%7C0%7C636874748092821399&sdata=Pjf4RhUchThoFvWI7hLJO4eWhoTXnIYd9m7Mvf809iE%3D&reserved=0 > From valleru at cbio.mskcc.org Fri Mar 8 15:01:13 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Fri, 8 Mar 2019 09:01:13 -0600 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: <54153a80-efef-4757-df89-69df1751648e@rutgers.edu> References: <54153a80-efef-4757-df89-69df1751648e@rutgers.edu> Message-ID: I had to do this twice too. Once i had to copy a 4 PB filesystem as fast as possible when NSD disk descriptors were corrupted and shutting down GPFS would have led to me loosing those files forever, and the other was a regular maintenance but had to copy similar data in less time. In both the cases, i just used GPFS provided util scripts in?/usr/lpp/mmfs/samples/util/ ?. These could be run only as root i believe. I wish i could give them to users to use. I had used few of those scripts like?tsreaddir which used to be really fast in listing all the paths in the directories. It prints full paths of all files along with there inodes etc. I had modified it to print just the full file paths. I then use these paths and group them up in different groups which gets fed into a array jobs to the SGE/LSF cluster. Each array jobs basically uses GNU parallel and running something similar to rsync -avR . The ?-R? option basically creates the directories as given. Of course this worked because i was using the fast private network to transfer between the storage systems. Also i know that cp or tar might be better than rsync with respect to speed, but rsync was convenient and i could always start over again without checkpointing or remembering where i left off previously. Similar to how Bill mentioned in the previous email, but i used gpfs util scripts and basic GNU parallel/rsync, SGE/LSF to submit jobs to the cluster as superuser. It used to work pretty well. Since then - I constantly use parallel and rsync to copy large directories. Thank you, Lohit On Mar 8, 2019, 7:43 AM -0600, William Abbott , wrote: > We had a similar situation and ended up using parsyncfp, which generates > multiple parallel rsyncs based on file lists. If they're on the same IB > fabric (as ours were) you can use that instead of ethernet, and it > worked pretty well. One caveat is that you need to follow the parallel > transfers with a final single rsync, so you can use --delete. 
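A minimal sketch of that pattern, splitting one big file list into chunks, running the chunks in parallel and finishing with the single --delete pass (paths, host name and the 16/8 figures are placeholders; add -H/-A/-X or the GPFS-patched rsync as needed for links, ACLs and attributes):

# split a policy-engine (or tsreaddir) file list into 16 line-aligned chunks
split -n l/16 /tmp/filelist /tmp/chunk.

# one rsync per chunk, at most 8 at a time
ls /tmp/chunk.* | xargs -P 8 -I{} \
    rsync -a --files-from={} /gpfs/oldfs/ newnsd01:/gpfs/newfs/

# --files-from and --delete do not combine usefully, so one final
# whole-tree pass handles files removed at the source since the copy
rsync -a --delete /gpfs/oldfs/ newnsd01:/gpfs/newfs/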
> > For the initial transfer you can also use bbcp. It can get very good > performance but isn't nearly as convenient as rsync for subsequent > transfers. The performance isn't good with small files but you can use > tar on both ends to deal with that, in a similar way to what Uwe > suggests below. The bbcp documentation outlines how to do that. > > Bill > > On 3/6/19 8:13 AM, Uwe Falke wrote: > > Hi, in that case I'd open several tar pipes in parallel, maybe using > > directories carefully selected, like > > > > tar -c | ssh "tar -x" > > > > I am not quite sure whether "-C /" for tar works here ("tar -C / -x"), but > > along these lines might be a good efficient method. target_hosts should be > > all nodes haveing the target file system mounted, and you should start > > those pipes on the nodes with the source file system. > > It is best to start with the largest directories, and use some > > masterscript to start the tar pipes controlled by semaphores to not > > overload anything. > > > > > > > > Mit freundlichen Gr??en / Kind regards > > > > > > Dr. Uwe Falke > > > > IT Specialist > > High Performance Computing Services / Integrated Technology Services / > > Data Center Services > > ------------------------------------------------------------------------------------------------------------------------------------------- > > IBM Deutschland > > Rathausstr. 7 > > 09111 Chemnitz > > Phone: +49 371 6978 2165 > > Mobile: +49 175 575 2877 > > E-Mail: uwefalke at de.ibm.com > > ------------------------------------------------------------------------------------------------------------------------------------------- > > IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: > > Thomas Wolter, Sven Schoo? > > Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, > > HRB 17122 > > > > > > > > > > From: "Oesterlin, Robert" > To: gpfsug main discussion list > Date: 06/03/2019 13:44 > > Subject: [gpfsug-discuss] Follow-up: migrating billions of files > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > > > > Some of you had questions to my original post. More information: > > > > Source: > > - Files are straight GPFS/Posix - no extended NFSV4 ACLs > > - A solution that requires $?s to be spent on software (ie, Aspera) isn?t > > a very viable option > > - Both source and target clusters are in the same DC > > - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN storage > > - Approx 40 file systems, a few large ones with 300M-400M files each, > > others smaller > > - no independent file sets > > - migration must pose minimal disruption to existing users > > > > Target architecture is a small number of file systems (2-3) on ESS with > > independent filesets > > - Target (ESS) will have multiple 40gb-E links on each NSD server (GS4) > > > > My current thinking is AFM with a pre-populate of the file space and > > switch the clients over to have them pull data they need (most of the data > > is older and less active) and them let AFM populate the rest in the > > background. 
> > > > > > Bob Oesterlin > > Sr Principal Storage Engineer, Nuance > > _______________________________________________ > > gpfsug-discuss mailing list > > gpfsug-discuss at spectrumscale.org > > https://nam02.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.proofpoint.com%2Fv2%2Furl%3Fu%3Dhttp-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss%26d%3DDwICAg%26c%3Djf_iaSHvJObTbx-siA1ZOg%26r%3DfTuVGtgq6A14KiNeaGfNZzOOgtHW5Lm4crZU6lJxtB8%26m%3DJ5RpIj-EzFyU_dM9I4P8SrpHMikte_pn9sbllFcOvyM%26s%3DfEwDQyDSL7hvOVPbg_n8o_LDz-cLqSI6lQtSzmhaSoI%26e&data=02%7C01%7Cbabbott%40rutgers.edu%7C8cbda3d651584119393808d6a2358544%7Cb92d2b234d35447093ff69aca6632ffe%7C1%7C0%7C636874748092821399&sdata=W06i8IWqrxgEmdp3htxad0euiRhA6%2Bexd3YAziSrUhg%3D&reserved=0= > > > > > > > > > > > > _______________________________________________ > > gpfsug-discuss mailing list > > gpfsug-discuss at spectrumscale.org > > https://nam02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7Cbabbott%40rutgers.edu%7C8cbda3d651584119393808d6a2358544%7Cb92d2b234d35447093ff69aca6632ffe%7C1%7C0%7C636874748092821399&sdata=Pjf4RhUchThoFvWI7hLJO4eWhoTXnIYd9m7Mvf809iE%3D&reserved=0 > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtucker at pixitmedia.com Fri Mar 8 16:08:14 2019 From: jtucker at pixitmedia.com (Jez Tucker) Date: Fri, 8 Mar 2019 16:08:14 +0000 Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another In-Reply-To: References: <827394bcbb794a0d9bd5bd8341fc1593@IN-CCI-D1S14.ads.iu.edu> Message-ID: <7432d104-4780-2224-19a3-e080078dbe74@pixitmedia.com> Hi ? I feel as an 'other products do exist' I should also mention Ngenea and APSync which could meet the technical requirements of these use cases. Ngenea allows you to bring data in from 'cloud' and also of interest in this specific use case, POSIX filesystems or filer islands.? You can present the remote data available locally and then inflate the data either on demand or via enacted process.? Massively parallel, multi-node, highly threaded with extremely granular rules based control.?? You can also migrate data back to your filer re-utilising such islands as tiers.? You can even use it to 'virtually tier' within GPFS/Scale filesystems, alike a 'hardlink across independent filesets'.? Or even across Global WANs for true 24x7 follow-the-sun working practices. APSync also provides a differently patched version of rsync and builds on top of the 'SnapDiff' technology previously presented at the UG whereby you don't need to re-scan your entire filesystem for each sync and thus can do incremental changes for create, modified, deleted and _track moved files_.? Handy and extremely time saving over regularised full runs.? Massively parallel, multi-node, highly threaded (a common theme with our tools...). As I don't do sales; if anyone wants to talk tech nuts-and-bolts with me about these, or you have challenges (and I love a challenge..) by all means hit me up directly.? I like solving people's blockers :-) Happy Friday ppl, Jez On 05/03/2019 21:38, Simon Thompson wrote: > DDN also have a paid for product for doing moving of data (data flow) We found out about it after we did a massive data migration... > > I can't comment on it other than being aware of it. 
Sure your local DDN sales person can help. > > But if only IBM supported some sort of restripe to new block size, we wouldn't have to do this mass migration :-P > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Simon Thompson [S.J.Thompson at bham.ac.uk] > Sent: 05 March 2019 16:38 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] suggestions forwar copying one GPFS file system into another > > I wrote a patch to mpifileutils which will copy gpfs attributes, but when we played with it with rsync, something was obviously still different about the attrs from each, so use with care. > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Ratliff, John [jdratlif at iu.edu] > Sent: 05 March 2019 16:21 > To: gpfsug-discuss at spectrumscale.org > Subject: [gpfsug-discuss] suggestions for copying one GPFS file system into another > > We use a GPFS file system for our computing clusters and we?re working on moving to a new SAN. > > We originally tried AFM, but it didn?t seem to work very well. We tried to do a prefetch on a test policy scan of 100 million files, and after 24 hours it hadn?t pre-fetched anything. It wasn?t clear what was happening. Some smaller tests succeeded, but the NFSv4 ACLs did not seem to be transferred. > > Since then we started using rsync with the GPFS attrs patch. We have over 600 million files and 700 TB. I split up the rsync tasks with lists of files generated by the policy engine and we transferred the original data in about 2 weeks. Now we?re working on final synchronization. I?d like to use one of the delete options to remove files that were sync?d earlier and then deleted. This can?t be combined with the files-from option, so it?s harder to break up the rsync tasks. Some of the directories I?m running this against have 30-150 million files each. This can take quite some time with a single rsync process. > > I?m also wondering if any of my rsync options are unnecessary. I was using avHAXS and numeric-ids. I?m thinking the A (acls) and X (xatttrs) might be unnecessary with GPFS->GPFS. We?re only using NFSv4 GPFS ACLs. I don?t know if GPFS uses any xattrs that rsync would sync or not. Removing those two options removed several system calls, which should make it much faster, but I want to make sure I?m syncing correctly. Also, it seems there is a problem with the GPFS patch on rsync where it will always give an error trying to get GPFS attributes on a symlink, which means it doesn?t sync any symlinks when using that option. So you can rsync symlinks or GPFS attrs, but not both at the same time. This has lead to me running two rsyncs, one to get all files and one to get all attributes. > > Thanks for any ideas or suggestions. > > John Ratliff | Pervasive Technology Institute | UITS | Research Storage ? 
Indiana University | http://pti.iu.edu > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- *Jez Tucker* Head of Research and Development, Pixit Media 07764193820 | jtucker at pixitmedia.com www.pixitmedia.com | Tw:@pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From valleru at cbio.mskcc.org Fri Mar 8 16:42:17 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Fri, 8 Mar 2019 10:42:17 -0600 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: <186F603F-A278-433F-AE2C-7080EBA94AC9@bham.ac.uk> References: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark> <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> <036e8839-90be-421a-844c-fd7d299f92d4@Spark> <186F603F-A278-433F-AE2C-7080EBA94AC9@bham.ac.uk> Message-ID: Thank you Simon. I do remember reading your page about few years back, when i was researching this issue. When you mentioned Custom Auth. I assumed it to be user-defined authentication from CES. However, looks like i need to hack it a bit to get SMB working with AD? I did not feel comfortable hacking the SMB from the CES cluster, and thus i was trying to bring up SMB outside the CES cluster. I almost hack with everything in the cluster but i leave GPFS and any of its configuration in the supported config, because if things break - i felt it might mess up things real bad. I wish we do not have to hack our way out of this, and IBM supported this config out of the box. I do not understand the current requirements from CES with respect to AD or user defined authentication where either both SMB and NFS should be AD/LDAP authenticated or both of them user defined. I believe many places do use just ssh-key as authentication for linux machines including the cloud instances, while SMB obviously cannot be used with ssh-key authentication and has to be used either with LDAP or AD authentication. Did anyone try to raise this as a feature request? Even if i do figure to hack this thing and make sure that updating CES won?t mess it up badly. I think i will have to do few things to get the SIDs to Uids match as you mentioned. We do not use passwords to authenticate to LDAP and I do not want to be creating another set of passwords apart from AD which is already existing, and users authenticate to it when they login to machines. I was thinking to bring up something like Redhat IDM that could sync with AD and get all the usernames/sids and password hashes. I could then enter my current LDAP uids/gids in the Redhat IDM. IDM will automatically create uids/gids for usernames that do not have them i believe. 
In this way, when SMB authenticates with Redhat IDM - users can use there current AD kerberos tickets or the same passwords and i do not have to change the passwords. It will also automatically sync with AD and create UIDs/GIDs and thus i don?t have to manually script something to create one for every person in AD. I however need to see if i could get to make this work with institutional AD and it might not be as smooth. So which of the below cases will IBM most probably support? :) 1. Run SMB outside the CES cluster with the above configuration. 2. Hack SMB inside the CES cluster Is it that running SMB outside the CES cluster with R/W has a possibility of corrupting the GPFS filesystem? We do not necessarily need HA with SMB and so apart from HA - What does IBM SMB do that would prevent such corruption from happening? The reason i was expecting the usernames to be same in LDAP and AD is because - if they are, then SMB will do uid mapping by default. i.e SMB will automatically map windows sids to ldap uids. I will not have to bring up Redhat IDM if this was the case. But unfortunately we have many users who have different ldap usernames from AD usernames - so i guess the practical way would be to use Redhat IDM to map windows sids to ldap uids. I have read about mmname2uid and mmuid2name that Andrew mentioned but looks like it is made to work between 2 gpfs clusters with different uids. Not exactly to make SMB map windows SIDs to ldap uids. Regards, Lohit On Mar 8, 2019, 2:41 AM -0600, Simon Thompson , wrote: > Hi Lohit, > > Custom auth sounds like it would work. > > NFS uses the ?system? ldap, SMB can use LDAP or AD, or you can fudge it and actually use both. We came at this very early in CES and I think some of this is better in mixed mode now, but we do something vaguely related to what you need. > > What you?d need is data in your ldap server to map windows usernames and SIDs to Unix IDs. So for example we have in our mmsmb config: > idmap config * : backend?????????? ldap > idmap config * : bind_path_group?? ou=SidMap,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > idmap config * : ldap_base_dn????? ou=SidMap,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > idmap config * : ldap_server?????? stand-alone > idmap config * : ldap_url????????? ldap://localhost > idmap config * : ldap_user_dn????? uid=nslcd,ou=People,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > idmap config * : range???????????? 1000-9999999 > idmap config * : rangesize???????? 1000000 > idmap config * : read only???????? 
yes > > You then need entries in the LDAP server, it could be a different server or somewhere else in the schema, but basically LDAP entries that map windows username/sid to underlying UID, e.g: > > dn: uid=USERNAME,ou=People,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > uid: USERNAME > objectClass: top > objectClass: posixAccount > objectClass: account > objectClass: shadowAccount > loginShell: /bin/bash > uidNumber: 605436 > shadowMax: 99999 > gidNumber: 100 > homeDirectory: /rds/homes/u/USERNAME > cn: USERS DISPLAY NAME > structuralObjectClass: account > entryUUID: 85a18df0-88bd-1037-9152-418eb0c7777 > creatorsName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > createTimestamp: 20180108124516Z > entryCSN: 20180108124516.623983Z#000000#001#000000 > modifiersName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > modifyTimestamp: 20180108124516Z > > dn: sambaSID=S-1-5-21-1390067357-308236825-725345543-498888,ou=SidMap,dc=rds > ,dc=adf,dc=bham,dc=ac,dc=uk > objectClass: sambaIdmapEntry > objectClass: sambaSidEntry > sambaSID: S-1-5-21-1390067357-308236825-725345543-498888 > uidNumber: 605436 > structuralObjectClass: sambaSidEntry > entryUUID: 85efa490-88bd-1037-9153-418eb0c9999 > creatorsName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > createTimestamp: 20180108124517Z > entryCSN: 20180108124517.135744Z#000000#001#000000 > modifiersName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > modifyTimestamp: 20180108124517Z > > I don?t think SMB actually cares about the username matching, what it needs to be able to do is resolve the Windows SID presented to the Unix UID underneath which is how it then accesses files. i.e. it doesn?t really matter what the username in the middle is ? > > Supported config? No. Works for what you need? Probably ... > > I wrote this: https://www.roamingzebra.co.uk/2015/07/smb-protocol-support-with-spectrum.html back in 2015 about what we were doing, probably much of it stands, but you might want to look at proper supported mixed mode. That is our plan at some point. > > Simon > > From: "valleru at cbio.mskcc.org" > Date: Friday, 8 March 2019 at 00:08 > To: "Simon Thompson (IT Research Support)" > Subject: Re: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share > > Thank you Simon. > > First issue: > I believe what i would need is a combination of user-defined authentication and ad authentication. > > User-defined authentication to help me export NFS and have the linux clients authenticate users with ssh keys. > AD based authentication to help me export SMB with AD authentication/kerberos to mount filesystem on windows connected to just AD. > > At first look, it looked like CES either supports user-defined authentication or AD based authentication - which would not work. We do not use kerberos or ldap passwords for accessing the HPC clusters. > > Second issue: > AD username to LDAP username mapping. I could bring up another AD/LDAP server that has the AD usernames and LDAP uids just for SMB authentication but i would need to do this for all the users in the agency. > I will try and research if this way is easier or the mmNametoUID. > > > Regards, > Lohit > > On Mar 7, 2019, 5:00 PM -0600, Simon Thompson , wrote: > > > > > custom Auth mode -------------- next part -------------- An HTML attachment was scrubbed... 
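One quick way to sanity-check the SID-to-UID mapping described in the idmap and LDAP entries above is to ask winbind directly from a protocol node, assuming the wbinfo shipped with the gpfs.smb packages is available under /usr/lpp/mmfs/bin (the domain and user name below are placeholders; the SID is the one from the example entry above):

# resolve the Windows name to its SID against the domain
/usr/lpp/mmfs/bin/wbinfo -n 'DOMAIN\someuser'

# resolve that SID to a UID through the idmap ldap backend entries
/usr/lpp/mmfs/bin/wbinfo -S S-1-5-21-1390067357-308236825-725345543-498888

# the UID returned should match what the ssh/NFS side resolves
id someuser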
URL: From valleru at cbio.mskcc.org Fri Mar 8 16:52:13 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Fri, 8 Mar 2019 10:52:13 -0600 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: References: <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark> <9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark> <036e8839-90be-421a-844c-fd7d299f92d4@Spark> <186F603F-A278-433F-AE2C-7080EBA94AC9@bham.ac.uk> Message-ID: Well, reading the user-defined authentication documentation again. It is basically left to sysadmins to deal with authentication and it looks like it would not be so much of a hack, to customize smb on CES nodes according to our needs. I will see if i could do this without much trouble. Regards, Lohit On Mar 8, 2019, 10:42 AM -0600, valleru at cbio.mskcc.org, wrote: > Thank you Simon. > > I do remember reading your page about few years back, when i was researching this issue. > When you mentioned Custom Auth. I assumed it to be user-defined authentication from CES. However, looks like i need to hack it a bit to get SMB working with AD? > > I did not feel comfortable hacking the SMB from the CES cluster, and thus i was trying to bring up SMB outside the CES cluster. I almost hack with everything in the cluster but i leave GPFS and any of its configuration in the supported config, because if things break - i felt it might mess up things real bad. > I wish we do not have to hack our way out of this, and IBM supported this config out of the box. > > I do not understand the current requirements from CES with respect to AD or user defined authentication where either both SMB and NFS should be AD/LDAP authenticated or both of them user defined. > > I believe many places do use just ssh-key as authentication for linux machines including the cloud instances, while SMB obviously cannot be used with ssh-key authentication and has to be used either with LDAP or AD authentication. > > Did anyone try to raise this as a feature request? > > Even if i do figure to hack this thing and make sure that updating CES won?t mess it up badly. I think i will have to do few things to get the SIDs to Uids match as you mentioned. > We do not use passwords to authenticate to LDAP and I do not want to be creating another set of passwords apart from AD which is already existing, and users authenticate to it when they login to machines. > > I was thinking to bring up something like Redhat IDM that could sync with AD and get all the usernames/sids and password hashes. I could then enter my current LDAP uids/gids in the Redhat IDM. IDM will automatically create uids/gids for usernames that do not have them i believe. > In this way, when SMB authenticates with Redhat IDM - users can use there current AD kerberos tickets or the same passwords and i do not have to change the passwords. > It will also automatically sync with AD and create UIDs/GIDs and thus i don?t have to manually script something to create one for every person in AD. > I however need to see if i could get to make this work with institutional AD and it might not be as smooth. > > So which of the below cases will IBM most probably support? :) > > 1. Run SMB outside the CES cluster with the above configuration. > 2. Hack SMB inside the CES cluster > > Is it that running SMB outside the CES cluster with R/W has a possibility of corrupting the GPFS filesystem? > We do not necessarily need HA with SMB and so apart from HA - What does IBM SMB do that would prevent such corruption from happening? 
> > The reason i was expecting the usernames to be same in LDAP and AD is because - if they are, then SMB will do uid mapping by default. i.e SMB will automatically map windows sids to ldap uids. I will not have to bring up Redhat IDM if this was the case. But unfortunately we have many users who have different ldap usernames from AD usernames - so i guess the practical way would be to use Redhat IDM to map windows sids to ldap uids. > > I have read about mmname2uid and mmuid2name that Andrew mentioned but looks like it is made to work between 2 gpfs clusters with different uids. Not exactly to make SMB map windows SIDs to ldap uids. > > Regards, > Lohit > > On Mar 8, 2019, 2:41 AM -0600, Simon Thompson , wrote: > > Hi Lohit, > > > > Custom auth sounds like it would work. > > > > NFS uses the ?system? ldap, SMB can use LDAP or AD, or you can fudge it and actually use both. We came at this very early in CES and I think some of this is better in mixed mode now, but we do something vaguely related to what you need. > > > > What you?d need is data in your ldap server to map windows usernames and SIDs to Unix IDs. So for example we have in our mmsmb config: > > idmap config * : backend?????????? ldap > > idmap config * : bind_path_group?? ou=SidMap,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > idmap config * : ldap_base_dn????? ou=SidMap,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > idmap config * : ldap_server?????? stand-alone > > idmap config * : ldap_url????????? ldap://localhost > > idmap config * : ldap_user_dn????? uid=nslcd,ou=People,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > idmap config * : range???????????? 1000-9999999 > > idmap config * : rangesize???????? 1000000 > > idmap config * : read only???????? yes > > > > You then need entries in the LDAP server, it could be a different server or somewhere else in the schema, but basically LDAP entries that map windows username/sid to underlying UID, e.g: > > > > dn: uid=USERNAME,ou=People,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > uid: USERNAME > > objectClass: top > > objectClass: posixAccount > > objectClass: account > > objectClass: shadowAccount > > loginShell: /bin/bash > > uidNumber: 605436 > > shadowMax: 99999 > > gidNumber: 100 > > homeDirectory: /rds/homes/u/USERNAME > > cn: USERS DISPLAY NAME > > structuralObjectClass: account > > entryUUID: 85a18df0-88bd-1037-9152-418eb0c7777 > > creatorsName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > createTimestamp: 20180108124516Z > > entryCSN: 20180108124516.623983Z#000000#001#000000 > > modifiersName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > modifyTimestamp: 20180108124516Z > > > > dn: sambaSID=S-1-5-21-1390067357-308236825-725345543-498888,ou=SidMap,dc=rds > > ,dc=adf,dc=bham,dc=ac,dc=uk > > objectClass: sambaIdmapEntry > > objectClass: sambaSidEntry > > sambaSID: S-1-5-21-1390067357-308236825-725345543-498888 > > uidNumber: 605436 > > structuralObjectClass: sambaSidEntry > > entryUUID: 85efa490-88bd-1037-9153-418eb0c9999 > > creatorsName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > createTimestamp: 20180108124517Z > > entryCSN: 20180108124517.135744Z#000000#001#000000 > > modifiersName: cn=Manager,dc=rds,dc=adf,dc=bham,dc=ac,dc=uk > > modifyTimestamp: 20180108124517Z > > > > I don?t think SMB actually cares about the username matching, what it needs to be able to do is resolve the Windows SID presented to the Unix UID underneath which is how it then accesses files. i.e. it doesn?t really matter what the username in the middle is ? > > > > Supported config? No. Works for what you need? 
Probably ... > > > > I wrote this: https://www.roamingzebra.co.uk/2015/07/smb-protocol-support-with-spectrum.html back in 2015 about what we were doing, probably much of it stands, but you might want to look at proper supported mixed mode. That is our plan at some point. > > > > Simon > > > > From: "valleru at cbio.mskcc.org" > > Date: Friday, 8 March 2019 at 00:08 > > To: "Simon Thompson (IT Research Support)" > > Subject: Re: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share > > > > Thank you Simon. > > > > First issue: > > I believe what i would need is a combination of user-defined authentication and ad authentication. > > > > User-defined authentication to help me export NFS and have the linux clients authenticate users with ssh keys. > > AD based authentication to help me export SMB with AD authentication/kerberos to mount filesystem on windows connected to just AD. > > > > At first look, it looked like CES either supports user-defined authentication or AD based authentication - which would not work. We do not use kerberos or ldap passwords for accessing the HPC clusters. > > > > Second issue: > > AD username to LDAP username mapping. I could bring up another AD/LDAP server that has the AD usernames and LDAP uids just for SMB authentication but i would need to do this for all the users in the agency. > > I will try and research if this way is easier or the mmNametoUID. > > > > > > Regards, > > Lohit > > > > On Mar 7, 2019, 5:00 PM -0600, Simon Thompson , wrote: > > > > > > > > custom Auth mode > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Fri Mar 8 21:43:36 2019 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Fri, 8 Mar 2019 16:43:36 -0500 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: References: <54153a80-efef-4757-df89-69df1751648e@rutgers.edu> Message-ID: Lohit... Any and all of those commands and techniques should still work with newer version of GPFS. But mmapplypolicy is the supported command for generating file lists. It uses the GPFS APIs and some parallel processing tricks. mmfind is a script that make it easier to write GPFS "policy rules" and runs mmapplypolicy for you. mmxcp can be used with mmfind (and/or mmapplypolicy) to make it easy to run a cp (or other command) in parallel on those filelists ... --marc K of GPFS From: valleru at cbio.mskcc.org To: ""gpfsug-discuss<""gpfsug-discuss at spectrumscale.org ", gpfsug main discussion list Date: 03/08/2019 10:13 AM Subject: Re: [gpfsug-discuss] Follow-up: migrating billions of files Sent by: gpfsug-discuss-bounces at spectrumscale.org I had to do this twice too. Once i had to copy a 4 PB filesystem as fast as possible when NSD disk descriptors were corrupted and shutting down GPFS would have led to me loosing those files forever, and the other was a regular maintenance but had to copy similar data in less time. In both the cases, i just used GPFS provided util scripts in /usr/lpp/mmfs/samples/util/ . These could be run only as root i believe. I wish i could give them to users to use. I had used few of those scripts like tsreaddir which used to be really fast in listing all the paths in the directories. It prints full paths of all files along with there inodes etc. I had modified it to print just the full file paths. 
I then use these paths and group them up in different groups which gets fed into a array jobs to the SGE/LSF cluster. Each array jobs basically uses GNU parallel and running something similar to rsync -avR . The ?-R? option basically creates the directories as given. Of course this worked because i was using the fast private network to transfer between the storage systems. Also i know that cp or tar might be better than rsync with respect to speed, but rsync was convenient and i could always start over again without checkpointing or remembering where i left off previously. Similar to how Bill mentioned in the previous email, but i used gpfs util scripts and basic GNU parallel/rsync, SGE/LSF to submit jobs to the cluster as superuser. It used to work pretty well. Since then - I constantly use parallel and rsync to copy large directories. Thank you, Lohit On Mar 8, 2019, 7:43 AM -0600, William Abbott , wrote: We had a similar situation and ended up using parsyncfp, which generates multiple parallel rsyncs based on file lists. If they're on the same IB fabric (as ours were) you can use that instead of ethernet, and it worked pretty well. One caveat is that you need to follow the parallel transfers with a final single rsync, so you can use --delete. For the initial transfer you can also use bbcp. It can get very good performance but isn't nearly as convenient as rsync for subsequent transfers. The performance isn't good with small files but you can use tar on both ends to deal with that, in a similar way to what Uwe suggests below. The bbcp documentation outlines how to do that. Bill On 3/6/19 8:13 AM, Uwe Falke wrote: Hi, in that case I'd open several tar pipes in parallel, maybe using directories carefully selected, like tar -c | ssh "tar -x" I am not quite sure whether "-C /" for tar works here ("tar -C / -x"), but along these lines might be a good efficient method. target_hosts should be all nodes haveing the target file system mounted, and you should start those pipes on the nodes with the source file system. It is best to start with the largest directories, and use some masterscript to start the tar pipes controlled by semaphores to not overload anything. Mit freundlichen Gr??en / Kind regards Dr. Uwe Falke IT Specialist High Performance Computing Services / Integrated Technology Services / Data Center Services ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Rathausstr. 7 09111 Chemnitz Phone: +49 371 6978 2165 Mobile: +49 175 575 2877 E-Mail: uwefalke at de.ibm.com ------------------------------------------------------------------------------------------------------------------------------------------- IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: Thomas Wolter, Sven Schoo? Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, HRB 17122 From: "Oesterlin, Robert" From valleru at cbio.mskcc.org Fri Mar 8 22:40:32 2019 From: valleru at cbio.mskcc.org (valleru at cbio.mskcc.org) Date: Fri, 8 Mar 2019 16:40:32 -0600 Subject: [gpfsug-discuss] Follow-up: migrating billions of files In-Reply-To: References: <54153a80-efef-4757-df89-69df1751648e@rutgers.edu> Message-ID: Thank you Marc. I was just trying to suggest another approach to this email thread. However i believe, we cannot run mmfind/mmapplypolicy with remote filesystems and can only be run on the owning cluster? 
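When it is run on the cluster that owns the file system, the list generation Marc refers to is only a couple of commands; a rough sketch with placeholder paths and node names (the deferred list can then be split and fed to the same parallel rsync/cp jobs):

# policy that lists every file but executes nothing
cat > /tmp/listall.pol <<'EOF'
RULE EXTERNAL LIST 'allfiles' EXEC ''
RULE 'listEverything' LIST 'allfiles'
EOF

# -I defer writes the list under the -f prefix instead of acting on it
mmapplypolicy /gpfs/oldfs -P /tmp/listall.pol \
    -f /gpfs/oldfs/tmp/flist -I defer -N nsd01,nsd02

# result: /gpfs/oldfs/tmp/flist.list.allfiles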
In our clusters - All the gpfs clients are generally in there own compute clusters and mount filesystems from other storage clusters - which i thought is one of the recommended designs. The scripts in the /usr/lpp/mmfs/samples/util folder do work with remote filesystems, and thus on the compute nodes. I was also trying to find something that could be used by users and not by superuser? but i guess none of these tools are meant to be run by a user without superuser privileges. Regards, Lohit On Mar 8, 2019, 3:54 PM -0600, Marc A Kaplan , wrote: > Lohit... Any and all of those commands and techniques should still work with newer version of GPFS. > > But mmapplypolicy is the supported command for generating file lists. ?It uses the GPFS APIs and some parallel processing tricks. > > mmfind is a script that make it easier to write GPFS "policy rules" and runs mmapplypolicy for you. > > mmxcp can be used with mmfind (and/or mmapplypolicy) to make it easy to run a cp (or other command) in parallel on those filelists ... > > --marc K of GPFS > > > > From: ? ? ? ?valleru at cbio.mskcc.org > To: ? ? ? ?""gpfsug-discuss<""gpfsug-discuss at spectrumscale.org ? ? ? ? ", gpfsug main discussion list > Date: ? ? ? ?03/08/2019 10:13 AM > Subject: ? ? ? ?Re: [gpfsug-discuss] Follow-up: migrating billions of files > Sent by: ? ? ? ?gpfsug-discuss-bounces at spectrumscale.org > > > > I had to do this twice too. Once i had to copy a 4 PB filesystem as fast as possible when NSD disk descriptors were corrupted and shutting down GPFS would have led to me loosing those files forever, and the other was a regular maintenance but had to copy similar data in less time. > > In both the cases, i just used GPFS provided util scripts in /usr/lpp/mmfs/samples/util/ ?. These could be run only as root i believe. I wish i could give them to users to use. > > I had used few of those scripts like tsreaddir which used to be really fast in listing all the paths in the directories. It prints full paths of all files along with there inodes etc. I had modified it to print just the full file paths. > > I then use these paths and group them up in different groups which gets fed into a array jobs to the SGE/LSF cluster. > Each array jobs basically uses GNU parallel and running something similar to rsync -avR . The ?-R? option basically creates the directories as given. > Of course this worked because i was using the fast private network to transfer between the storage systems. Also i know that cp or tar might be better than rsync with respect to speed, but rsync was convenient and i could always start over again without checkpointing or remembering where i left off previously. > > Similar to how Bill mentioned in the previous email, but i used gpfs util scripts and basic GNU parallel/rsync, SGE/LSF to submit jobs to the cluster as superuser. It used to work pretty well. > > Since then - I constantly use parallel and rsync to copy large directories. > > Thank you, > Lohit > > On Mar 8, 2019, 7:43 AM -0600, William Abbott , wrote: > We had a similar situation and ended up using parsyncfp, which generates > multiple parallel rsyncs based on file lists. If they're on the same IB > fabric (as ours were) you can use that instead of ethernet, and it > worked pretty well. One caveat is that you need to follow the parallel > transfers with a final single rsync, so you can use --delete. > > For the initial transfer you can also use bbcp. It can get very good > performance but isn't nearly as convenient as rsync for subsequent > transfers. 
The performance isn't good with small files but you can use > tar on both ends to deal with that, in a similar way to what Uwe > suggests below. The bbcp documentation outlines how to do that. > > Bill > > On 3/6/19 8:13 AM, Uwe Falke wrote: > Hi, in that case I'd open several tar pipes in parallel, maybe using > directories carefully selected, like > > tar -c | ssh "tar -x" > > I am not quite sure whether "-C /" for tar works here ("tar -C / -x"), but > along these lines might be a good efficient method. target_hosts should be > all nodes haveing the target file system mounted, and you should start > those pipes on the nodes with the source file system. > It is best to start with the largest directories, and use some > masterscript to start the tar pipes controlled by semaphores to not > overload anything. > > > > Mit freundlichen Gr??en / Kind regards > > > Dr. Uwe Falke > > IT Specialist > High Performance Computing Services / Integrated Technology Services / > Data Center Services > ------------------------------------------------------------------------------------------------------------------------------------------- > IBM Deutschland > Rathausstr. 7 > 09111 Chemnitz > Phone: +49 371 6978 2165 > Mobile: +49 175 575 2877 > E-Mail: uwefalke at de.ibm.com > ------------------------------------------------------------------------------------------------------------------------------------------- > IBM Deutschland Business & Technology Services GmbH / Gesch?ftsf?hrung: > Thomas Wolter, Sven Schoo? > Sitz der Gesellschaft: Ehningen / Registergericht: Amtsgericht Stuttgart, > HRB 17122 > > > > > From: "Oesterlin, Robert" To: gpfsug main discussion list Date: 06/03/2019 13:44 > Subject: [gpfsug-discuss] Follow-up: migrating billions of files > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Some of you had questions to my original post. More information: > > Source: > - Files are straight GPFS/Posix - no extended NFSV4 ACLs > - A solution that requires $?s to be spent on software (ie, Aspera) isn?t > a very viable option > - Both source and target clusters are in the same DC > - Source is stand-alone NSD servers (bonded 10g-E) and 8gb FC SAN storage > - Approx 40 file systems, a few large ones with 300M-400M files each, > others smaller > - no independent file sets > - migration must pose minimal disruption to existing users > > Target architecture is a small number of file systems (2-3) on ESS with > independent filesets > - Target (ESS) will have multiple 40gb-E links on each NSD server (GS4) > > My current thinking is AFM with a pre-populate of the file space and > switch the clients over to have them pull data they need (most of the data > is older and less active) and them let AFM populate the rest in the > background. 
> > > Bob Oesterlin > Sr Principal Storage Engineer, Nuance > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > https://nam02.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.proofpoint.com%2Fv2%2Furl%3Fu%3Dhttp-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss%26d%3DDwICAg%26c%3Djf_iaSHvJObTbx-siA1ZOg%26r%3DfTuVGtgq6A14KiNeaGfNZzOOgtHW5Lm4crZU6lJxtB8%26m%3DJ5RpIj-EzFyU_dM9I4P8SrpHMikte_pn9sbllFcOvyM%26s%3DfEwDQyDSL7hvOVPbg_n8o_LDz-cLqSI6lQtSzmhaSoI%26e&data=02%7C01%7Cbabbott%40rutgers.edu%7C8cbda3d651584119393808d6a2358544%7Cb92d2b234d35447093ff69aca6632ffe%7C1%7C0%7C636874748092821399&sdata=W06i8IWqrxgEmdp3htxad0euiRhA6%2Bexd3YAziSrUhg%3D&reserved=0= > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > https://nam02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7Cbabbott%40rutgers.edu%7C8cbda3d651584119393808d6a2358544%7Cb92d2b234d35447093ff69aca6632ffe%7C1%7C0%7C636874748092821399&sdata=Pjf4RhUchThoFvWI7hLJO4eWhoTXnIYd9m7Mvf809iE%3D&reserved=0 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Fri Mar 8 22:58:59 2019 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 8 Mar 2019 22:58:59 +0000 Subject: [gpfsug-discuss] Exporting remote GPFS mounts on a non-ces SMB share In-Reply-To: References: , <2b02981b-185b-4f64-988f-ce2c19b55c29@Spark><9ceb9b16-18ad-4137-a8d1-f0b34966d7cf@Spark><036e8839-90be-421a-844c-fd7d299f92d4@Spark><186F603F-A278-433F-AE2C-7080EBA94AC9@bham.ac.uk> Message-ID: An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Fri Mar 8 16:24:40 2019 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Fri, 8 Mar 2019 16:24:40 +0000 Subject: [gpfsug-discuss] SSDs for data - DWPD? Message-ID: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu> Hi All, This is kind of a survey if you will, so for this one it might be best if you responded directly to me and I?ll summarize the results next week. Question 1 - do you use SSDs for data? If not - i.e. if you only use SSDs for metadata (as we currently do) - thanks, that?s all! If, however, you do use SSDs for data, please see Question 2. Question 2 - what is the DWPD (daily writes per day) of the SSDs that you use for data? Question 3 - is that different than the DWPD of the SSDs for metadata? Question 4 - any pertinent information in regards to your answers above (i.e. if you?ve got a filesystem that data is uploaded to only once and never modified after that then that?s useful to know!)? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From pinto at scinet.utoronto.ca Tue Mar 12 13:15:24 2019 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Tue, 12 Mar 2019 09:15:24 -0400 Subject: [gpfsug-discuss] mmbackup: how to keep list(expiredFiles, updatedFiles) files Message-ID: <20190312091524.10175q4zufaqley4@support.scinet.utoronto.ca> How can I instruct mmbackup to *NOT* delete the temporary directories and files created inside the FILESET/.mmbackupCfg folder? I can see that during the process the folders expiredFiles & updatedFiles are there, and contain the lists I'm interested in for post-analysis. Thanks Jaime --- Jaime Pinto - Storage Analyst SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.ca University of Toronto 661 University Ave. (MaRS), Suite 1140 Toronto, ON, M5G1M1 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From stockf at us.ibm.com Tue Mar 12 15:19:41 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Tue, 12 Mar 2019 15:19:41 +0000 Subject: [gpfsug-discuss] mmbackup: how to keep list(expiredFiles, updatedFiles) files In-Reply-To: <20190312091524.10175q4zufaqley4@support.scinet.utoronto.ca> References: <20190312091524.10175q4zufaqley4@support.scinet.utoronto.ca> Message-ID: An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Tue Mar 12 16:07:49 2019 From: pinto at scinet.utoronto.ca (JAIME PINTO) Date: Tue, 12 Mar 2019 12:07:49 -0400 Subject: [gpfsug-discuss] mmbackup: how to keep list(expiredFiles, updatedFiles) files In-Reply-To: References: <20190312091524.10175q4zufaqley4@support.scinet.utoronto.ca> Message-ID: <16972a8f408.27e4.eccefdca38f81ee5a57d11a4aad4f6a6@scinet.utoronto.ca> Thanks Fred. I'll try that. Jaime On March 12, 2019 11:21:49 AM "Frederick Stock" wrote: > In the mmbackup man page look at the settings for the DEBUGmmbackup > variable. There is a value that will keep the temporary files. > > Fred > __________________________________________________ > Fred Stock | IBM Pittsburgh Lab | 720-430-8821 > stockf at us.ibm.com > > > ----- Original message ----- > From: "Jaime Pinto" > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: "gpfsug main discussion list" > Cc: > Subject: [gpfsug-discuss] mmbackup: how to keep list(expiredFiles, > updatedFiles) files > Date: Tue, Mar 12, 2019 10:28 AM > > How can I instruct mmbackup to *NOT* delete the temporary directories > and files created inside the FILESET/.mmbackupCfg folder? > > I can see that during the process the folders expiredFiles & > updatedFiles are there, and contain the lists I'm interested in for > post-analysis. > > Thanks > Jaime > > > > --- > Jaime Pinto - Storage Analyst > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - www.computecanada.ca > University of Toronto > 661 University Ave. (MaRS), Suite 1140 > Toronto, ON, M5G1M1 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
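For reference, the DEBUGmmbackup variable Fred points to is simply set in the environment of the mmbackup invocation. A hedged example follows; the value 2 is my assumption for the documented "keep temporary files" setting and should be checked against the mmbackup man page for your release, and the file system path and options are illustrative:

# Keep the .mmbackupCfg working files (expiredFiles, updatedFiles, ...) after the run
DEBUGmmbackup=2 mmbackup /gpfs/fs1 -t incremental -s /tmp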
URL: From chair at spectrumscale.org Thu Mar 14 14:44:35 2019 From: chair at spectrumscale.org (Simon Thompson (Spectrum Scale User Group Chair)) Date: Thu, 14 Mar 2019 14:44:35 +0000 Subject: [gpfsug-discuss] UK Spectrum Scale user group Message-ID: Registration is now open for the UK Spectrum Scale user group, taking place on 8th and 9th May 2019. Details and registration are available at: https://www.spectrumscaleug.org/event/uk-user-group-meeting/ We?re still looking for some user/customer talks to form part of the agenda and finalising the agenda speakers. If you are interested in presenting your use case of Spectrum Scale, please let me know. Thanks go out to our sponsors OCF, E8 storage, Lenovo and DDN storage and as always to IBM for supporting the event. Thanks Simon -------------- next part -------------- An HTML attachment was scrubbed... URL: From stephen.buchanan at us.ibm.com Thu Mar 14 19:58:04 2019 From: stephen.buchanan at us.ibm.com (Stephen R Buchanan) Date: Thu, 14 Mar 2019 19:58:04 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Message-ID: An HTML attachment was scrubbed... URL: From stockf at us.ibm.com Thu Mar 14 20:17:28 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Thu, 14 Mar 2019 20:17:28 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... URL: From bbanister at jumptrading.com Thu Mar 14 20:47:35 2019 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 14 Mar 2019 20:47:35 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: Message-ID: We use a site specific systemd unit which we call gpfs_fs.service that `BindsTo=gpfs.service` and `After=gpfs.service`. This service basically waits for GPFS to become active and then once active attempts to mount the required file systems. The list of file systems is determined by our own system configuration software (e.g. puppet/cfengine/salt-stack/ansible). We have also added a custom extension to gpfs.service (/usr/lib/systemd/system/gpfs.service.d/gpfs.service.conf) which adds a ExecStartPre to the IBM provided unit (we don?t want to mess with this IBM provide file). This ExecStartPre will make sure the node has the required version of GPFS installed and do some other basic checks. We have other systemd controlled process then both `BindsTo` and `After` the gpfs_fs.service. This works pretty well for us. Hope that helps, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org On Behalf Of Frederick Stock Sent: Thursday, March 14, 2019 3:17 PM To: gpfsug-discuss at spectrumscale.org Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem [EXTERNAL EMAIL] It is not systemd based but you might want to look at the user callback feature in GPFS (mmaddcallback). There is a file system mount callback you could register. Fred __________________________________________________ Fred Stock | IBM Pittsburgh Lab | 720-430-8821 stockf at us.ibm.com ----- Original message ----- From: "Stephen R Buchanan" > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Date: Thu, Mar 14, 2019 3:58 PM I searched the list archives with no obvious results. 
I have an application that runs completely from a Spectrum Scale filesystem that I would like to start automatically on boot, obviously after the SS filesystem mounts, on multiple nodes. There are groups of nodes for dev, test, and production, (separate clusters) and the target filesystems are different between them (and are named differently, so the paths are different), but all nodes have an identical soft link from root (/) that points to the environment-specific path. (see below for details) My first effort before I did any research was to try to simply use a directive of After=gpfs.service which anyone who has tried it will know that the gpfs.service returns as "started" far in advance (and independently of) when filesystems are actually mounted. What I want is to be able to deploy a systemd service-unit and path-unit pair of files (that are as close to identical as possible across the environments) that wait for /appbin/builds/ to be available (/[dev|tst|prd]01/ to be mounted) and then starts the application. The problem is that systemd.path units, specifically the 'PathExists=' directive, don't follow symbolic links, so I would need to customize the path unit file for each environment with the full (real) path. There are other differences between the environments that I believe I can handle by specifying an EnvironmentFile directive -- but that would come from the SS filesystem so as to be a single reference point, so it can't help with the path unit. Any suggestions are welcome and appreciated. dev:(path names have been slightly generalized, but the structure is identical) SS filesystem: /dev01 full path: /dev01/app-bin/user-tree/builds/ soft link: /appbin/ -> /dev01/app-bin/user-tree/ test: SS filesystem: /tst01 full path: /tst01/app-bin/user-tree/builds/ soft link: /appbin/ -> /tst01/app-bin/user-tree/ prod: SS filesystem: /prd01 full path: /prd01/app-bin/user-tree/builds/ soft link: /appbin/ -> /prd01/app-bin/user-tree/ Stephen R. Wall Buchanan Sr. IT Specialist IBM Data & AI North America Government Expert Labs +1 (571) 299-4601 stephen.buchanan at us.ibm.com _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential, or privileged information and/or personal data. If you are not the intended recipient, you are hereby notified that any review, dissemination, or copying of this email is strictly prohibited, and requested to notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request, or solicitation of any kind to buy, sell, subscribe, redeem, or perform any type of transaction of a financial product. Personal data, as defined by applicable data privacy laws, contained in this email may be processed by the Company, and any of its affiliated or related companies, for potential ongoing compliance and/or business-related purposes. You may have rights regarding your personal data; for information on exercising these rights or the Company?s treatment of personal data, please email datarequests at jumptrading.com. 
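A rough sketch of the pattern Bryan describes above: a site-specific unit that binds to gpfs.service and waits for the required file systems, plus a drop-in that extends the IBM-supplied unit without editing it. The unit contents, helper script names and file system list are assumptions, not Bryan's actual files:

# /etc/systemd/system/gpfs_fs.service (sketch)
[Unit]
Description=Wait for GPFS to become active and mount required file systems
BindsTo=gpfs.service
After=gpfs.service

[Service]
Type=oneshot
RemainAfterExit=yes
# Assumed helper: loops on 'mmgetstate' until the node is active, then runs
# 'mmmount' for the file systems listed by the site's configuration management.
ExecStart=/usr/local/sbin/gpfs_wait_and_mount

[Install]
WantedBy=multi-user.target

# /usr/lib/systemd/system/gpfs.service.d/gpfs.service.conf (drop-in sketch)
[Service]
ExecStartPre=/usr/local/sbin/gpfs_preflight_checks

Services that need the file systems then declare BindsTo=gpfs_fs.service and After=gpfs_fs.service, as Bryan notes.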
-------------- next part -------------- An HTML attachment was scrubbed... URL: From stephen.buchanan at us.ibm.com Thu Mar 14 20:52:32 2019 From: stephen.buchanan at us.ibm.com (Stephen R Buchanan) Date: Thu, 14 Mar 2019 20:52:32 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: From stockf at us.ibm.com Thu Mar 14 21:04:24 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Thu, 14 Mar 2019 21:04:24 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: , , Message-ID: An HTML attachment was scrubbed... URL: From tyler.trafford at yale.edu Thu Mar 14 21:36:07 2019 From: tyler.trafford at yale.edu (Trafford, Tyler) Date: Thu, 14 Mar 2019 21:36:07 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: Message-ID: I use the following: [Unit] Description=Foo After=gpfs.service [Service] ExecStartPre=/bin/bash -c 'until [ -d /gpfs/%I/apps/services/foo ]; do sleep 20s; done' ExecStart=/usr/sbin/runuser -u root /gpfs/%I/apps/services/foo/bin/runme [Install] WantedBy=multi-user.target Then I can drop it on multiple systems (with the same app layout), and run: systemctl enable foo at fs1 or systemctl enable foo at fs2 The "%I" gets replaced by what is after that "@". -- Tyler Trafford tyler.trafford at yale.edu ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Stephen R Buchanan Sent: Thursday, March 14, 2019 3:58 PM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem I searched the list archives with no obvious results. I have an application that runs completely from a Spectrum Scale filesystem that I would like to start automatically on boot, obviously after the SS filesystem mounts, on multiple nodes. There are groups of nodes for dev, test, and production, (separate clusters) and the target filesystems are different between them (and are named differently, so the paths are different), but all nodes have an identical soft link from root (/) that points to the environment-specific path. (see below for details) My first effort before I did any research was to try to simply use a directive of After=gpfs.service which anyone who has tried it will know that the gpfs.service returns as "started" far in advance (and independently of) when filesystems are actually mounted. What I want is to be able to deploy a systemd service-unit and path-unit pair of files (that are as close to identical as possible across the environments) that wait for /appbin/builds/ to be available (/[dev|tst|prd]01/ to be mounted) and then starts the application. The problem is that systemd.path units, specifically the 'PathExists=' directive, don't follow symbolic links, so I would need to customize the path unit file for each environment with the full (real) path. There are other differences between the environments that I believe I can handle by specifying an EnvironmentFile directive -- but that would come from the SS filesystem so as to be a single reference point, so it can't help with the path unit. Any suggestions are welcome and appreciated. 
dev:(path names have been slightly generalized, but the structure is identical) SS filesystem: /dev01 full path: /dev01/app-bin/user-tree/builds/ soft link: /appbin/ -> /dev01/app-bin/user-tree/ test: SS filesystem: /tst01 full path: /tst01/app-bin/user-tree/builds/ soft link: /appbin/ -> /tst01/app-bin/user-tree/ prod: SS filesystem: /prd01 full path: /prd01/app-bin/user-tree/builds/ soft link: /appbin/ -> /prd01/app-bin/user-tree/ Stephen R. Wall Buchanan Sr. IT Specialist IBM Data & AI North America Government Expert Labs +1 (571) 299-4601 stephen.buchanan at us.ibm.com From tyler.trafford at yale.edu Thu Mar 14 21:38:32 2019 From: tyler.trafford at yale.edu (Trafford, Tyler) Date: Thu, 14 Mar 2019 21:38:32 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: , Message-ID: I forgot to mention that you need to name the unit file something like foo at .service -Tyler ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Trafford, Tyler Sent: Thursday, March 14, 2019 5:36 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem I use the following: [Unit] Description=Foo After=gpfs.service [Service] ExecStartPre=/bin/bash -c 'until [ -d /gpfs/%I/apps/services/foo ]; do sleep 20s; done' ExecStart=/usr/sbin/runuser -u root /gpfs/%I/apps/services/foo/bin/runme [Install] WantedBy=multi-user.target Then I can drop it on multiple systems (with the same app layout), and run: systemctl enable foo at fs1 or systemctl enable foo at fs2 The "%I" gets replaced by what is after that "@". -- Tyler Trafford tyler.trafford at yale.edu ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Stephen R Buchanan Sent: Thursday, March 14, 2019 3:58 PM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem I searched the list archives with no obvious results. I have an application that runs completely from a Spectrum Scale filesystem that I would like to start automatically on boot, obviously after the SS filesystem mounts, on multiple nodes. There are groups of nodes for dev, test, and production, (separate clusters) and the target filesystems are different between them (and are named differently, so the paths are different), but all nodes have an identical soft link from root (/) that points to the environment-specific path. (see below for details) My first effort before I did any research was to try to simply use a directive of After=gpfs.service which anyone who has tried it will know that the gpfs.service returns as "started" far in advance (and independently of) when filesystems are actually mounted. What I want is to be able to deploy a systemd service-unit and path-unit pair of files (that are as close to identical as possible across the environments) that wait for /appbin/builds/ to be available (/[dev|tst|prd]01/ to be mounted) and then starts the application. The problem is that systemd.path units, specifically the 'PathExists=' directive, don't follow symbolic links, so I would need to customize the path unit file for each environment with the full (real) path. There are other differences between the environments that I believe I can handle by specifying an EnvironmentFile directive -- but that would come from the SS filesystem so as to be a single reference point, so it can't help with the path unit. 
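Since the list software renders '@' as ' at ', Tyler's two mails read more clearly put together: the template file is named foo@.service, and each instance name replaces %I in the unit. The commands below are a sketch; the mount-point layout is carried over from his example:

# Save the template as /etc/systemd/system/foo@.service, then:
systemctl daemon-reload
systemctl enable --now foo@fs1    # %I expands to 'fs1', so the unit waits on /gpfs/fs1/...
systemctl enable --now foo@fs2

With Stephen's layout (file systems mounted at /dev01, /tst01, /prd01 rather than under /gpfs), the test path in ExecStartPre would need the leading /gpfs/ dropped, e.g. /%I/app-bin/user-tree/builds.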
Any suggestions are welcome and appreciated. dev:(path names have been slightly generalized, but the structure is identical) SS filesystem: /dev01 full path: /dev01/app-bin/user-tree/builds/ soft link: /appbin/ -> /dev01/app-bin/user-tree/ test: SS filesystem: /tst01 full path: /tst01/app-bin/user-tree/builds/ soft link: /appbin/ -> /tst01/app-bin/user-tree/ prod: SS filesystem: /prd01 full path: /prd01/app-bin/user-tree/builds/ soft link: /appbin/ -> /prd01/app-bin/user-tree/ Stephen R. Wall Buchanan Sr. IT Specialist IBM Data & AI North America Government Expert Labs +1 (571) 299-4601 stephen.buchanan at us.ibm.com _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam05.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7Ctyler.trafford%40yale.edu%7Cb3be6639c012419aaf3908d6a8c518c9%7Cdd8cbebb21394df8b4114e3e87abeb5c%7C0%7C0%7C636881961822668372&sdata=4rU%2BWIv1tpJiTOGmliWba2vvTxeJf5gYyJ7xrbdf6wE%3D&reserved=0 From makaplan at us.ibm.com Thu Mar 14 21:40:39 2019 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 14 Mar 2019 16:40:39 -0500 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: Message-ID: K.I.S.S. Try to open and read a file that you stored in GPFS. If good, proceed. Otherwise wait a second and retry. Nope, nothing GPFS specific about that. Need not be privileged or root either. K.I.S.S. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Fri Mar 15 02:37:18 2019 From: ulmer at ulmer.org (Stephen Ulmer) Date: Thu, 14 Mar 2019 22:37:18 -0400 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: Message-ID: +1 ? This is the best solution. The only thing I would change would be to add: TimeoutStartSec=300 Or something similar. This leaves the maintenance of starting applications where it belongs (in systems, not in GPFS). You can use the same technique for other VFS types (like NFS if you needed). You can check for any file on the file system you want, so you could just put a dotfile in the root of each waited-for file system and look for that. You an even chase your symlink if you want (removing the parameter completely). As a recovering sysadmin, this makes me smile. -- Stephen > On Mar 14, 2019, at 5:36 PM, Trafford, Tyler wrote: > > I use the following: > > [Unit] > Description=Foo > After=gpfs.service > > [Service] > ExecStartPre=/bin/bash -c 'until [ -d /gpfs/%I/apps/services/foo ]; do sleep 20s; done' > ExecStart=/usr/sbin/runuser -u root /gpfs/%I/apps/services/foo/bin/runme > > [Install] > WantedBy=multi-user.target > > > Then I can drop it on multiple systems (with the same app layout), and run: > > systemctl enable foo at fs1 > or > systemctl enable foo at fs2 > > The "%I" gets replaced by what is after that "@". > > -- > Tyler Trafford > tyler.trafford at yale.edu > > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Stephen R Buchanan > Sent: Thursday, March 14, 2019 3:58 PM > To: gpfsug-discuss at spectrumscale.org > Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem > > I searched the list archives with no obvious results. 
> > I have an application that runs completely from a Spectrum Scale filesystem that I would like to start automatically on boot, obviously after the SS filesystem mounts, on multiple nodes. There are groups of nodes for dev, test, and production, (separate clusters) and the target filesystems are different between them (and are named differently, so the paths are different), but all nodes have an identical soft link from root (/) that points to the environment-specific path. (see below for details) > > My first effort before I did any research was to try to simply use a directive of After=gpfs.service which anyone who has tried it will know that the gpfs.service returns as "started" far in advance (and independently of) when filesystems are actually mounted. > > What I want is to be able to deploy a systemd service-unit and path-unit pair of files (that are as close to identical as possible across the environments) that wait for /appbin/builds/ to be available (/[dev|tst|prd]01/ to be mounted) and then starts the application. The problem is that systemd.path units, specifically the 'PathExists=' directive, don't follow symbolic links, so I would need to customize the path unit file for each environment with the full (real) path. There are other differences between the environments that I believe I can handle by specifying an EnvironmentFile directive -- but that would come from the SS filesystem so as to be a single reference point, so it can't help with the path unit. > > Any suggestions are welcome and appreciated. > > dev:(path names have been slightly generalized, but the structure is identical) > SS filesystem: /dev01 > full path: /dev01/app-bin/user-tree/builds/ > soft link: /appbin/ -> /dev01/app-bin/user-tree/ > > test: > SS filesystem: /tst01 > full path: /tst01/app-bin/user-tree/builds/ > soft link: /appbin/ -> /tst01/app-bin/user-tree/ > > prod: > SS filesystem: /prd01 > full path: /prd01/app-bin/user-tree/builds/ > soft link: /appbin/ -> /prd01/app-bin/user-tree/ > > > Stephen R. Wall Buchanan > Sr. IT Specialist > IBM Data & AI North America Government Expert Labs > +1 (571) 299-4601 > stephen.buchanan at us.ibm.com > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Fri Mar 15 08:49:44 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Fri, 15 Mar 2019 08:49:44 +0000 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: References: Message-ID: <433E6EBE-069B-4600-BA91-79E052050705@bham.ac.uk> +1 for using callbacks, we use the Mount and preUnMount callbacks on various things, e.g. before unmount, shutdown all the VMs running on the host, i.e. start and stop other things cleanly when the FS arrives/before it goes away. Simon From: on behalf of "stockf at us.ibm.com" Reply-To: "gpfsug-discuss at spectrumscale.org" Date: Thursday, 14 March 2019 at 21:04 To: "gpfsug-discuss at spectrumscale.org" Cc: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem But if all you are waiting for is the mount to occur the invocation of the callback informs you the file system has been mounted. You would be free to start a command in the background, with appropriate protection, and exit the callback script. 
Also, making the callback script run asynchronous means GPFS will not wait for it to complete and that greatly mitigates any potential problems with GPFS commands, if you need to run them from the script. Fred __________________________________________________ Fred Stock | IBM Pittsburgh Lab | 720-430-8821 stockf at us.ibm.com ----- Original message ----- From: "Stephen R Buchanan" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Date: Thu, Mar 14, 2019 4:52 PM The man page for mmaddcallback specifically cautions against running "commands that involve GPFS files" because it "may cause unexpected and undesired results, including loss of file system availability." While I can imagine some kind of Rube Goldberg-esque chain of commands that I could run locally that would trigger the GPFS-filesystem-based commands I really want, I don't think mmaddcallback is the droid I'm looking for. Stephen R. Wall Buchanan Sr. IT Specialist IBM Data & AI North America Government Expert Labs +1 (571) 299-4601 stephen.buchanan at us.ibm.com ----- Original message ----- From: "Frederick Stock" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Date: Thu, Mar 14, 2019 4:17 PM It is not systemd based but you might want to look at the user callback feature in GPFS (mmaddcallback). There is a file system mount callback you could register. Fred __________________________________________________ Fred Stock | IBM Pittsburgh Lab | 720-430-8821 stockf at us.ibm.com ----- Original message ----- From: "Stephen R Buchanan" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Date: Thu, Mar 14, 2019 3:58 PM I searched the list archives with no obvious results. I have an application that runs completely from a Spectrum Scale filesystem that I would like to start automatically on boot, obviously after the SS filesystem mounts, on multiple nodes. There are groups of nodes for dev, test, and production, (separate clusters) and the target filesystems are different between them (and are named differently, so the paths are different), but all nodes have an identical soft link from root (/) that points to the environment-specific path. (see below for details) My first effort before I did any research was to try to simply use a directive of After=gpfs.service which anyone who has tried it will know that the gpfs.service returns as "started" far in advance (and independently of) when filesystems are actually mounted. What I want is to be able to deploy a systemd service-unit and path-unit pair of files (that are as close to identical as possible across the environments) that wait for /appbin/builds/ to be available (/[dev|tst|prd]01/ to be mounted) and then starts the application. The problem is that systemd.path units, specifically the 'PathExists=' directive, don't follow symbolic links, so I would need to customize the path unit file for each environment with the full (real) path. 
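For completeness, registering the callbacks Fred and Simon describe looks roughly like the following; the identifiers and script paths are assumptions, and, given the caution quoted above, the scripts should background any work that touches GPFS files and return quickly:

# Start the application whenever a file system is mounted on this node
mmaddcallback startAppOnMount --command /usr/local/sbin/start_app.sh \
    --event mount --parms "%eventName %fsName"

# Simon's pattern: stop things cleanly before the file system goes away
mmaddcallback stopAppBeforeUnmount --command /usr/local/sbin/stop_app.sh \
    --event preUnmount --parms "%eventName %fsName"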
There are other differences between the environments that I believe I can handle by specifying an EnvironmentFile directive -- but that would come from the SS filesystem so as to be a single reference point, so it can't help with the path unit. Any suggestions are welcome and appreciated. dev:(path names have been slightly generalized, but the structure is identical) SS filesystem: /dev01 full path: /dev01/app-bin/user-tree/builds/ soft link: /appbin/ -> /dev01/app-bin/user-tree/ test: SS filesystem: /tst01 full path: /tst01/app-bin/user-tree/builds/ soft link: /appbin/ -> /tst01/app-bin/user-tree/ prod: SS filesystem: /prd01 full path: /prd01/app-bin/user-tree/builds/ soft link: /appbin/ -> /prd01/app-bin/user-tree/ Stephen R. Wall Buchanan Sr. IT Specialist IBM Data & AI North America Government Expert Labs +1 (571) 299-4601 stephen.buchanan at us.ibm.com _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From TOMP at il.ibm.com Fri Mar 15 18:12:14 2019 From: TOMP at il.ibm.com (Tomer Perry) Date: Fri, 15 Mar 2019 20:12:14 +0200 Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem In-Reply-To: <433E6EBE-069B-4600-BA91-79E052050705@bham.ac.uk> References: <433E6EBE-069B-4600-BA91-79E052050705@bham.ac.uk> Message-ID: I also for using callbacks ( as that's the "right" way for GPFS to report an event) instead of polling for status. One exception is the umount case, in which when using bind mounts ( which is quite common for "namespace virtualization") one should use the preunmount user exit instead of callback ( callback wouldn't work on these cases) for more info check https://www.ibm.com/developerworks/community/wikis/home?lang=en-gb#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/User%20Exits Regards, Tomer Perry Scalable I/O Development (Spectrum Scale) email: tomp at il.ibm.com 1 Azrieli Center, Tel Aviv 67021, Israel Global Tel: +1 720 3422758 Israel Tel: +972 3 9188625 Mobile: +972 52 2554625 From: Simon Thompson To: gpfsug main discussion list Date: 15/03/2019 10:53 Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Sent by: gpfsug-discuss-bounces at spectrumscale.org +1 for using callbacks, we use the Mount and preUnMount callbacks on various things, e.g. before unmount, shutdown all the VMs running on the host, i.e. start and stop other things cleanly when the FS arrives/before it goes away. Simon From: on behalf of "stockf at us.ibm.com" Reply-To: "gpfsug-discuss at spectrumscale.org" Date: Thursday, 14 March 2019 at 21:04 To: "gpfsug-discuss at spectrumscale.org" Cc: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem But if all you are waiting for is the mount to occur the invocation of the callback informs you the file system has been mounted. You would be free to start a command in the background, with appropriate protection, and exit the callback script. 
Also, making the callback script run asynchronous means GPFS will not wait for it to complete and that greatly mitigates any potential problems with GPFS commands, if you need to run them from the script. Fred __________________________________________________ Fred Stock | IBM Pittsburgh Lab | 720-430-8821 stockf at us.ibm.com ----- Original message ----- From: "Stephen R Buchanan" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Date: Thu, Mar 14, 2019 4:52 PM The man page for mmaddcallback specifically cautions against running "commands that involve GPFS files" because it "may cause unexpected and undesired results, including loss of file system availability." While I can imagine some kind of Rube Goldberg-esque chain of commands that I could run locally that would trigger the GPFS-filesystem-based commands I really want, I don't think mmaddcallback is the droid I'm looking for. Stephen R. Wall Buchanan Sr. IT Specialist IBM Data & AI North America Government Expert Labs +1 (571) 299-4601 stephen.buchanan at us.ibm.com ----- Original message ----- From: "Frederick Stock" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Date: Thu, Mar 14, 2019 4:17 PM It is not systemd based but you might want to look at the user callback feature in GPFS (mmaddcallback). There is a file system mount callback you could register. Fred __________________________________________________ Fred Stock | IBM Pittsburgh Lab | 720-430-8821 stockf at us.ibm.com ----- Original message ----- From: "Stephen R Buchanan" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: [gpfsug-discuss] Systemd configuration to wait for mount of SS filesystem Date: Thu, Mar 14, 2019 3:58 PM I searched the list archives with no obvious results. I have an application that runs completely from a Spectrum Scale filesystem that I would like to start automatically on boot, obviously after the SS filesystem mounts, on multiple nodes. There are groups of nodes for dev, test, and production, (separate clusters) and the target filesystems are different between them (and are named differently, so the paths are different), but all nodes have an identical soft link from root (/) that points to the environment-specific path. (see below for details) My first effort before I did any research was to try to simply use a directive of After=gpfs.service which anyone who has tried it will know that the gpfs.service returns as "started" far in advance (and independently of) when filesystems are actually mounted. What I want is to be able to deploy a systemd service-unit and path-unit pair of files (that are as close to identical as possible across the environments) that wait for /appbin/builds/ to be available (/[dev|tst|prd]01/ to be mounted) and then starts the application. The problem is that systemd.path units, specifically the 'PathExists=' directive, don't follow symbolic links, so I would need to customize the path unit file for each environment with the full (real) path. 
There are other differences between the environments that I believe I can handle by specifying an EnvironmentFile directive -- but that would come from the SS filesystem so as to be a single reference point, so it can't help with the path unit. Any suggestions are welcome and appreciated. dev:(path names have been slightly generalized, but the structure is identical) SS filesystem: /dev01 full path: /dev01/app-bin/user-tree/builds/ soft link: /appbin/ -> /dev01/app-bin/user-tree/ test: SS filesystem: /tst01 full path: /tst01/app-bin/user-tree/builds/ soft link: /appbin/ -> /tst01/app-bin/user-tree/ prod: SS filesystem: /prd01 full path: /prd01/app-bin/user-tree/builds/ soft link: /appbin/ -> /prd01/app-bin/user-tree/ Stephen R. Wall Buchanan Sr. IT Specialist IBM Data & AI North America Government Expert Labs +1 (571) 299-4601 stephen.buchanan at us.ibm.com _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=mLPyKeOa1gNDrORvEXBgMw&m=F9Tf-JhgNwLBBIROpBcPVceJFINblVd6CHoSA1tOhmw&s=bhtWnfg7iqmu6Xu6_pJePULJ8jw8-4mFHftkHZ_bZho&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Mon Mar 18 19:09:34 2019 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Mon, 18 Mar 2019 19:09:34 +0000 Subject: [gpfsug-discuss] SSDs for data - DWPD? In-Reply-To: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu> References: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu> Message-ID: <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> Hi All, Just wanted to follow up with the results of my survey ? I received a grand total of two responses (Thanks Alex and John). In their case, they?re using SSDs with a 10 DWPD rating. The motivation behind my asking this question was ? money! ;-). Seriously, 10 DWPD drives are still very expensive, while 3 DWPD drives are significantly less expensive and 1 DWPD drives are even cheaper still. While we would NOT feel comfortable using anything less than 10 DWPD drives for metadata, we?re wondering about using less expensive drives for data. For example, let?s just say that you?re getting ready to set up a brand new GPFS 5 formatted filesystem of 1-2 PB in size. You?re considering having 3 pools: 1) a metadata only system pool of 10 DWPD SSDs. 4K inodes, and a ton of small files that?ll fit in the inode. 2) a data only ?hot? pool (i.e. the default pool for writes) of SSDs. 3) a data only ?capacity? pool of 12 TB spinning disks. And let?s just say that you have looked back at the historical data you?ve collected and you see that over the last 6 months or so you?ve been averaging 10-12 TB of data being written into your existing filesystem per day. You want to do migrations between pools only on the weekends if at all possible. 12 * 7 = 84 TB. So if you had somewhere between 125 - 150 TB of SSDs ... 1 DWPD SSDs ? 
then in theory you should easily be able to handle your anticipated workload without coming close to exceeding the 1 DWPD rating of the SSDs. However, as the saying goes, while in theory there's no difference between theory and practice, in practice there is ... so am I overlooking anything here from a GPFS perspective??? If anybody still wants to respond on the DWPD rating of the SSDs they use for data, I'm still listening. Thanks... Kevin P.S. I still have a couple of "outstanding issues" to respond to that I've posted to the list about previously: 1) the long I/Os we see occasionally in the output of 'mmdiag --iohist' on our NSD servers. We're still trying to track that down - it seems to happen only with a subset of our hardware - most of the time at least - but we're still working to track down what triggers it - i.e. at this point I can't say whether it's really the hardware or a user abusing the hardware. 2) I promised to post benchmark results of 3 different metadata configs: a) RAID 1 mirrors, b) a RAID 5 stripe, c) no RAID, but GPFS metadata replication of 3. That benchmarking has been put on hold for reasons I can't really discuss on this mailing list at this time - but hopefully soon. I haven't forgotten the above and will respond back on the list when it's appropriate. Thanks... On Mar 8, 2019, at 10:24 AM, Buterbaugh, Kevin L > wrote: Hi All, This is kind of a survey if you will, so for this one it might be best if you responded directly to me and I'll summarize the results next week. Question 1 - do you use SSDs for data? If not - i.e. if you only use SSDs for metadata (as we currently do) - thanks, that's all! If, however, you do use SSDs for data, please see Question 2. Question 2 - what is the DWPD (drive writes per day) of the SSDs that you use for data? Question 3 - is that different than the DWPD of the SSDs for metadata? Question 4 - any pertinent information in regards to your answers above (i.e. if you've got a filesystem that data is uploaded to only once and never modified after that then that's useful to know!)? Thanks... Kevin -- Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Mon Mar 18 21:13:58 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Mon, 18 Mar 2019 21:13:58 +0000 Subject: [gpfsug-discuss] SSDs for data - DWPD? In-Reply-To: <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> References: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu>, <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> Message-ID: Did you look at pricing larger SSDs than you need and only using partial capacity to get more DWPD out of them? I.e. 1TB drive 3 DWPD = 3TBpd 2TB drive (using 1/2 capacity) = 6TBpd Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Buterbaugh, Kevin L [Kevin.Buterbaugh at Vanderbilt.Edu] Sent: 18 March 2019 19:09 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] SSDs for data - DWPD? Hi All, Just wanted to follow up with the results of my survey - I received a grand total of two responses (Thanks Alex and John). In their case, they're using SSDs with a 10 DWPD rating. The motivation behind my asking this question was ... money! ;-).
Seriously, 10 DWPD drives are still very expensive, while 3 DWPD drives are significantly less expensive and 1 DWPD drives are even cheaper still. While we would NOT feel comfortable using anything less than 10 DWPD drives for metadata, we?re wondering about using less expensive drives for data. For example, let?s just say that you?re getting ready to set up a brand new GPFS 5 formatted filesystem of 1-2 PB in size. You?re considering having 3 pools: 1) a metadata only system pool of 10 DWPD SSDs. 4K inodes, and a ton of small files that?ll fit in the inode. 2) a data only ?hot? pool (i.e. the default pool for writes) of SSDs. 3) a data only ?capacity? pool of 12 TB spinning disks. And let?s just say that you have looked back at the historical data you?ve collected and you see that over the last 6 months or so you?ve been averaging 10-12 TB of data being written into your existing filesystem per day. You want to do migrations between pools only on the weekends if at all possible. 12 * 7 = 84 TB. So if you had somewhere between 125 - 150 TB of SSDs ... 1 DWPD SSDs ? then in theory you should easily be able to handle your anticipated workload without coming close to exceeding the 1 DWPD rating of the SSDs. However, as the saying goes, while in theory there?s no difference between theory and practice, in practice there is ... so am I overlooking anything here from a GPFS perspective??? If anybody still wants to respond on the DWPD rating of the SSDs they use for data, I?m still listening. Thanks? Kevin P.S. I still have a couple of ?outstanding issues? to respond to that I?ve posted to the list about previously: 1) the long I/O?s we see occasionally in the output of ?mmdiag ?iohist? on our NSD servers. We?re still trying to track that down ? it seems to happen only with a subset of our hardware - most of the time at least - but we?re still working to track down what triggers it ? i.e. at this point I can?t say whether it?s really the hardware or a user abusing the hardware. 2) I promised to post benchmark results of 3 different metadata configs: a) RAID 1 mirrors, b) a RAID 5 stripe, c) no RAID, but GPFS metadata replication of 3. That benchmarking has been put on hold for reasons I can?t really discuss on this mailing list at this time ? but hopefully soon. I haven?t forgotten the above and will respond back on the list when it?s appropriate. Thanks... On Mar 8, 2019, at 10:24 AM, Buterbaugh, Kevin L > wrote: Hi All, This is kind of a survey if you will, so for this one it might be best if you responded directly to me and I?ll summarize the results next week. Question 1 - do you use SSDs for data? If not - i.e. if you only use SSDs for metadata (as we currently do) - thanks, that?s all! If, however, you do use SSDs for data, please see Question 2. Question 2 - what is the DWPD (daily writes per day) of the SSDs that you use for data? Question 3 - is that different than the DWPD of the SSDs for metadata? Question 4 - any pertinent information in regards to your answers above (i.e. if you?ve got a filesystem that data is uploaded to only once and never modified after that then that?s useful to know!)? Thanks? Kevin ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 From Kevin.Buterbaugh at Vanderbilt.Edu Mon Mar 18 22:45:03 2019 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Mon, 18 Mar 2019 22:45:03 +0000 Subject: [gpfsug-discuss] SSDs for data - DWPD? In-Reply-To: References: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu> <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> Message-ID: Thanks for the suggestion, Simon. Yes, we?ve looked at that, but we think that we?re going to potentially be in a situation where we?re using fairly big SSDs already. For example, if we bought 30 6.4 TB SSDs rated at 1 DWPD and configured them as 6 4+1P RAID 5 LUNs, then we?d end up with a usable capacity of 6 * 4 * 6 = ~144 TB usable space in our ?hot? pool. That would satisfy our capacity needs and also not exceed the 1 DWPD rating of the drives. BTW, we noticed with one particular vendor that their 3 DWPD drives were exactly 1/3rd the size of their 1 DWPD drives ? which makes us wonder if that?s coincidence or not. Anybody know for sure? Thanks? Kevin > On Mar 18, 2019, at 4:13 PM, Simon Thompson wrote: > > Did you look at pricing larger SSDs than you need and only using partial capacity to get more DWPD out of them? > > I.e. 1TB drive 3dpwd = 3TBpd > 2TB drive (using 1/2 capacity) = 6TBpd > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Buterbaugh, Kevin L [Kevin.Buterbaugh at Vanderbilt.Edu] > Sent: 18 March 2019 19:09 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] SSDs for data - DWPD? > > Hi All, > > Just wanted to follow up with the results of my survey ? I received a grand total of two responses (Thanks Alex and John). In their case, they?re using SSDs with a 10 DWPD rating. > > The motivation behind my asking this question was ? money! ;-). Seriously, 10 DWPD drives are still very expensive, while 3 DWPD drives are significantly less expensive and 1 DWPD drives are even cheaper still. While we would NOT feel comfortable using anything less than 10 DWPD drives for metadata, we?re wondering about using less expensive drives for data. > > For example, let?s just say that you?re getting ready to set up a brand new GPFS 5 formatted filesystem of 1-2 PB in size. You?re considering having 3 pools: > > 1) a metadata only system pool of 10 DWPD SSDs. 4K inodes, and a ton of small files that?ll fit in the inode. > 2) a data only ?hot? pool (i.e. the default pool for writes) of SSDs. > 3) a data only ?capacity? pool of 12 TB spinning disks. > > And let?s just say that you have looked back at the historical data you?ve collected and you see that over the last 6 months or so you?ve been averaging 10-12 TB of data being written into your existing filesystem per day. You want to do migrations between pools only on the weekends if at all possible. > > 12 * 7 = 84 TB. So if you had somewhere between 125 - 150 TB of SSDs ... 1 DWPD SSDs ? then in theory you should easily be able to handle your anticipated workload without coming close to exceeding the 1 DWPD rating of the SSDs. > > However, as the saying goes, while in theory there?s no difference between theory and practice, in practice there is ... so am I overlooking anything here from a GPFS perspective??? 
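A quick sanity check of the arithmetic in this thread, as shell arithmetic; the RAID-5 amplification factor is an assumption (full-stripe writes only, ignoring read-modify-write on partial stripes), and note that 30 x 6.4 TB drives in 4+1 RAID 5 give about 154 TB usable rather than the ~144 TB quoted:

# Assumed inputs from the thread: ~12 TB/day of host writes, 30 x 6.4 TB
# drives rated 1 DWPD, arranged as 6 LUNs of 4+1 RAID 5.
awk 'BEGIN {
  daily_tb  = 12;            # host writes per day, TB
  raw_tb    = 30 * 6.4;      # 192 TB of raw flash
  usable_tb = 6 * 4 * 6.4;   # 153.6 TB usable
  raid_amp  = 1.25;          # 5 drive writes per 4 data blocks on full stripes
  printf "drive writes per day vs raw capacity:    %.3f DWPD\n", daily_tb * raid_amp / raw_tb;
  printf "drive writes per day vs usable capacity: %.3f DWPD\n", daily_tb * raid_amp / usable_tb;
}'

Either way the estimate sits far below 1 DWPD, which supports Kevin's reasoning; the caveat is that the real write amplification should be measured rather than assumed, which is what Jonathan's smartctl figures below are about.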
> > If anybody still wants to respond on the DWPD rating of the SSDs they use for data, I?m still listening. > > Thanks? > > Kevin > > P.S. I still have a couple of ?outstanding issues? to respond to that I?ve posted to the list about previously: > > 1) the long I/O?s we see occasionally in the output of ?mmdiag ?iohist? on our NSD servers. We?re still trying to track that down ? it seems to happen only with a subset of our hardware - most of the time at least - but we?re still working to track down what triggers it ? i.e. at this point I can?t say whether it?s really the hardware or a user abusing the hardware. > > 2) I promised to post benchmark results of 3 different metadata configs: a) RAID 1 mirrors, b) a RAID 5 stripe, c) no RAID, but GPFS metadata replication of 3. That benchmarking has been put on hold for reasons I can?t really discuss on this mailing list at this time ? but hopefully soon. > > I haven?t forgotten the above and will respond back on the list when it?s appropriate. Thanks... > > On Mar 8, 2019, at 10:24 AM, Buterbaugh, Kevin L > wrote: > > Hi All, > > This is kind of a survey if you will, so for this one it might be best if you responded directly to me and I?ll summarize the results next week. > > Question 1 - do you use SSDs for data? If not - i.e. if you only use SSDs for metadata (as we currently do) - thanks, that?s all! If, however, you do use SSDs for data, please see Question 2. > > Question 2 - what is the DWPD (daily writes per day) of the SSDs that you use for data? > > Question 3 - is that different than the DWPD of the SSDs for metadata? > > Question 4 - any pertinent information in regards to your answers above (i.e. if you?ve got a filesystem that data is uploaded to only once and never modified after that then that?s useful to know!)? > > Thanks? > > Kevin > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C274d56e2906e4df3340a08d6abe6a61e%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C0%7C636885404456477052&sdata=eJ6XKuMQ3H4y8V1kyTd8%2ByGJX0rhlTqfcl0fce14pYA%3D&reserved=0 From alvise.dorigo at psi.ch Tue Mar 19 09:25:37 2019 From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI)) Date: Tue, 19 Mar 2019 09:25:37 +0000 Subject: [gpfsug-discuss] Calculate evicted space with a policy Message-ID: <83A6EEB0EC738F459A39439733AE804526840922@MBX214.d.ethz.ch> Dear users, is there a way (through a policy) to list the files (and their size) that are actually completely evicted by AFM from the cache filesystem ? I used a policy with the clause KB_ALLOCATED=0, but it is clearly not precise, because it also includes files that are not evicted, but are so small that they fit into their inodes (I'm assuming that GPFS inode structure has this feature similar to some regular filesystems, like ext4... otherwise I could not explain some non empty file with 0 allocated KB that have been fetched, i.e. non-evicted). Many thanks, Alvise -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From stockf at us.ibm.com Tue Mar 19 11:29:30 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Tue, 19 Mar 2019 11:29:30 +0000 Subject: [gpfsug-discuss] Calculate evicted space with a policy In-Reply-To: <83A6EEB0EC738F459A39439733AE804526840922@MBX214.d.ethz.ch> References: <83A6EEB0EC738F459A39439733AE804526840922@MBX214.d.ethz.ch> Message-ID: An HTML attachment was scrubbed... URL: From jonathan.buzzard at strath.ac.uk Tue Mar 19 12:10:08 2019 From: jonathan.buzzard at strath.ac.uk (Jonathan Buzzard) Date: Tue, 19 Mar 2019 12:10:08 +0000 Subject: [gpfsug-discuss] SSDs for data - DWPD? In-Reply-To: <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> References: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu> <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> Message-ID: <93a0eecad50c925708aa60f15aecb361d0a022df.camel@strath.ac.uk> On Mon, 2019-03-18 at 19:09 +0000, Buterbaugh, Kevin L wrote: [SNIP] > > 12 * 7 = 84 TB. So if you had somewhere between 125 - 150 TB of SSDs > ... 1 DWPD SSDs ? then in theory you should easily be able to handle > your anticipated workload without coming close to exceeding the 1 > DWPD rating of the SSDs. > > However, as the saying goes, while in theory there?s no difference > between theory and practice, in practice there is ... so am I > overlooking anything here from a GPFS perspective??? > > If anybody still wants to respond on the DWPD rating of the SSDs they > use for data, I?m still listening. I would be weary of write amplification in RAID coming to bite you in the ass. Just because you write 1TB of data to the file system does not mean the drives write 1TB of data, it could be 2TB of data. I would if you can look at the data written to the drives using smartctl if you are on a DSS or ESS or something similar if they are behind a conventional storage array. So for example on my DSS-G picking a random drive used for data which is an 8TB NL-SAS for the record shows the following in the output of smartctl -a Error counter log: Errors Corrected by Total Correction Gigabytes Total ECC rereads/ errors algorithm processed uncorrected fast | delayed rewrites corrected invocations [10^9 bytes] errors read: 682040270 0 0 682040270 0 116208.442 0 write: 0 0 0 0 0 34680.694 0 Looking at the gigabytes processed shows that 33TB has been written to the drive. These are lifetime figures for the drive, so there is no under reporting/estimation going on. If you can get these figures back you can calculate what drive writes you need because they encapsulate the RAID write amplification. JAB. -- Jonathan A. Buzzard Tel: +44141-5483420 HPC System Administrator, ARCHIE-WeSt. University of Strathclyde, John Anderson Building, Glasgow. G4 0NG From alvise.dorigo at psi.ch Tue Mar 19 12:09:10 2019 From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI)) Date: Tue, 19 Mar 2019 12:09:10 +0000 Subject: [gpfsug-discuss] Calculate evicted space with a policy In-Reply-To: References: <83A6EEB0EC738F459A39439733AE804526840922@MBX214.d.ethz.ch>, Message-ID: <83A6EEB0EC738F459A39439733AE8045268409B0@MBX214.d.ethz.ch> Thanks Fred. 
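A sketch of the scan Fred suggests (his reply is quoted just below): an mmapplypolicy list rule that selects files whose data is not cached, keyed on the 'u' flag in MISC_ATTRIBUTES rather than on KB_ALLOCATED, so small data-in-inode files that have been fetched are not miscounted. The list name, SHOW columns and invocation are illustrative assumptions:

/* evicted.pol -- run with: mmapplypolicy <fs-or-fileset-path> -P evicted.pol -I defer -f /tmp/evicted */
RULE EXTERNAL LIST 'evicted' EXEC ''
RULE 'listEvicted' LIST 'evicted'
     SHOW( VARCHAR(FILE_SIZE) || ' ' || VARCHAR(KB_ALLOCATED) )
     WHERE MISC_ATTRIBUTES LIKE '%F%'          /* regular files only */
       AND MISC_ATTRIBUTES NOT LIKE '%u%'      /* data not cached locally */

Summing the FILE_SIZE column of the resulting list file then gives the logical size of the evicted data.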
A ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Frederick Stock [stockf at us.ibm.com] Sent: Tuesday, March 19, 2019 12:29 PM To: gpfsug-discuss at spectrumscale.org Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Calculate evicted space with a policy You can scan for files using the MISC_ATTRIBUTES and look for those that are not cached, that is without the 'u' setting, and track their file size. I think that should work. Fred __________________________________________________ Fred Stock | IBM Pittsburgh Lab | 720-430-8821 stockf at us.ibm.com ----- Original message ----- From: "Dorigo Alvise (PSI)" Sent by: gpfsug-discuss-bounces at spectrumscale.org To: "gpfsug-discuss at spectrumscale.org" Cc: Subject: [gpfsug-discuss] Calculate evicted space with a policy Date: Tue, Mar 19, 2019 5:27 AM Dear users, is there a way (through a policy) to list the files (and their size) that are actually completely evicted by AFM from the cache filesystem ? I used a policy with the clause KB_ALLOCATED=0, but it is clearly not precise, because it also includes files that are not evicted, but are so small that they fit into their inodes (I'm assuming that GPFS inode structure has this feature similar to some regular filesystems, like ext4... otherwise I could not explain some non empty file with 0 allocated KB that have been fetched, i.e. non-evicted). Many thanks, Alvise _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From nathan.harper at cfms.org.uk Tue Mar 19 13:36:26 2019 From: nathan.harper at cfms.org.uk (Nathan Harper) Date: Tue, 19 Mar 2019 13:36:26 +0000 Subject: [gpfsug-discuss] SSDs for data - DWPD? In-Reply-To: <93a0eecad50c925708aa60f15aecb361d0a022df.camel@strath.ac.uk> References: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu> <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> <93a0eecad50c925708aa60f15aecb361d0a022df.camel@strath.ac.uk> Message-ID: It has been interesting to watch the evolution of the same discussion over on the Ceph Users mailing list over the last few years. Obviously GPFS and Ceph are used differently, so the comparison isn't direct, but the attitudes have generally shifted from recommending only high DWPD drives to the lower (or sometimes even lowest) tiers. The reasoning tends to be that often you will write less data than you think, and also drives often last longer than their rating. We have an all SSD (Samsung SM863a) Ceph cluster backing an Openstack system that's been in production for ~12 months, and the drives are all reporting 97%+ endurance remaining. It's not the busiest of storage backends, but Ceph is a notorious write amplifier and I'm more than happy with the ongoing endurance that I expect to see. On Tue, 19 Mar 2019 at 12:10, Jonathan Buzzard < jonathan.buzzard at strath.ac.uk> wrote: > On Mon, 2019-03-18 at 19:09 +0000, Buterbaugh, Kevin L wrote: > > [SNIP] > > > > > 12 * 7 = 84 TB. So if you had somewhere between 125 - 150 TB of SSDs > > ... 1 DWPD SSDs ? then in theory you should easily be able to handle > > your anticipated workload without coming close to exceeding the 1 > > DWPD rating of the SSDs. > > > > However, as the saying goes, while in theory there?s no difference > > between theory and practice, in practice there is ... 
so am I > > overlooking anything here from a GPFS perspective??? > > > > If anybody still wants to respond on the DWPD rating of the SSDs they > > use for data, I?m still listening. > > I would be weary of write amplification in RAID coming to bite you in > the ass. Just because you write 1TB of data to the file system does not > mean the drives write 1TB of data, it could be 2TB of data. > > I would if you can look at the data written to the drives using > smartctl if you are on a DSS or ESS or something similar if they are > behind a conventional storage array. > > So for example on my DSS-G picking a random drive used for data which > is an 8TB NL-SAS for the record shows the following in the output of > smartctl -a > > Error counter log: > Errors Corrected by Total Correction Gigabytes > Total > ECC rereads/ errors algorithm processed > uncorrected > fast | delayed rewrites corrected invocations [10^9 > bytes] errors > read: 682040270 0 0 682040270 0 116208.442 > 0 > write: 0 0 0 0 0 34680.694 > 0 > > > Looking at the gigabytes processed shows that 33TB has been written to > the drive. These are lifetime figures for the drive, so there is no > under reporting/estimation going on. > > If you can get these figures back you can calculate what drive writes > you need because they encapsulate the RAID write amplification. > > JAB. > > -- > Jonathan A. Buzzard Tel: +44141-5483420 > HPC System Administrator, ARCHIE-WeSt. > University of Strathclyde, John Anderson Building, Glasgow. G4 0NG > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From valdis.kletnieks at vt.edu Tue Mar 19 17:22:32 2019 From: valdis.kletnieks at vt.edu (Valdis Kl=?utf-8?Q?=c4=93?=tnieks) Date: Tue, 19 Mar 2019 13:22:32 -0400 Subject: [gpfsug-discuss] SSDs for data - DWPD? In-Reply-To: <93a0eecad50c925708aa60f15aecb361d0a022df.camel@strath.ac.uk> References: <7B8A565F-94B7-419E-A2D0-35FE1C898BB6@vanderbilt.edu> <9C6708C6-B45D-4947-A8FD-07FEBE9CE131@vanderbilt.edu> <93a0eecad50c925708aa60f15aecb361d0a022df.camel@strath.ac.uk> Message-ID: <28273.1553016152@turing-police> On Tue, 19 Mar 2019 12:10:08 -0000, Jonathan Buzzard said: > I would be weary of write amplification in RAID coming to bite you in > the ass. Just because you write 1TB of data to the file system does not > mean the drives write 1TB of data, it could be 2TB of data. Right, but that 2T would be across multiple drives. That's part of why write amplification can cause problems - many RAID subsystems are unable to do the writes in true parallel across the drives involved. From chris.schlipalius at pawsey.org.au Wed Mar 20 23:10:11 2019 From: chris.schlipalius at pawsey.org.au (Chris Schlipalius) Date: Thu, 21 Mar 2019 07:10:11 +0800 Subject: [gpfsug-discuss] SCA19 presentations are live! Message-ID: <694B7DC7-4746-4DE2-B478-50CA2D33F906@pawsey.org.au> Please see https://www.spectrumscaleug.org/presentations/ for the latest uploads. There are some excellent contemporary talks. Regards, Chris Schlipalius Team Lead, Data Storage Infrastructure, Data & Visualisation, Pawsey Supercomputing Centre (CSIRO) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From chair at spectrumscale.org Thu Mar 21 09:08:43 2019 From: chair at spectrumscale.org (Simon Thompson (Spectrum Scale UG Chair)) Date: Thu, 21 Mar 2019 09:08:43 +0000 Subject: [gpfsug-discuss] SSUG Website updates Message-ID: <1d056d6a-98ba-40e5-9c8b-265c9946574d@email.android.com> An HTML attachment was scrubbed... URL: From alvise.dorigo at psi.ch Thu Mar 21 13:22:45 2019 From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI)) Date: Thu, 21 Mar 2019 13:22:45 +0000 Subject: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Message-ID: <83A6EEB0EC738F459A39439733AE80452684217D@MBX214.d.ethz.ch> Hi, I'm a little bit puzzled about different meanings of blocksize for different GPFS installation (standard and gnr). >From this page https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/File%20System%20Planning I read: * The blocksize is the largest size IO that GPFS can issue to the underlying device * A subblock is 1/32nd of blocksize. This is the smallest allocation to a single file For non-gnr GPFS device is quite clear to me (I hope): it is a single spinning disk (or ssd). And I verified this on a small cluster composed of nsd using their local hard drive. Can someone explain what is the "device" in the case of GNR ? a single pdisk ? Thanks, Alvise -------------- next part -------------- An HTML attachment was scrubbed... URL: From david_johnson at brown.edu Thu Mar 21 13:32:04 2019 From: david_johnson at brown.edu (david_johnson at brown.edu) Date: Thu, 21 Mar 2019 09:32:04 -0400 Subject: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR In-Reply-To: <83A6EEB0EC738F459A39439733AE80452684217D@MBX214.d.ethz.ch> References: <83A6EEB0EC738F459A39439733AE80452684217D@MBX214.d.ethz.ch> Message-ID: The underlying device in this context is the NSD, network storage device. This has relation at all to 512 byte or 4K disk blocks. Usually around a meg, always a power of two. -- ddj Dave Johnson > On Mar 21, 2019, at 9:22 AM, Dorigo Alvise (PSI) wrote: > > Hi, > I'm a little bit puzzled about different meanings of blocksize for different GPFS installation (standard and gnr). > > From this page https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/File%20System%20Planning > > I read: > The blocksize is the largest size IO that GPFS can issue to the underlying device > A subblock is 1/32nd of blocksize. This is the smallest allocation to a single file > For non-gnr GPFS device is quite clear to me (I hope): it is a single spinning disk (or ssd). And I verified this on a small cluster composed of nsd using their local hard drive. > > Can someone explain what is the "device" in the case of GNR ? a single pdisk ? > > Thanks, > > Alvise > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From david_johnson at brown.edu Thu Mar 21 15:05:14 2019 From: david_johnson at brown.edu (David Johnson) Date: Thu, 21 Mar 2019 11:05:14 -0400 Subject: [gpfsug-discuss] Spectrum Scale Standard 4.2.3-13 download broken Message-ID: <9D0D3D23-923D-4C48-B775-BD9818CB2DD6@brown.edu> I tried twice to download the latest PTF, but the md5sum did not match and the package will not install. Succeeded with the Protocols version. 
There is no link on the web page to report problems, so I'm posting here hoping someone can get it fixed. -- ddj From constance.rice at us.ibm.com Thu Mar 21 15:43:39 2019 From: constance.rice at us.ibm.com (Constance M Rice) Date: Thu, 21 Mar 2019 15:43:39 +0000 Subject: [gpfsug-discuss] interested in people who have implemented immutability. what was your use case? Message-ID: Hello, I'm trying to understand how and why people have been using the immutability (WORM) functions that became available in scale 4.2. I don't need a company name, just an idea of why and how you use it. TIA Connie Rice Storage Specialist Washington Systems Center Mobile: 202-821-6747 E-mail: constance.rice at us.ibm.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 93815 bytes Desc: not available URL: From rpergamin at ddn.com Thu Mar 21 15:52:56 2019 From: rpergamin at ddn.com (Ran Pergamin) Date: Thu, 21 Mar 2019 15:52:56 +0000 Subject: [gpfsug-discuss] interested in people who have implemented immutability. what was your use case? In-Reply-To: References: Message-ID: <153DB3CA-9EB1-4F09-BB16-BBDE345137EE@ddn.com> Hi, A customer of mine uses it to store its own customers Digital receipts that they need to store for 12years, due to government regulations, without ability to delete whatsoever. It?s also replicated across two sites. Another uses it to protect critical video digital assets. Regards, Ran On 21 Mar 2019, at 17:43, Constance M Rice > wrote: Hello, I'm trying to understand how and why people have been using the immutability (WORM) functions that became available in scale 4.2. I don't need a company name, just an idea of why and how you use it. TIA Connie Rice Storage Specialist Washington Systems Center Mobile: 202-821-6747 E-mail: constance.rice at us.ibm.com _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00001.jpg Type: image/jpeg Size: 93815 bytes Desc: ATT00001.jpg URL: From S.J.Thompson at bham.ac.uk Thu Mar 21 17:05:25 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Thu, 21 Mar 2019 17:05:25 +0000 Subject: [gpfsug-discuss] interested in people who have implemented immutability. what was your use case? In-Reply-To: References: Message-ID: We've used soft immutability before for data associated with specific research projects. Soft so that we can remove data as "root" - sometimes research data is a partner's and there are cases when they require us to be able to remove their data. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of constance.rice at us.ibm.com [constance.rice at us.ibm.com] Sent: 21 March 2019 15:43 To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] interested in people who have implemented immutability. what was your use case? Hello, I'm trying to understand how and why people have been using the immutability (WORM) functions that became available in scale 4.2. I don't need a company name, just an idea of why and how you use it. 
TIA Connie Rice Storage Specialist Washington Systems Center Mobile: 202-821-6747 E-mail: constance.rice at us.ibm.com [cid:_2_73D2E57873D2E1D80056641B852583C4] -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00001.jpg Type: image/jpeg Size: 93815 bytes Desc: ATT00001.jpg URL: From daniel.kidger at uk.ibm.com Thu Mar 21 17:15:06 2019 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 21 Mar 2019 17:15:06 +0000 Subject: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR In-Reply-To: References: , <83A6EEB0EC738F459A39439733AE80452684217D@MBX214.d.ethz.ch> Message-ID: An HTML attachment was scrubbed... URL: From oehmes at gmail.com Thu Mar 21 17:35:13 2019 From: oehmes at gmail.com (Sven Oehme) Date: Thu, 21 Mar 2019 10:35:13 -0700 Subject: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR In-Reply-To: References: <83A6EEB0EC738F459A39439733AE80452684217D@MBX214.d.ethz.ch> Message-ID: <6216BA50-4921-43C7-8021-97DC6C0553E1@gmail.com> Lots of details in a presentation I did last year before I left IBM ? http://files.gpfsug.org/presentations/2018/Singapore/Sven_Oehme_ESS_in_CORAL_project_update.pdf Sven From: on behalf of Daniel Kidger Reply-To: gpfsug main discussion list Date: Thursday, March 21, 2019 at 10:15 AM To: Cc: Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Alvise, Also note that that DeveloperWorks page was maintained by Scott Faddon. He has since left IBM and that page has unfortunately not been updated for almost 2 years. :-( This page predates the current version 5.x of SpectrumScale which has been available since the beginning of 2018. In version 5.x. the statement that there are 32 sub-blocks in one block is no longer true. Now, by default you get a 4MiB Filesystem blocksize that has 512 sub0clocks, each 8192 bytes long. Daniel _________________________________________________________ Daniel Kidger IBM Technical Sales Specialist Spectrum Scale, Spectrum NAS and IBM Cloud Object Store +44-(0)7818 522 266 daniel.kidger at uk.ibm.com ----- Original message ----- From: david_johnson at brown.edu Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Date: Thu, Mar 21, 2019 1:32 PM The underlying device in this context is the NSD, network storage device. This has relation at all to 512 byte or 4K disk blocks. Usually around a meg, always a power of two. -- ddj Dave Johnson On Mar 21, 2019, at 9:22 AM, Dorigo Alvise (PSI) wrote: Hi, I'm a little bit puzzled about different meanings of blocksize for different GPFS installation (standard and gnr). >From this page https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/File%20System%20Planning I read: The blocksize is the largest size IO that GPFS can issue to the underlying device A subblock is 1/32nd of blocksize. This is the smallest allocation to a single file For non-gnr GPFS device is quite clear to me (I hope): it is a single spinning disk (or ssd). And I verified this on a small cluster composed of nsd using their local hard drive. Can someone explain what is the "device" in the case of GNR ? a single pdisk ? 
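To put Daniel's figures above into arithmetic, a few lines of Python (purely illustrative; the 4 MiB and 8 KiB values are the ones he gives, the rest just follows from them):

BLOCK = 4 * 1024 * 1024      # 4 MiB default filesystem blocksize in Scale 5.x, per Daniel above
SUBBLOCK = 8192              # 8 KiB subblock size he quotes
print(BLOCK // SUBBLOCK)     # 512 subblocks per block -- no longer the fixed 32
print(BLOCK // 32)           # 131072 bytes: what the old 1/32nd rule would have given (128 KiB)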
Thanks, Alvise _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From scale at us.ibm.com Fri Mar 22 07:44:13 2019 From: scale at us.ibm.com (IBM Spectrum Scale) Date: Fri, 22 Mar 2019 15:44:13 +0800 Subject: [gpfsug-discuss] Spectrum Scale Standard 4.2.3-13 download broken In-Reply-To: <9D0D3D23-923D-4C48-B775-BD9818CB2DD6@brown.edu> References: <9D0D3D23-923D-4C48-B775-BD9818CB2DD6@brown.edu> Message-ID: David, Thanks for reporting this download issue. The mail is forwarded Regards, The Spectrum Scale (GPFS) team ------------------------------------------------------------------------------------------------------------------ If you feel that your question can benefit other users of Spectrum Scale (GPFS), then please post it to the public IBM developerWroks Forum at https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479 . If your query concerns a potential software error in Spectrum Scale (GPFS) and you have an IBM software maintenance contract please contact 1-800-237-5511 in the United States or your local IBM Service Center in other countries. The forum is informally monitored as time permits and should not be used for priority messages to the Spectrum Scale (GPFS) team. From: David Johnson To: gpfsug main discussion list Date: 03/21/2019 11:08 PM Subject: [gpfsug-discuss] Spectrum Scale Standard 4.2.3-13 download broken Sent by: gpfsug-discuss-bounces at spectrumscale.org I tried twice to download the latest PTF, but the md5sum did not match and the package will not install. Succeeded with the Protocols version. There is no link on the web page to report problems, so I'm posting here hoping someone can get it fixed. -- ddj _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=IbxtjdkPAM2Sbon4Lbbi4w&m=Fe4BhsvYhtd9TWJ6GNNKMfHPfXt3A2XPD3h-62brD7w&s=NyLt02TR8u1dYAv22BEaCX-n2N_gdQ-9MBqVrwbBATc&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: From alvise.dorigo at psi.ch Fri Mar 22 09:16:06 2019 From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI)) Date: Fri, 22 Mar 2019 09:16:06 +0000 Subject: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR In-Reply-To: <6216BA50-4921-43C7-8021-97DC6C0553E1@gmail.com> References: <83A6EEB0EC738F459A39439733AE80452684217D@MBX214.d.ethz.ch> , <6216BA50-4921-43C7-8021-97DC6C0553E1@gmail.com> Message-ID: <83A6EEB0EC738F459A39439733AE80452684260B@MBX214.d.ethz.ch> Thank you all guys for the answers. Just a quick question: what is the difference between gpfsperf and tsqosperf ? I knew the former, but not the latter (mentioned in your presentation). 
Do they do I/O test in different ways ? thanks, Alvise ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sven Oehme [oehmes at gmail.com] Sent: Thursday, March 21, 2019 6:35 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Lots of details in a presentation I did last year before I left IBM --> http://files.gpfsug.org/presentations/2018/Singapore/Sven_Oehme_ESS_in_CORAL_project_update.pdf Sven From: on behalf of Daniel Kidger Reply-To: gpfsug main discussion list Date: Thursday, March 21, 2019 at 10:15 AM To: Cc: Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Alvise, Also note that that DeveloperWorks page was maintained by Scott Faddon. He has since left IBM and that page has unfortunately not been updated for almost 2 years. :-( This page predates the current version 5.x of SpectrumScale which has been available since the beginning of 2018. In version 5.x. the statement that there are 32 sub-blocks in one block is no longer true. Now, by default you get a 4MiB Filesystem blocksize that has 512 sub0clocks, each 8192 bytes long. Daniel _________________________________________________________ Daniel Kidger IBM Technical Sales Specialist Spectrum Scale, Spectrum NAS and IBM Cloud Object Store +44-(0)7818 522 266 daniel.kidger at uk.ibm.com ----- Original message ----- From: david_johnson at brown.edu Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Date: Thu, Mar 21, 2019 1:32 PM The underlying device in this context is the NSD, network storage device. This has relation at all to 512 byte or 4K disk blocks. Usually around a meg, always a power of two. -- ddj Dave Johnson On Mar 21, 2019, at 9:22 AM, Dorigo Alvise (PSI) > wrote: Hi, I'm a little bit puzzled about different meanings of blocksize for different GPFS installation (standard and gnr). >From this page https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/File%20System%20Planning I read: * The blocksize is the largest size IO that GPFS can issue to the underlying device * A subblock is 1/32nd of blocksize. This is the smallest allocation to a single file For non-gnr GPFS device is quite clear to me (I hope): it is a single spinning disk (or ssd). And I verified this on a small cluster composed of nsd using their local hard drive. Can someone explain what is the "device" in the case of GNR ? a single pdisk ? Thanks, Alvise _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From nnasef at us.ibm.com Fri Mar 22 13:18:55 2019 From: nnasef at us.ibm.com (Nariman Nasef) Date: Fri, 22 Mar 2019 13:18:55 +0000 Subject: [gpfsug-discuss] Spectrum Scale Standard 4.2.3-13 downloadbroken In-Reply-To: References: , <9D0D3D23-923D-4C48-B775-BD9818CB2DD6@brown.edu> Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.15532604919330.png Type: image/png Size: 15543 bytes Desc: not available URL: From oehmes at gmail.com Fri Mar 22 13:44:21 2019 From: oehmes at gmail.com (Sven Oehme) Date: Fri, 22 Mar 2019 06:44:21 -0700 Subject: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR In-Reply-To: <83A6EEB0EC738F459A39439733AE80452684260B@MBX214.d.ethz.ch> References: <83A6EEB0EC738F459A39439733AE80452684217D@MBX214.d.ethz.ch> <6216BA50-4921-43C7-8021-97DC6C0553E1@gmail.com> <83A6EEB0EC738F459A39439733AE80452684260B@MBX214.d.ethz.ch> Message-ID: Hi, They are slightly different versions of the same tool. You should use gpfsperf as this is the one pre-packaged with newer versions of Scale. Sven From: on behalf of "Dorigo Alvise (PSI)" Reply-To: gpfsug main discussion list Date: Friday, March 22, 2019 at 2:18 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Thank you all guys for the answers. Just a quick question: what is the difference between gpfsperf and tsqosperf ? I knew the former, but not the latter (mentioned in your presentation). Do they do I/O test in different ways ? thanks, Alvise From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sven Oehme [oehmes at gmail.com] Sent: Thursday, March 21, 2019 6:35 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Lots of details in a presentation I did last year before I left IBM ? http://files.gpfsug.org/presentations/2018/Singapore/Sven_Oehme_ESS_in_CORAL_project_update.pdf Sven From: on behalf of Daniel Kidger Reply-To: gpfsug main discussion list Date: Thursday, March 21, 2019 at 10:15 AM To: Cc: Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Alvise, Also note that that DeveloperWorks page was maintained by Scott Faddon. He has since left IBM and that page has unfortunately not been updated for almost 2 years. :-( This page predates the current version 5.x of SpectrumScale which has been available since the beginning of 2018. In version 5.x. the statement that there are 32 sub-blocks in one block is no longer true. Now, by default you get a 4MiB Filesystem blocksize that has 512 sub0clocks, each 8192 bytes long. Daniel _________________________________________________________ Daniel Kidger IBM Technical Sales Specialist Spectrum Scale, Spectrum NAS and IBM Cloud Object Store +44-(0)7818 522 266 daniel.kidger at uk.ibm.com ----- Original message ----- From: david_johnson at brown.edu Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Clarification about blocksize in stardanrd gpfs and GNR Date: Thu, Mar 21, 2019 1:32 PM The underlying device in this context is the NSD, network storage device. This has relation at all to 512 byte or 4K disk blocks. Usually around a meg, always a power of two. 
-- ddj Dave Johnson On Mar 21, 2019, at 9:22 AM, Dorigo Alvise (PSI) wrote: Hi, I'm a little bit puzzled about different meanings of blocksize for different GPFS installation (standard and gnr). >From this page https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/File%20System%20Planning I read: ? The blocksize is the largest size IO that GPFS can issue to the underlying device ? A subblock is 1/32nd of blocksize. This is the smallest allocation to a single file For non-gnr GPFS device is quite clear to me (I hope): it is a single spinning disk (or ssd). And I verified this on a small cluster composed of nsd using their local hard drive. Can someone explain what is the "device" in the case of GNR ? a single pdisk ? Thanks, Alvise _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From david_johnson at brown.edu Fri Mar 22 14:12:54 2019 From: david_johnson at brown.edu (david_johnson at brown.edu) Date: Fri, 22 Mar 2019 10:12:54 -0400 Subject: [gpfsug-discuss] Spectrum Scale Standard 4.2.3-13 downloadbroken In-Reply-To: References: <9D0D3D23-923D-4C48-B775-BD9818CB2DD6@brown.edu> Message-ID: <96F378B5-4239-420F-B212-09E97E97DF47@brown.edu> Thank you, I trust that it is fixed now, will check it when I have a chance. The protocols version allowed me to proceed, just didn?t want others to run into the same issue. -- ddj Dave Johnson > On Mar 22, 2019, at 9:18 AM, Nariman Nasef wrote: > > This issue has been addressed yesterday, as per David's message. > David, you should have also received confirmation email from me. > > We have identified the issue, and reposted the packages. Please let us know if this has not been resolved. > > Thanks > Nariman > > > Nariman Nasef, M.Eng., PMP?, PMI-ACP? > Senior Program Manager > Spectrum Scale, IBM Systems > Phone: 1-626-390-6124 > > "If you do not change direction, you will end up where you are heading". > > > > ----- Original message ----- > From: "IBM Spectrum Scale" > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Spectrum Scale Standard 4.2.3-13 download broken > Date: Fri, Mar 22, 2019 12:44 AM > > David, > > Thanks for reporting this download issue. 
The mail is forwarded > > Regards, The Spectrum Scale (GPFS) team > > ------------------------------------------------------------------------------------------------------------------ > If you feel that your question can benefit other users of Spectrum Scale (GPFS), then please post it to the public IBM developerWroks Forum at https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479. > > If your query concerns a potential software error in Spectrum Scale (GPFS) and you have an IBM software maintenance contract please contact 1-800-237-5511 in the United States or your local IBM Service Center in other countries. > > The forum is informally monitored as time permits and should not be used for priority messages to the Spectrum Scale (GPFS) team. > > > > From: David Johnson > To: gpfsug main discussion list > Date: 03/21/2019 11:08 PM > Subject: [gpfsug-discuss] Spectrum Scale Standard 4.2.3-13 download broken > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > I tried twice to download the latest PTF, but the md5sum did not match and the package will not install. > Succeeded with the Protocols version. There is no link on the web page to report problems, so I'm posting > here hoping someone can get it fixed. > > -- ddj > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.childs at qmul.ac.uk Mon Mar 25 09:38:17 2019 From: p.childs at qmul.ac.uk (Peter Childs) Date: Mon, 25 Mar 2019 09:38:17 +0000 Subject: [gpfsug-discuss] mmlsquota output Message-ID: <245fe541e001b27016ea13287cee72e930330977.camel@qmul.ac.uk> Can someone tell me I'm not reading this wrong. This is using Spectrum Scale 5.0.2-1 It looks like the output from mmlsquota is not what it says In the man page it says, mmlsquota [-u User | -g Group] [-v | -q] [-e] [-C ClusterName] [-Y] [--block-size {BlockSize | auto}] [Device[:Fileset] ...] however mmlsquota -u username fs:fileset Return the output for every fileset, not just the "fileset" I've asked for, this is same output as mmlsquota -u username fs Where I've not said the fileset. I can work around this, but I'm just checking this is not actually a bug, that ought to be fixed. Long story is that I'm working on rewriting our quota report util that used be a long bash/awk script into a more easy to understand python script, and I want to get the user quota info for just one fileset. Thanks in advance. -- Peter Childs ITS Research Storage Queen Mary, University of London From robert.horton at icr.ac.uk Mon Mar 25 09:52:18 2019 From: robert.horton at icr.ac.uk (Robert Horton) Date: Mon, 25 Mar 2019 09:52:18 +0000 Subject: [gpfsug-discuss] mmlsquota output In-Reply-To: <245fe541e001b27016ea13287cee72e930330977.camel@qmul.ac.uk> References: <245fe541e001b27016ea13287cee72e930330977.camel@qmul.ac.uk> Message-ID: I don't know the answer to your actual question, but have you thought about using the REST-API rather than parsing the command outputs? 
I can send over the Python stuff we're using if you mail me off list. Rob On Mon, 2019-03-25 at 09:38 +0000, Peter Childs wrote: > Can someone tell me I'm not reading this wrong. > > This is using Spectrum Scale 5.0.2-1 > > It looks like the output from mmlsquota is not what it says > > In the man page it says, > > mmlsquota [-u User | -g Group] [-v | -q] [-e] [-C ClusterName] > [-Y] [--block-size {BlockSize | auto}] [Device[:Fileset] > ...] > > however > > mmlsquota -u username fs:fileset > > Return the output for every fileset, not just the "fileset" I've > asked > for, this is same output as > > mmlsquota -u username fs > > Where I've not said the fileset. > > I can work around this, but I'm just checking this is not actually a > bug, that ought to be fixed. > > Long story is that I'm working on rewriting our quota report util > that > used be a long bash/awk script into a more easy to understand python > script, and I want to get the user quota info for just one fileset. > > Thanks in advance. > > -- Robert Horton | Research Data Storage Lead The Institute of Cancer Research | 237 Fulham Road | London | SW3 6JB T +44 (0)20 7153 5350 | E robert.horton at icr.ac.uk | W www.icr.ac.uk | Twitter @ICR_London Facebook: www.facebook.com/theinstituteofcancerresearch The Institute of Cancer Research: Royal Cancer Hospital, a charitable Company Limited by Guarantee, Registered in England under Company No. 534147 with its Registered Office at 123 Old Brompton Road, London SW7 3RP. This e-mail message is confidential and for use by the addressee only. If the message is received by anyone other than the addressee, please return the message to the sender by replying to it and then delete the message from your computer and network. From stockf at us.ibm.com Mon Mar 25 10:34:04 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Mon, 25 Mar 2019 10:34:04 +0000 Subject: [gpfsug-discuss] mmlsquota output In-Reply-To: References: , <245fe541e001b27016ea13287cee72e930330977.camel@qmul.ac.uk> Message-ID: An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Mon Mar 25 11:40:56 2019 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Mon, 25 Mar 2019 11:40:56 +0000 Subject: [gpfsug-discuss] Reminder - Registration now open - US User Group Meeting, April 16-17th, NCAR Boulder In-Reply-To: References: Message-ID: <4AD46697-CA3E-4D7D-B73F-D33DCA66686D@nuance.com> 3 weeks away! Registration is now open: https://www.eventbrite.com/e/spectrum-scale-gpfs-user-group-us-spring-2019-meeting-tickets-57035376346 This is a FREE event and gives a great opportunity to interact with your peers and get direct access to the IBM team. - April 15th: Informal social gathering on Monday for those arriving early (location TBD) - April 16th: Full day of talks from IBM and the user community, Social and Networking Event (and breakfast, lunch) - April 17th: Talks and breakout sessions Looking forward to seeing everyone in Boulder! Bob Oesterlin/Kristy Kallback-Rose -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc.caubet at psi.ch Tue Mar 26 15:39:30 2019 From: marc.caubet at psi.ch (Caubet Serrabou Marc (PSI)) Date: Tue, 26 Mar 2019 15:39:30 +0000 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Message-ID: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch> Hi all, according to several GPFS presentations as well as according to the man pages: Table 1. 
Block sizes and subblock sizes +-------------------------------+-------------------------------+ | Block size | Subblock size | +-------------------------------+-------------------------------+ | 64 KiB | 2 KiB | +-------------------------------+-------------------------------+ | 128 KiB | 4 KiB | +-------------------------------+-------------------------------+ | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | | MiB, 4 MiB | | +-------------------------------+-------------------------------+ | 8 MiB, 16 MiB | 16 KiB | +-------------------------------+-------------------------------+ A block size of 8MiB or 16MiB should contain subblocks of 16KiB. However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: [root at merlindssio01 ~]# mmlsfs merlin flag value description ------------------- ------------------------ ----------------------------------- -f 8192 Minimum fragment (subblock) size in bytes (system pool) 131072 Minimum fragment (subblock) size in bytes (other pools) -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes . . . -n 128 Estimated number of nodes that will mount file system -B 1048576 Block size (system pool) 16777216 Block size (other pools) . . . What am I missing? According to documentation, I expect this to be a fixed value, or it isn't at all? On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? Thanks a lot and best regards, Marc _________________________________________ Paul Scherrer Institut High Performance Computing Marc Caubet Serrabou Building/Room: WHGA/019A Forschungsstrasse, 111 5232 Villigen PSI Switzerland Telephone: +41 56 310 46 67 E-Mail: marc.caubet at psi.ch -------------- next part -------------- An HTML attachment was scrubbed... URL: From oehmes at gmail.com Tue Mar 26 16:08:47 2019 From: oehmes at gmail.com (Sven Oehme) Date: Tue, 26 Mar 2019 09:08:47 -0700 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Message-ID: I know this will be very confusing, but the code works different than one would think (not sure this is documented anywhere). The number of subblocks across pools of a fileystsem is calculated based on the smallest pools blocksize. So given you have a 1MB blocksize in the system pool you will end up with 128 subblocks, now you have a 2nd pool (data) which will inherit the 128 subblocks and the code calculates a subblock size of 128k (16M/128=128k). Sven From: on behalf of "Caubet Serrabou Marc (PSI)" Reply-To: gpfsug main discussion list Date: Tuesday, March 26, 2019 at 8:46 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Hi all, according to several GPFS presentations as well as according to the man pages: Table 1. Block sizes and subblock sizes +-------------------------------+-------------------------------+ | Block size | Subblock size | +-------------------------------+-------------------------------+ | 64 KiB | 2 KiB | +-------------------------------+-------------------------------+ | 128 KiB | 4 KiB | +-------------------------------+-------------------------------+ | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | | MiB, 4 MiB | | +-------------------------------+-------------------------------+ | 8 MiB, 16 MiB | 16 KiB | +-------------------------------+-------------------------------+ A block size of 8MiB or 16MiB should contain subblocks of 16KiB. 
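Sven's rule above can be checked against Marc's mmlsfs output with a few lines of Python. The per-blocksize subblock table is the one quoted from the man page in this thread, and the rest is just the division Sven describes; this is a sketch of the arithmetic, not the actual Scale code.

# Subblock size (KiB) a pool would get on its own, from the man-page table quoted in this thread
SUBBLOCK_KIB = {64: 2, 128: 4, 256: 8, 512: 8, 1024: 8, 2048: 8, 4096: 8, 8192: 16, 16384: 16}

def fragment_sizes_bytes(pool_blocksizes_kib):
    # The pool with the smallest blocksize fixes the subblock count for the whole filesystem
    smallest = min(pool_blocksizes_kib)
    n_subblocks = smallest // SUBBLOCK_KIB[smallest]          # 1024 / 8 -> 128 subblocks
    return {bs: (bs // n_subblocks) * 1024 for bs in pool_blocksizes_kib}

# Marc's filesystem: 1 MiB system pool, 16 MiB data pool
print(fragment_sizes_bytes([1024, 16384]))   # {1024: 8192, 16384: 131072} -- matches the mmlsfs -f values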
However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: [root at merlindssio01 ~]# mmlsfs merlin flag value description ------------------- ------------------------ ----------------------------------- -f 8192 Minimum fragment (subblock) size in bytes (system pool) 131072 Minimum fragment (subblock) size in bytes (other pools) -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes . . . -n 128 Estimated number of nodes that will mount file system -B 1048576 Block size (system pool) 16777216 Block size (other pools) . . . What am I missing? According to documentation, I expect this to be a fixed value, or it isn't at all? On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? Thanks a lot and best regards, Marc _________________________________________ Paul Scherrer Institut High Performance Computing Marc Caubet Serrabou Building/Room: WHGA/019A Forschungsstrasse, 111 5232 Villigen PSI Switzerland Telephone: +41 56 310 46 67 E-Mail: marc.caubet at psi.ch _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From mhennecke at lenovo.com Tue Mar 26 15:56:37 2019 From: mhennecke at lenovo.com (Michael Hennecke) Date: Tue, 26 Mar 2019 15:56:37 +0000 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Message-ID: <3AACBCBC085C694CAF2FBC38C3B9DC48DDDA4661@usmailmbx04> Hi, you have two storage pools. The calculation of the number of subblocks is performed on the system pool with 1MiB blocksize --> subblock size of 8kiB --> 128 subblocks. The other pool inherits the "128 subblocks", as all pools in a filesystem will have the same number of subblocks. Mit freundlichen Gr?ssen / Best regards, Michael Hennecke HPC Chief Technologist - HPC and AI Business Unit -- Lenovo Global Technology (Germany) GmbH * Am Zehnthof 77 * D-45307 Essen * Germany Gesch?ftsf?hrung: Colm Gleeson, Christophe Laurent * Sitz der Gesellschaft: Stuttgart * HRB-Nr.: 758298, AG Stuttgart From: gpfsug-discuss-bounces at spectrumscale.org On Behalf Of Caubet Serrabou Marc (PSI) Sent: Tuesday, 26 March, 2019 16:40 To: gpfsug main discussion list Subject: [External] [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Hi all, according to several GPFS presentations as well as according to the man pages: Table 1. Block sizes and subblock sizes +-------------------------------+-------------------------------+ | Block size | Subblock size | +-------------------------------+-------------------------------+ | 64 KiB | 2 KiB | +-------------------------------+-------------------------------+ | 128 KiB | 4 KiB | +-------------------------------+-------------------------------+ | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | | MiB, 4 MiB | | +-------------------------------+-------------------------------+ | 8 MiB, 16 MiB | 16 KiB | +-------------------------------+-------------------------------+ A block size of 8MiB or 16MiB should contain subblocks of 16KiB. 
However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: [root at merlindssio01 ~]# mmlsfs merlin flag value description ------------------- ------------------------ ----------------------------------- -f 8192 Minimum fragment (subblock) size in bytes (system pool) 131072 Minimum fragment (subblock) size in bytes (other pools) -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes . . . -n 128 Estimated number of nodes that will mount file system -B 1048576 Block size (system pool) 16777216 Block size (other pools) . . . What am I missing? According to documentation, I expect this to be a fixed value, or it isn't at all? On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? Thanks a lot and best regards, Marc _________________________________________ Paul Scherrer Institut High Performance Computing Marc Caubet Serrabou Building/Room: WHGA/019A Forschungsstrasse, 111 5232 Villigen PSI Switzerland Telephone: +41 56 310 46 67 E-Mail: marc.caubet at psi.ch -------------- next part -------------- An HTML attachment was scrubbed... URL: From alvise.dorigo at psi.ch Tue Mar 26 16:27:12 2019 From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI)) Date: Tue, 26 Mar 2019 16:27:12 +0000 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch> References: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch> Message-ID: <83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> Hi Marc, "Indirect block size" is well explained in this presentation: http://files.gpfsug.org/presentations/2016/south-bank/D2_P2_A_spectrum_scale_metadata_dark_V2a.pdf pages 37-41 Cheers, Alvise ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Caubet Serrabou Marc (PSI) [marc.caubet at psi.ch] Sent: Tuesday, March 26, 2019 4:39 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Hi all, according to several GPFS presentations as well as according to the man pages: Table 1. Block sizes and subblock sizes +-------------------------------+-------------------------------+ | Block size | Subblock size | +-------------------------------+-------------------------------+ | 64 KiB | 2 KiB | +-------------------------------+-------------------------------+ | 128 KiB | 4 KiB | +-------------------------------+-------------------------------+ | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | | MiB, 4 MiB | | +-------------------------------+-------------------------------+ | 8 MiB, 16 MiB | 16 KiB | +-------------------------------+-------------------------------+ A block size of 8MiB or 16MiB should contain subblocks of 16KiB. However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: [root at merlindssio01 ~]# mmlsfs merlin flag value description ------------------- ------------------------ ----------------------------------- -f 8192 Minimum fragment (subblock) size in bytes (system pool) 131072 Minimum fragment (subblock) size in bytes (other pools) -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes . . . -n 128 Estimated number of nodes that will mount file system -B 1048576 Block size (system pool) 16777216 Block size (other pools) . . . What am I missing? 
According to documentation, I expect this to be a fixed value, or it isn't at all? On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? Thanks a lot and best regards, Marc _________________________________________ Paul Scherrer Institut High Performance Computing Marc Caubet Serrabou Building/Room: WHGA/019A Forschungsstrasse, 111 5232 Villigen PSI Switzerland Telephone: +41 56 310 46 67 E-Mail: marc.caubet at psi.ch -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.childs at qmul.ac.uk Wed Mar 27 09:09:20 2019 From: p.childs at qmul.ac.uk (Peter Childs) Date: Wed, 27 Mar 2019 09:09:20 +0000 Subject: [gpfsug-discuss] mmlsquota output In-Reply-To: References: <245fe541e001b27016ea13287cee72e930330977.camel@qmul.ac.uk> Message-ID: <3c78ad05d319cdb56839a3e12407d645febbe255.camel@qmul.ac.uk> On Mon, 2019-03-25 at 09:52 +0000, Robert Horton wrote: > I don't know the answer to your actual question, but have you thought > about using the REST-API rather than parsing the command outputs? I > can > send over the Python stuff we're using if you mail me off list. Thanks, We don't currently run the REST-API, partly I've never got around to getting the monitoring overhead working, and working out which extra packages we need to go round our 300 nodes and install. Out cluster has been gradually upgraded over the years from 3.5 and we don't routinely install all the new packages the GUI needs on every node. It might be nice to see a list of which Spectrum Scale packages are needed for the different added value features in Scale. I'm currently working on re-writing the cli quota reporting program which was originally written in a combination of bash and awk. Its a strict Linux Cli util for reporting quota's and hence I'd prefer to avoid the overhead of using a Rest-API. With reference to the issue people reported not being able to run "mmlsfileset" as a user a few weeks ago, I've found a handy work-around using "mmlsattr" instead, and yes it does use the -Y flag all the time. I'd like to share the code, once its gone though some internal code review...... With reference to the other post, I will I think raise a PMR for this as it does not look like mmlsquota is working as documented. Thanks Peter Childs > > Rob > > On Mon, 2019-03-25 at 09:38 +0000, Peter Childs wrote: > > Can someone tell me I'm not reading this wrong. > > > > This is using Spectrum Scale 5.0.2-1 > > > > It looks like the output from mmlsquota is not what it says > > > > In the man page it says, > > > > mmlsquota [-u User | -g Group] [-v | -q] [-e] [-C ClusterName] > > [-Y] [--block-size {BlockSize | auto}] [Device[:Fileset] > > ...] > > > > however > > > > mmlsquota -u username fs:fileset > > > > Return the output for every fileset, not just the "fileset" I've > > asked > > for, this is same output as > > > > mmlsquota -u username fs > > > > Where I've not said the fileset. > > > > I can work around this, but I'm just checking this is not actually > > a > > bug, that ought to be fixed. > > > > Long story is that I'm working on rewriting our quota report util > > that > > used be a long bash/awk script into a more easy to understand > > python > > script, and I want to get the user quota info for just one > > fileset. > > > > Thanks in advance. 
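For what it's worth, a rough sketch of the parsing approach Peter describes above, filtering on the fileset client-side since the Device:Fileset argument behaves as he reports. It keys everything off the -Y HEADER record rather than fixed column positions; 'filesetname' is my guess at the relevant field name, so check it against the HEADER line your release actually prints.

import subprocess

def user_quota(user, device, fileset=None):
    """Parse 'mmlsquota -u <user> -Y <device>' and optionally keep one fileset.
    Field names come from the -Y HEADER record at run time; 'filesetname' is
    an assumed column name -- verify it against the real HEADER output."""
    out = subprocess.run(["/usr/lpp/mmfs/bin/mmlsquota", "-u", user, "-Y", device],
                         check=True, capture_output=True, text=True).stdout
    header, rows = None, []
    for line in out.splitlines():
        fields = line.strip().split(":")
        if "HEADER" in fields[:3]:
            header = fields                      # remember the column names
        elif header and len(fields) > 3:
            row = dict(zip(header, fields))      # map names to values for this record
            if fileset is None or row.get("filesetname") == fileset:
                rows.append(row)
    return rows

# e.g. user_quota("username", "fs", fileset="fileset")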
> > > > -- Peter Childs ITS Research Storage Queen Mary, University of London From A.Wolf-Reber at de.ibm.com Wed Mar 27 11:51:50 2019 From: A.Wolf-Reber at de.ibm.com (Alexander Wolf) Date: Wed, 27 Mar 2019 11:51:50 +0000 Subject: [gpfsug-discuss] mmlsquota output In-Reply-To: <3c78ad05d319cdb56839a3e12407d645febbe255.camel@qmul.ac.uk> References: <3c78ad05d319cdb56839a3e12407d645febbe255.camel@qmul.ac.uk>, <245fe541e001b27016ea13287cee72e930330977.camel@qmul.ac.uk> Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.15536731432843.png Type: image/png Size: 1134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.15536731432844.png Type: image/png Size: 6645 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.15536731432845.png Type: image/png Size: 1134 bytes Desc: not available URL: From Kevin.Buterbaugh at Vanderbilt.Edu Wed Mar 27 14:32:46 2019 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 27 Mar 2019 14:32:46 +0000 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: <83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> References: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch> <83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> Message-ID: <6604491E-7A94-4EEA-937C-0AA719324F78@vanderbilt.edu> Hi All, So I was looking at the presentation referenced below and it states - on multiple slides - that there is one system storage pool per cluster. Really? Shouldn?t that be one system storage pool per filesystem?!? If not, please explain how in my GPFS cluster with two (local) filesystems I see two different system pools with two different sets of NSDs, two different capacities, and two different percentages full??? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 26, 2019, at 11:27 AM, Dorigo Alvise (PSI) > wrote: Hi Marc, "Indirect block size" is well explained in this presentation: http://files.gpfsug.org/presentations/2016/south-bank/D2_P2_A_spectrum_scale_metadata_dark_V2a.pdf pages 37-41 Cheers, Alvise ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Caubet Serrabou Marc (PSI) [marc.caubet at psi.ch] Sent: Tuesday, March 26, 2019 4:39 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Hi all, according to several GPFS presentations as well as according to the man pages: Table 1. Block sizes and subblock sizes +-------------------------------+-------------------------------+ | Block size | Subblock size | +-------------------------------+-------------------------------+ | 64 KiB | 2 KiB | +-------------------------------+-------------------------------+ | 128 KiB | 4 KiB | +-------------------------------+-------------------------------+ | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | | MiB, 4 MiB | | +-------------------------------+-------------------------------+ | 8 MiB, 16 MiB | 16 KiB | +-------------------------------+-------------------------------+ A block size of 8MiB or 16MiB should contain subblocks of 16KiB. 
However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: [root at merlindssio01 ~]# mmlsfs merlin flag value description ------------------- ------------------------ ----------------------------------- -f 8192 Minimum fragment (subblock) size in bytes (system pool) 131072 Minimum fragment (subblock) size in bytes (other pools) -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes . . . -n 128 Estimated number of nodes that will mount file system -B 1048576 Block size (system pool) 16777216 Block size (other pools) . . . What am I missing? According to documentation, I expect this to be a fixed value, or it isn't at all? On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? Thanks a lot and best regards, Marc _________________________________________ Paul Scherrer Institut High Performance Computing Marc Caubet Serrabou Building/Room: WHGA/019A Forschungsstrasse, 111 5232 Villigen PSI Switzerland Telephone: +41 56 310 46 67 E-Mail: marc.caubet at psi.ch _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C5b28a9a0d39a47fd3f0608d6b208186a%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892145179836634&sdata=23F22sUiyCYEg0H3AdbkBAnhPpLVBVTh39zRr%2FLYCmc%3D&reserved=0 -------------- next part -------------- An HTML attachment was scrubbed... URL: From stockf at us.ibm.com Wed Mar 27 14:51:07 2019 From: stockf at us.ibm.com (Frederick Stock) Date: Wed, 27 Mar 2019 14:51:07 +0000 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: <6604491E-7A94-4EEA-937C-0AA719324F78@vanderbilt.edu> References: <6604491E-7A94-4EEA-937C-0AA719324F78@vanderbilt.edu>, <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch><83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> Message-ID: An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Wed Mar 27 14:57:25 2019 From: ulmer at ulmer.org (Stephen Ulmer) Date: Wed, 27 Mar 2019 10:57:25 -0400 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: <83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> References: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch> <83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> Message-ID: <6B2ACF47-294E-4EA7-BEF0-6FEC394CF57C@ulmer.org> This presentation contains lots of good information about file system structure in general, and GPFS in specific, and I appreciate that and enjoyed reading it. However, it states outright (both graphically and in text) that storage pools are a feature of the cluster, not of a file system ? which I believe to be completely incorrect. For example, it states that there is "only one system pool per cluster", rather than one per file system. Given that this was written by IBMers and presented at an actual users? group, can someone please weigh in on this? I?m asking because it represents a fundamental misunderstanding of a very basic GPFS concept, which makes me wonder how authoritative the rest of it is... 
-- Stephen > On Mar 26, 2019, at 12:27 PM, Dorigo Alvise (PSI) > wrote: > > Hi Marc, > "Indirect block size" is well explained in this presentation: > > http://files.gpfsug.org/presentations/2016/south-bank/D2_P2_A_spectrum_scale_metadata_dark_V2a.pdf > > pages 37-41 > > Cheers, > > Alvise > > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org ] on behalf of Caubet Serrabou Marc (PSI) [marc.caubet at psi.ch ] > Sent: Tuesday, March 26, 2019 4:39 PM > To: gpfsug main discussion list > Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks > > Hi all, > > according to several GPFS presentations as well as according to the man pages: > > Table 1. Block sizes and subblock sizes > > +-------------------------------+-------------------------------+ > | Block size | Subblock size | > +-------------------------------+-------------------------------+ > | 64 KiB | 2 KiB | > +-------------------------------+-------------------------------+ > | 128 KiB | 4 KiB | > +-------------------------------+-------------------------------+ > | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | > | MiB, 4 MiB | | > +-------------------------------+-------------------------------+ > | 8 MiB, 16 MiB | 16 KiB | > +-------------------------------+-------------------------------+ > > A block size of 8MiB or 16MiB should contain subblocks of 16KiB. > > However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: > > [root at merlindssio01 ~]# mmlsfs merlin > flag value description > ------------------- ------------------------ ----------------------------------- > -f 8192 Minimum fragment (subblock) size in bytes (system pool) > 131072 Minimum fragment (subblock) size in bytes (other pools) > -i 4096 Inode size in bytes > -I 32768 Indirect block size in bytes > . > . > . > -n 128 Estimated number of nodes that will mount file system > -B 1048576 Block size (system pool) > 16777216 Block size (other pools) > . > . > . > > What am I missing? According to documentation, I expect this to be a fixed value, or it isn't at all? > > On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? > > Thanks a lot and best regards, > Marc > _________________________________________ > Paul Scherrer Institut > High Performance Computing > Marc Caubet Serrabou > Building/Room: WHGA/019A > Forschungsstrasse, 111 > 5232 Villigen PSI > Switzerland > > Telephone: +41 56 310 46 67 > E-Mail: marc.caubet at psi.ch _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From eric.wonderley at vt.edu Wed Mar 27 15:20:58 2019 From: eric.wonderley at vt.edu (J. Eric Wonderley) Date: Wed, 27 Mar 2019 11:20:58 -0400 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: <6B2ACF47-294E-4EA7-BEF0-6FEC394CF57C@ulmer.org> References: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch> <83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> <6B2ACF47-294E-4EA7-BEF0-6FEC394CF57C@ulmer.org> Message-ID: mmlspool might suggest there's only 1 system pool per cluster. We have 2 clusters and it has id=0 on both. 
One of our clusters has 2 filesystems that have same id for two different dataonly pools: [root at cl001 ~]# mmlspool home all Name Id system 0 fc_8T 65537 fc_ssd400G 65538 [root at cl001 ~]# mmlspool work all Name Id system 0 sas_6T 65537 I know md lives in the system pool and if you do encryption you can forget about putting data into you inodes for small files On Wed, Mar 27, 2019 at 10:57 AM Stephen Ulmer wrote: > This presentation contains lots of good information about file system > structure in general, and GPFS in specific, and I appreciate that and > enjoyed reading it. > > However, it states outright (both graphically and in text) that storage > pools are a feature of the cluster, not of a file system ? which I believe > to be completely incorrect. For example, it states that there is "only one > system pool per cluster", rather than one per file system. > > Given that this was written by IBMers and presented at an actual users? > group, can someone please weigh in on this? I?m asking because it > represents a fundamental misunderstanding of a very basic GPFS concept, > which makes me wonder how authoritative the rest of it is... > > -- > Stephen > > > > On Mar 26, 2019, at 12:27 PM, Dorigo Alvise (PSI) > wrote: > > Hi Marc, > "Indirect block size" is well explained in this presentation: > > > http://files.gpfsug.org/presentations/2016/south-bank/D2_P2_A_spectrum_scale_metadata_dark_V2a.pdf > > pages 37-41 > > Cheers, > > Alvise > > ------------------------------ > *From:* gpfsug-discuss-bounces at spectrumscale.org [ > gpfsug-discuss-bounces at spectrumscale.org] on behalf of Caubet Serrabou > Marc (PSI) [marc.caubet at psi.ch] > *Sent:* Tuesday, March 26, 2019 4:39 PM > *To:* gpfsug main discussion list > *Subject:* [gpfsug-discuss] GPFS v5: Blocksizes and subblocks > > Hi all, > > according to several GPFS presentations as well as according to the man > pages: > > Table 1. Block sizes and subblock sizes > > +-------------------------------+-------------------------------+ > | Block size | Subblock size | > +-------------------------------+-------------------------------+ > | 64 KiB | 2 KiB | > +-------------------------------+-------------------------------+ > | 128 KiB | 4 KiB | > +-------------------------------+-------------------------------+ > | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | > | MiB, 4 MiB | | > +-------------------------------+-------------------------------+ > | 8 MiB, 16 MiB | 16 KiB | > +-------------------------------+-------------------------------+ > > A block size of 8MiB or 16MiB should contain subblocks of 16KiB. > > However, when creating a new filesystem with 16MiB blocksize, looks like > is using 128KiB subblocks: > > [root at merlindssio01 ~]# mmlsfs merlin > flag value description > ------------------- ------------------------ > ----------------------------------- > -f 8192 Minimum fragment (subblock) > size in bytes (system pool) > 131072 Minimum fragment (subblock) > size in bytes (other pools) > -i 4096 Inode size in bytes > -I 32768 Indirect block size in bytes > . > . > . > -n 128 Estimated number of nodes > that will mount file system > -B 1048576 Block size (system pool) > 16777216 Block size (other pools) > . > . > . > > What am I missing? According to documentation, I expect this to be a fixed > value, or it isn't at all? > > On the other hand, I don't really understand the concept 'Indirect block > size in bytes', can somebody clarify or provide some details about this > setting? 
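For reference, the 131072-byte fragment size in the mmlsfs output above does follow a fixed rule as far as I understand it (this is a hedged reading of the 5.0 documentation, not an official statement): the number of subblocks per block is set once per file system, from the pool with the smallest block size, and every other pool inherits that subblock count rather than the per-blocksize value in the table. A quick sanity check with the numbers quoted above:

echo $(( 1048576 / 8192 ))     # system pool: 1 MiB block / 8 KiB subblock = 128 subblocks per block
echo $(( 16777216 / 128 ))     # data pools: 16 MiB block / 128 subblocks  = 131072 bytes (128 KiB)

So the 16 KiB subblock in the table only applies when 8 MiB or 16 MiB is also the smallest block size anywhere in the file system. As for the indirect block size, that is the size of the metadata blocks used to hold lists of data block addresses for files too large to be described by the pointers in the inode itself; the presentation Alvise linked covers it in more detail.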
> > Thanks a lot and best regards, > Marc > _________________________________________ > Paul Scherrer Institut > High Performance Computing > Marc Caubet Serrabou > Building/Room: WHGA/019A > Forschungsstrasse, 111 > 5232 Villigen PSI > Switzerland > > Telephone: +41 56 310 46 67 > E-Mail: marc.caubet at psi.ch > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Wed Mar 27 15:52:43 2019 From: ulmer at ulmer.org (Stephen Ulmer) Date: Wed, 27 Mar 2019 11:52:43 -0400 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: References: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch> <83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch> <6B2ACF47-294E-4EA7-BEF0-6FEC394CF57C@ulmer.org> Message-ID: <52575FD1-82B7-48BF-A0FD-04C783C99B8F@ulmer.org> Hmmm? I was going to ask what structures are actually shared by "two" pools that are in different file systems, and you provided the answer before I asked. So all disks which are labelled with a particular storage pool name share some metadata: pool id, the name, possibly other items. I was confused because the NSD is labelled with the pool when it?s added to the file system ? not when it?s created. So I thought that the pool was a property of a disk+fs, not the NSD itself. The more I talk this out the more I think that pools aren?t real, but just another label that happens to be orthogonal to all of the other labels: Only disks have pools ? NSDs do not, because there is no way to give them one at creation time. Disks are NSDs that are in file systems. A disk is in exactly one file system. All disks that have the same "pool name" will have the same "pool id", and possibly other pool-related metadata. It appears that the disks in a pool have absolutely nothing in common other than that they have been labelled as being in the same pool when added to a file system, right? I mean, literally everything but the pool name/id could be different ? or is there more there? Do we do anything to pools outside of the context of a file system? Even when we list them we have to provide a file system. Does GPFS keep statistics about pools that aren?t related to file systems? (I love learning things, even when I look like an idiot?) -- Stephen > On Mar 27, 2019, at 11:20 AM, J. Eric Wonderley > wrote: > > mmlspool might suggest there's only 1 system pool per cluster. We have 2 clusters and it has id=0 on both. > > One of our clusters has 2 filesystems that have same id for two different dataonly pools: > [root at cl001 ~]# mmlspool home all > Name Id > system 0 > fc_8T 65537 > fc_ssd400G 65538 > [root at cl001 ~]# mmlspool work all > Name Id > system 0 > sas_6T 65537 > > I know md lives in the system pool and if you do encryption you can forget about putting data into you inodes for small files > > > > On Wed, Mar 27, 2019 at 10:57 AM Stephen Ulmer > wrote: > This presentation contains lots of good information about file system structure in general, and GPFS in specific, and I appreciate that and enjoyed reading it. > > However, it states outright (both graphically and in text) that storage pools are a feature of the cluster, not of a file system ? 
which I believe to be completely incorrect. For example, it states that there is "only one system pool per cluster", rather than one per file system. > > Given that this was written by IBMers and presented at an actual users? group, can someone please weigh in on this? I?m asking because it represents a fundamental misunderstanding of a very basic GPFS concept, which makes me wonder how authoritative the rest of it is... > > -- > Stephen > > > >> On Mar 26, 2019, at 12:27 PM, Dorigo Alvise (PSI) > wrote: >> >> Hi Marc, >> "Indirect block size" is well explained in this presentation: >> >> http://files.gpfsug.org/presentations/2016/south-bank/D2_P2_A_spectrum_scale_metadata_dark_V2a.pdf >> >> pages 37-41 >> >> Cheers, >> >> Alvise >> >> From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org ] on behalf of Caubet Serrabou Marc (PSI) [marc.caubet at psi.ch ] >> Sent: Tuesday, March 26, 2019 4:39 PM >> To: gpfsug main discussion list >> Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks >> >> Hi all, >> >> according to several GPFS presentations as well as according to the man pages: >> >> Table 1. Block sizes and subblock sizes >> >> +-------------------------------+-------------------------------+ >> | Block size | Subblock size | >> +-------------------------------+-------------------------------+ >> | 64 KiB | 2 KiB | >> +-------------------------------+-------------------------------+ >> | 128 KiB | 4 KiB | >> +-------------------------------+-------------------------------+ >> | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | >> | MiB, 4 MiB | | >> +-------------------------------+-------------------------------+ >> | 8 MiB, 16 MiB | 16 KiB | >> +-------------------------------+-------------------------------+ >> >> A block size of 8MiB or 16MiB should contain subblocks of 16KiB. >> >> However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: >> >> [root at merlindssio01 ~]# mmlsfs merlin >> flag value description >> ------------------- ------------------------ ----------------------------------- >> -f 8192 Minimum fragment (subblock) size in bytes (system pool) >> 131072 Minimum fragment (subblock) size in bytes (other pools) >> -i 4096 Inode size in bytes >> -I 32768 Indirect block size in bytes >> . >> . >> . >> -n 128 Estimated number of nodes that will mount file system >> -B 1048576 Block size (system pool) >> 16777216 Block size (other pools) >> . >> . >> . >> >> What am I missing? According to documentation, I expect this to be a fixed value, or it isn't at all? >> >> On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? 
>> >> Thanks a lot and best regards, >> Marc >> _________________________________________ >> Paul Scherrer Institut >> High Performance Computing >> Marc Caubet Serrabou >> Building/Room: WHGA/019A >> Forschungsstrasse, 111 >> 5232 Villigen PSI >> Switzerland >> >> Telephone: +41 56 310 46 67 >> E-Mail: marc.caubet at psi.ch _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Wed Mar 27 15:59:17 2019 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 27 Mar 2019 15:59:17 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL Message-ID: Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jfosburg at mdanderson.org Wed Mar 27 16:02:48 2019 From: jfosburg at mdanderson.org (Fosburgh,Jonathan) Date: Wed, 27 Mar 2019 16:02:48 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: Message-ID: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> Try mmeditacl. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [1553012336789_download] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. 
Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Wed Mar 27 16:19:03 2019 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 27 Mar 2019 16:19:03 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> Message-ID: Hi Jonathan, Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. Am I missing something? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan > wrote: Try mmeditacl. 
-- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [X] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org > on behalf of Buterbaugh, Kevin L > Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From TOMP at il.ibm.com Wed Mar 27 16:19:40 2019 From: TOMP at il.ibm.com (Tomer Perry) Date: Wed, 27 Mar 2019 16:19:40 +0000 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: <52575FD1-82B7-48BF-A0FD-04C783C99B8F@ulmer.org> References: <0081EB235765E14395278B9AE1DF34180A8403EB@MBX214.d.ethz.ch><83A6EEB0EC738F459A39439733AE8045268438A9@MBX214.d.ethz.ch><6B2ACF47-294E-4EA7-BEF0-6FEC394CF57C@ulmer.org> <52575FD1-82B7-48BF-A0FD-04C783C99B8F@ulmer.org> Message-ID: Hi, Not sure how will it work over the mailing list... Since its a popular question, I've prepared a slide explaining all of that - ( pasted/attached below, but I'll try to explain in text as well...). On the right we can see the various "layers": - OS disks ( what looks to the OS/GPFS as a physical disk) - its properties are size, media, device name etc. ( we actually won't know what media means, but we don't really care) - NSD: When introduced to GPFS, so later on we can use it for "something". Two interesting properties at this stage: name and through which servers we can get to it... - FS disk: When NSD is being added to a filesystem, then we start caring about stuff like type ( data, metadata, data+metadata, desconly etc.), to what pool we add the disk, what failure groups etc. That's true on a per filesystem basis. With the exception that nsd name must be unique across the cluster. All the rest is in a filesystem context. So: - Each filesystem will have its own "system pool" which will store that filesystem metadata ( can also store data - which of course belong to that filesystem..not others...not the cluster) - Pool exist just because several filesystem disks were told that they belong to that pool ( and hopefully there is some policy that brings data to that pool). And since filesystem disks, exist only in the context of their filesystem - a pool exist inside a single filesystem only ( other filesystems might have their own pools of course). Regards, Tomer Perry Scalable I/O Development (Spectrum Scale) email: tomp at il.ibm.com 1 Azrieli Center, Tel Aviv 67021, Israel Global Tel: +1 720 3422758 Israel Tel: +972 3 9188625 Mobile: +972 52 2554625 From: Stephen Ulmer To: gpfsug main discussion list Date: 27/03/2019 17:53 Subject: Re: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Sent by: gpfsug-discuss-bounces at spectrumscale.org Hmmm? I was going to ask what structures are actually shared by "two" pools that are in different file systems, and you provided the answer before I asked. So all disks which are labelled with a particular storage pool name share some metadata: pool id, the name, possibly other items. I was confused because the NSD is labelled with the pool when it?s added to the file system ? not when it?s created. So I thought that the pool was a property of a disk+fs, not the NSD itself. The more I talk this out the more I think that pools aren?t real, but just another label that happens to be orthogonal to all of the other labels: Only disks have pools ? NSDs do not, because there is no way to give them one at creation time. Disks are NSDs that are in file systems. A disk is in exactly one file system. All disks that have the same "pool name" will have the same "pool id", and possibly other pool-related metadata. It appears that the disks in a pool have absolutely nothing in common other than that they have been labelled as being in the same pool when added to a file system, right? I mean, literally everything but the pool name/id could be different ? or is there more there? 
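To make the layering Tomer describes concrete (the names below are made up, so treat this as a sketch rather than a recipe), the same stanza file is typically fed to both steps, but the pool assignment only means something once the disk joins a particular file system:

%nsd: nsd=nsd_data_01
  device=/dev/dm-12
  servers=nsdsrv01,nsdsrv02
  usage=dataOnly
  failureGroup=1
  pool=fc_8T

mmcrnsd -F stanza.txt          # creates the NSD: a name plus the servers that can reach it
mmadddisk fs0 -F stanza.txt    # usage/failureGroup/pool only take effect here, in fs0's context

If the same pool name shows up in a stanza added to a different file system, that is a separate pool that merely happens to share the name, which is also why mmlspool always takes a file system device as its first argument.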
Do we do anything to pools outside of the context of a file system? Even when we list them we have to provide a file system. Does GPFS keep statistics about pools that aren?t related to file systems? (I love learning things, even when I look like an idiot?) -- Stephen On Mar 27, 2019, at 11:20 AM, J. Eric Wonderley wrote: mmlspool might suggest there's only 1 system pool per cluster. We have 2 clusters and it has id=0 on both. One of our clusters has 2 filesystems that have same id for two different dataonly pools: [root at cl001 ~]# mmlspool home all Name Id system 0 fc_8T 65537 fc_ssd400G 65538 [root at cl001 ~]# mmlspool work all Name Id system 0 sas_6T 65537 I know md lives in the system pool and if you do encryption you can forget about putting data into you inodes for small files On Wed, Mar 27, 2019 at 10:57 AM Stephen Ulmer wrote: This presentation contains lots of good information about file system structure in general, and GPFS in specific, and I appreciate that and enjoyed reading it. However, it states outright (both graphically and in text) that storage pools are a feature of the cluster, not of a file system ? which I believe to be completely incorrect. For example, it states that there is "only one system pool per cluster", rather than one per file system. Given that this was written by IBMers and presented at an actual users? group, can someone please weigh in on this? I?m asking because it represents a fundamental misunderstanding of a very basic GPFS concept, which makes me wonder how authoritative the rest of it is... -- Stephen On Mar 26, 2019, at 12:27 PM, Dorigo Alvise (PSI) wrote: Hi Marc, "Indirect block size" is well explained in this presentation: http://files.gpfsug.org/presentations/2016/south-bank/D2_P2_A_spectrum_scale_metadata_dark_V2a.pdf pages 37-41 Cheers, Alvise From: gpfsug-discuss-bounces at spectrumscale.org [ gpfsug-discuss-bounces at spectrumscale.org] on behalf of Caubet Serrabou Marc (PSI) [marc.caubet at psi.ch] Sent: Tuesday, March 26, 2019 4:39 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks Hi all, according to several GPFS presentations as well as according to the man pages: Table 1. Block sizes and subblock sizes +-------------------------------+-------------------------------+ | Block size | Subblock size | +-------------------------------+-------------------------------+ | 64 KiB | 2 KiB | +-------------------------------+-------------------------------+ | 128 KiB | 4 KiB | +-------------------------------+-------------------------------+ | 256 KiB, 512 KiB, 1 MiB, 2 | 8 KiB | | MiB, 4 MiB | | +-------------------------------+-------------------------------+ | 8 MiB, 16 MiB | 16 KiB | +-------------------------------+-------------------------------+ A block size of 8MiB or 16MiB should contain subblocks of 16KiB. However, when creating a new filesystem with 16MiB blocksize, looks like is using 128KiB subblocks: [root at merlindssio01 ~]# mmlsfs merlin flag value description ------------------- ------------------------ ----------------------------------- -f 8192 Minimum fragment (subblock) size in bytes (system pool) 131072 Minimum fragment (subblock) size in bytes (other pools) -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes . . . -n 128 Estimated number of nodes that will mount file system -B 1048576 Block size (system pool) 16777216 Block size (other pools) . . . What am I missing? According to documentation, I expect this to be a fixed value, or it isn't at all? 
On the other hand, I don't really understand the concept 'Indirect block size in bytes', can somebody clarify or provide some details about this setting? Thanks a lot and best regards, Marc _________________________________________ Paul Scherrer Institut High Performance Computing Marc Caubet Serrabou Building/Room: WHGA/019A Forschungsstrasse, 111 5232 Villigen PSI Switzerland Telephone: +41 56 310 46 67 E-Mail: marc.caubet at psi.ch _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=mLPyKeOa1gNDrORvEXBgMw&m=bg9EailWWZuz9EdTQO1uOk21naHNDRFX4LSAi9ehmXU&s=fwy_H6JVRBfBQJWU_LfPyKtSsHaKnuRJ9DO-ghnKIaM&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 30588 bytes Desc: not available URL: From olaf.weiser at de.ibm.com Wed Mar 27 16:36:18 2019 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 27 Mar 2019 17:36:18 +0100 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> Message-ID: An HTML attachment was scrubbed... URL: From cblack at nygenome.org Wed Mar 27 16:07:04 2019 From: cblack at nygenome.org (Christopher Black) Date: Wed, 27 Mar 2019 16:07:04 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL Message-ID: <45C03412-61C6-4F19-BAB8-FF0143786044@nygenome.org> I don?t have a solution, just similar experience with mmputacl vs setfacl. IMO, needing to dump and reapply full ACLs rather than just specifying what is to be added is one of a few reasons mmputacl is inferior to setfacl. We do all our extended ACL manipulation with setfacl from a gpfs native client and keep filesystem acl sematics set to -k all rather than -k nfs4. I?d see if you can use setfacl or nfs4_setfacl. This might not work for your use case. Best, Chris From: on behalf of "Buterbaugh, Kevin L" Reply-To: gpfsug main discussion list Date: Wednesday, March 27, 2019 at 11:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Adding to an existing GPFS ACL Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. 
mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 ________________________________ This message is for the recipient?s use only, and may contain confidential, privileged or protected information. Any unauthorized use or dissemination of this communication is prohibited. If you received this message in error, please immediately notify the sender and destroy all copies of this message. The recipient should check this email and any attachments for the presence of viruses, as we accept no liability for any damage caused by any virus transmitted by this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jfosburg at mdanderson.org Wed Mar 27 16:33:07 2019 From: jfosburg at mdanderson.org (Fosburgh,Jonathan) Date: Wed, 27 Mar 2019 16:33:07 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org>, Message-ID: I misunderstood you. Pretty much what we've been doing is maintaining "ACL template" files based on how our filesystem hierarchy is set up. Basically, fileset foo has a foo.acl file that contains what the ACL is supposed to be. If we need to change the ACL, we modify that file with the new ACL and then pass it through a simple (and expensive, I'm sure) script. This wouldn't be necessary if in heritance flowed down on existing files and directories. If you have CIFS access, you can also use Windows to do this, but it is MUCH slower. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [1553012336789_download] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 11:19:03 AM To: gpfsug main discussion list Subject: [EXT] Re: [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi Jonathan, Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. Am I missing something? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan > wrote: Try mmeditacl. 
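A template-reapply script of that sort is presumably little more than find feeding mmputacl; a hedged sketch with made-up paths, using separate templates for directories and files since the inherit flags normally differ:

#!/bin/bash
# Reapply the fileset's template ACLs to everything under it (sketch only - test on a copy first).
FILESET=/gpfs/fs0/foo
find "$FILESET" -type d -print0 | xargs -0 -n 1 mmputacl -i /root/acl-templates/foo.dir.acl
find "$FILESET" -type f -print0 | xargs -0 -n 1 mmputacl -i /root/acl-templates/foo.file.acl

Expensive is right, though: that is one mmputacl invocation per object, and it only works when everything in the fileset is meant to end up with the same ACL, which Kevin's original note says is not the case for this particular fileset.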
-- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [X] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org > on behalf of Buterbaugh, Kevin L > Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. 
If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Wed Mar 27 16:52:37 2019 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 27 Mar 2019 16:52:37 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> Message-ID: <5328E360-D085-4C98-965B-76B95ADFFB42@vanderbilt.edu> Hi Jonathan, Thanks. We have done a very similar thing when we?re dealing with a situation where: 1) all files and directories in the fileset are starting out with the same existing ACL, and 2) all need the same modification made to them. Unfortunately, in this situation item 2 is true, but item 1 is _not_. That?s what?s making this one a bit thorny? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:33 AM, Fosburgh,Jonathan > wrote: I misunderstood you. Pretty much what we've been doing is maintaining "ACL template" files based on how our filesystem hierarchy is set up. Basically, fileset foo has a foo.acl file that contains what the ACL is supposed to be. If we need to change the ACL, we modify that file with the new ACL and then pass it through a simple (and expensive, I'm sure) script. This wouldn't be necessary if in heritance flowed down on existing files and directories. If you have CIFS access, you can also use Windows to do this, but it is MUCH slower. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [X] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org > on behalf of Buterbaugh, Kevin L > Sent: Wednesday, March 27, 2019 11:19:03 AM To: gpfsug main discussion list Subject: [EXT] Re: [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi Jonathan, Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. Am I missing something? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan > wrote: Try mmeditacl. 
-- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [X] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org > on behalf of Buterbaugh, Kevin L > Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. 
If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C06b6070313d74610e17208d6b2d34b57%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636893017903174312&sdata=OX51kSL5fs8CqW9u0y7MK1omYGqkx%2F3K%2Bwvn9iKjFM8%3D&reserved=0 -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Wed Mar 27 16:53:08 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Wed, 27 Mar 2019 16:53:08 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: <45C03412-61C6-4F19-BAB8-FF0143786044@nygenome.org> References: <45C03412-61C6-4F19-BAB8-FF0143786044@nygenome.org> Message-ID: Unless you have CES and SMB in which case you have to set -k nfs4. Well technically you can set it, create a share and then set it back. But then you can't create more shares. AFAIK SMB actually understands the POSIX ACL and represents it to Windows in some way (just don't try and change it from Windows). Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of cblack at nygenome.org [cblack at nygenome.org] Sent: 27 March 2019 16:07 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL I don?t have a solution, just similar experience with mmputacl vs setfacl. IMO, needing to dump and reapply full ACLs rather than just specifying what is to be added is one of a few reasons mmputacl is inferior to setfacl. We do all our extended ACL manipulation with setfacl from a gpfs native client and keep filesystem acl sematics set to -k all rather than -k nfs4. I?d see if you can use setfacl or nfs4_setfacl. This might not work for your use case. Best, Chris From: on behalf of "Buterbaugh, Kevin L" Reply-To: gpfsug main discussion list Date: Wednesday, March 27, 2019 at 11:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Adding to an existing GPFS ACL Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? 
Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 ________________________________ This message is for the recipient?s use only, and may contain confidential, privileged or protected information. Any unauthorized use or dissemination of this communication is prohibited. If you received this message in error, please immediately notify the sender and destroy all copies of this message. The recipient should check this email and any attachments for the presence of viruses, as we accept no liability for any damage caused by any virus transmitted by this email. From ckerner at illinois.edu Wed Mar 27 16:54:38 2019 From: ckerner at illinois.edu (Kerner, Chad A) Date: Wed, 27 Mar 2019 16:54:38 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL Message-ID: I have a python module that I am nearing the completion of for a project that wraps all of that. It also contains another python script for the easy manipulation of the ACLs from the command line. Once I have that wraped up, hopefully this week, I would be happy to share. Chad -- Chad Kerner ? ckerner at illinois.edu Senior Storage Engineer, Storage Enabling Technologies National Center for Supercomputing Applications University of Illinois, Urbana-Champaign From: on behalf of "Fosburgh,Jonathan" Reply-To: gpfsug main discussion list Date: Wednesday, March 27, 2019 at 11:13 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL Try mmeditacl. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 Error! Filename not specified. ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? 
Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jfosburg at mdanderson.org Wed Mar 27 16:59:18 2019 From: jfosburg at mdanderson.org (Fosburgh,Jonathan) Date: Wed, 27 Mar 2019 16:59:18 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: <5328E360-D085-4C98-965B-76B95ADFFB42@vanderbilt.edu> References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> , <5328E360-D085-4C98-965B-76B95ADFFB42@vanderbilt.edu> Message-ID: <056864a79b2443499efc8b6ffc769013@mdanderson.org> This is going to be difficult, regardless of the tool used. And it's made worse by inheritance not flowing automatically to existing files and directories. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [1553012336789_download] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 11:52:37 AM To: gpfsug main discussion list Subject: [EXT] Re: [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi Jonathan, Thanks. We have done a very similar thing when we?re dealing with a situation where: 1) all files and directories in the fileset are starting out with the same existing ACL, and 2) all need the same modification made to them. Unfortunately, in this situation item 2 is true, but item 1 is _not_. That?s what?s making this one a bit thorny? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:33 AM, Fosburgh,Jonathan > wrote: I misunderstood you. Pretty much what we've been doing is maintaining "ACL template" files based on how our filesystem hierarchy is set up. Basically, fileset foo has a foo.acl file that contains what the ACL is supposed to be. If we need to change the ACL, we modify that file with the new ACL and then pass it through a simple (and expensive, I'm sure) script. This wouldn't be necessary if in heritance flowed down on existing files and directories. 
If you have CIFS access, you can also use Windows to do this, but it is MUCH slower. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [X] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org > on behalf of Buterbaugh, Kevin L > Sent: Wednesday, March 27, 2019 11:19:03 AM To: gpfsug main discussion list Subject: [EXT] Re: [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi Jonathan, Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. Am I missing something? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan > wrote: Try mmeditacl. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [X] ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org > on behalf of Buterbaugh, Kevin L > Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. 
If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C06b6070313d74610e17208d6b2d34b57%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636893017903174312&sdata=OX51kSL5fs8CqW9u0y7MK1omYGqkx%2F3K%2Bwvn9iKjFM8%3D&reserved=0 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From nfalk at us.ibm.com Wed Mar 27 17:04:23 2019 From: nfalk at us.ibm.com (Nathan Falk) Date: Wed, 27 Mar 2019 12:04:23 -0500 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: <5328E360-D085-4C98-965B-76B95ADFFB42@vanderbilt.edu> References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> <5328E360-D085-4C98-965B-76B95ADFFB42@vanderbilt.edu> Message-ID: Hello Kevin, No, you're not missing something. GPFS doesn't provide a means of recursively modifying ACLs. It's not even all that easy to just modify one ACL for one file (it's either mmeditacl, or mmgetacl > /tmp/acl.txt; vi /tmp/acl.txt; mmputacl -i /tmp/acl.txt). 
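For a single file, that flow can be made non-interactive along these lines. This is only a sketch: the group name, the r-x permission bits, and the paths are placeholders, and the appended ACE copies the layout that mmgetacl itself prints.

#!/bin/bash
# Hedged sketch: append one group ACE to one file's existing NFSv4 ACL.
# "examplegrp" and the r-x permissions are placeholders; adjust as needed.
f="$1"
tmp=$(mktemp /tmp/acl.XXXXXX)

# Dump the current ACL in NFSv4 form.
mmgetacl -k nfs4 -o "$tmp" "$f"

# Append the new ACE to the dumped copy.
cat >> "$tmp" <<'EOF'
group:examplegrp:r-x-:allow
 (X)READ/LIST (-)WRITE/CREATE (-)APPEND/MKDIR (X)SYNCHRONIZE (X)READ_ACL (X)READ_ATTR (X)READ_NAMED
 (-)DELETE (-)DELETE_CHILD (-)CHOWN (X)EXEC/SEARCH (-)WRITE_ACL (-)WRITE_ATTR (-)WRITE_NAMED
EOF

# mmputacl expects the complete ACL, so write the merged copy back.
mmputacl -i "$tmp" "$f"
rm -f "$tmp"

Wrapped in a find loop, the same few commands become the recursive version discussed further down the thread.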
I've had a few queries along these lines over the years and decided to publish a little bit of a guide here: https://www-prd-trops.events.ibm.com/node/how-recursively-set-nfsv4-acls-gpfs-filesystem There's a sample script there for the recursive part, but that would still have to be tweaked in your case to append just a single ACE to the existing ACL rather than replace the whole ACL. Or as others have noted, export the fileset via NFS and go to an NFS client and use nfs4_setfacl instead. Thanks, Nate Falk IBM Spectrum Scale Level 2 Support Software Defined Infrastructure, IBM Systems E-mail: nfalk at us.ibm.com Find me on: From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 03/27/2019 12:53 PM Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Jonathan, Thanks. We have done a very similar thing when we?re dealing with a situation where: 1) all files and directories in the fileset are starting out with the same existing ACL, and 2) all need the same modification made to them. Unfortunately, in this situation item 2 is true, but item 1 is _not_. That?s what?s making this one a bit thorny? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:33 AM, Fosburgh,Jonathan wrote: I misunderstood you. Pretty much what we've been doing is maintaining "ACL template" files based on how our filesystem hierarchy is set up. Basically, fileset foo has a foo.acl file that contains what the ACL is supposed to be. If we need to change the ACL, we modify that file with the new ACL and then pass it through a simple (and expensive, I'm sure) script. This wouldn't be necessary if in heritance flowed down on existing files and directories. If you have CIFS access, you can also use Windows to do this, but it is MUCH slower. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 From: gpfsug-discuss-bounces at spectrumscale.org < gpfsug-discuss-bounces at spectrumscale.org> on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 11:19:03 AM To: gpfsug main discussion list Subject: [EXT] Re: [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi Jonathan, Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. Am I missing something? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan wrote: Try mmeditacl. 
-- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 From: gpfsug-discuss-bounces at spectrumscale.org < gpfsug-discuss-bounces at spectrumscale.org> on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. 
If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C06b6070313d74610e17208d6b2d34b57%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636893017903174312&sdata=OX51kSL5fs8CqW9u0y7MK1omYGqkx%2F3K%2Bwvn9iKjFM8%3D&reserved=0 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=p3ZFejMgr8nrtvkuBSxsXg&m=tWa7c7_Nu1t7-zUozpFd8c1XSV7N7TShOBelxQS3POM&s=Q_tZmc5wSfixdoNnqTzBUuG9b4iW2vMUOUHy7DZXdRU&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: From nfalk at us.ibm.com Wed Mar 27 17:07:29 2019 From: nfalk at us.ibm.com (Nathan Falk) Date: Wed, 27 Mar 2019 12:07:29 -0500 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org><5328E360-D085-4C98-965B-76B95ADFFB42@vanderbilt.edu> Message-ID: I think I gave an internal link. Try this instead: http://www.ibm.com/support/docview.wss?uid=ibm10716323 Nate Falk IBM Spectrum Scale Level 2 Support Software Defined Infrastructure, IBM Systems E-mail: nfalk at us.ibm.com Find me on: From: "Nathan Falk" To: gpfsug main discussion list Date: 03/27/2019 01:04 PM Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello Kevin, No, you're not missing something. GPFS doesn't provide a means of recursively modifying ACLs. It's not even all that easy to just modify one ACL for one file (it's either mmeditacl, or mmgetacl > /tmp/acl.txt; vi /tmp/acl.txt; mmputacl -i /tmp/acl.txt). I've had a few queries along these lines over the years and decided to publish a little bit of a guide here: https://www-prd-trops.events.ibm.com/node/how-recursively-set-nfsv4-acls-gpfs-filesystem There's a sample script there for the recursive part, but that would still have to be tweaked in your case to append just a single ACE to the existing ACL rather than replace the whole ACL. Or as others have noted, export the fileset via NFS and go to an NFS client and use nfs4_setfacl instead. Thanks, Nate Falk IBM Spectrum Scale Level 2 Support Software Defined Infrastructure, IBM Systems E-mail:nfalk at us.ibm.com Find me on: From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 03/27/2019 12:53 PM Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Jonathan, Thanks. We have done a very similar thing when we?re dealing with a situation where: 1) all files and directories in the fileset are starting out with the same existing ACL, and 2) all need the same modification made to them. Unfortunately, in this situation item 2 is true, but item 1 is _not_. That?s what?s making this one a bit thorny? Kevin ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu- (615)875-9633 On Mar 27, 2019, at 11:33 AM, Fosburgh,Jonathan wrote: I misunderstood you. Pretty much what we've been doing is maintaining "ACL template" files based on how our filesystem hierarchy is set up. Basically, fileset foo has a foo.acl file that contains what the ACL is supposed to be. If we need to change the ACL, we modify that file with the new ACL and then pass it through a simple (and expensive, I'm sure) script. This wouldn't be necessary if in heritance flowed down on existing files and directories. If you have CIFS access, you can also use Windows to do this, but it is MUCH slower. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 From: gpfsug-discuss-bounces at spectrumscale.org< gpfsug-discuss-bounces at spectrumscale.org> on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 11:19:03 AM To: gpfsug main discussion list Subject: [EXT] Re: [gpfsug-discuss] Adding to an existing GPFS ACL WARNING:This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi Jonathan, Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. Am I missing something? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu- (615)875-9633 On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan wrote: Try mmeditacl. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 From: gpfsug-discuss-bounces at spectrumscale.org< gpfsug-discuss-bounces at spectrumscale.org> on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING:This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? 
Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu- (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C06b6070313d74610e17208d6b2d34b57%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636893017903174312&sdata=OX51kSL5fs8CqW9u0y7MK1omYGqkx%2F3K%2Bwvn9iKjFM8%3D&reserved=0 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=p3ZFejMgr8nrtvkuBSxsXg&m=3civslLJ9p1g1obgFb08ZEV5pKUtHmsZfA1sB23rrOA&s=jEVB15lqgaHC0sRH4P3BNVs0PlGUHVPDWML3oS_xZBo&e= -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From chetkulk at in.ibm.com Wed Mar 27 18:24:26 2019 From: chetkulk at in.ibm.com (Chetan R Kulkarni) Date: Wed, 27 Mar 2019 23:54:26 +0530 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org><5328E360-D085-4C98-965B-76B95ADFFB42@vanderbilt.edu> Message-ID: Hi Kevin, Small script herewith (append.acl.sh ) - appends one group ace (append.acl) to all the files/dirs under . You may try it for small directory first to check it's usefulness in your case. (tried along the same lines as discussed by others - mmgetacl, append then mmputacl). $ cat append.acl # add ace as per your setup in this file group:bgroup1:r-x-:allow (X)READ/LIST (-)WRITE/CREATE (-)APPEND/MKDIR (X)SYNCHRONIZE (X)READ_ACL (X)READ_ATTR (X)READ_NAMED (-)DELETE (-)DELETE_CHILD (-)CHOWN (X)EXEC/SEARCH (-)WRITE_ACL (-)WRITE_ATTR (-)WRITE_NAMED $ cat append.acl.sh [[ $# -eq 1 ]] && dir=$1 || { echo "Usage: ./append.acl.sh "; exit 1; } appendAclFile="/tmp/append.acl" newAclFile="/tmp/new.acl" cd $dir for filename in $(find -follow | grep -v ^.$) do echo "Applying ACL to $filename..." mmgetacl -k nfs4 $filename -o $newAclFile cat $appendAclFile >> $newAclFile mmputacl $filename -i $newAclFile done rm -f $newAclFile $ chmod +x append.acl.sh $ ./append.acl.sh Usage: ./append.acl.sh $ time ./append.acl.sh /ibm/fs1/fset2 Applying ACL to ./dir30... Applying ACL to ./dir30/file10... Applying ACL to ./dir30/file9... ... ... $ Thanks, Chetan. From: "Nathan Falk" To: gpfsug main discussion list Date: 03/27/2019 10:37 PM Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL Sent by: gpfsug-discuss-bounces at spectrumscale.org I think I gave an internal link. Try this instead: http://www.ibm.com/support/docview.wss?uid=ibm10716323 Nate Falk IBM Spectrum Scale Level 2 Support Software Defined Infrastructure, IBM Systems IBM E-mail:nfalk at us.ibm.com Find me on:LinkedIn: https://www.linkedin.com/in/nathan-falk-078ba5125 Twitter: https://twitter.com/natefalk922 From: "Nathan Falk" To: gpfsug main discussion list Date: 03/27/2019 01:04 PM Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello Kevin, No, you're not missing something. GPFS doesn't provide a means of recursively modifying ACLs. It's not even all that easy to just modify one ACL for one file (it's either mmeditacl, or mmgetacl > /tmp/acl.txt; vi /tmp/acl.txt; mmputacl -i /tmp/acl.txt). I've had a few queries along these lines over the years and decided to publish a little bit of a guide here: https://www-prd-trops.events.ibm.com/node/how-recursively-set-nfsv4-acls-gpfs-filesystem There's a sample script there for the recursive part, but that would still have to be tweaked in your case to append just a single ACE to the existing ACL rather than replace the whole ACL. Or as others have noted, export the fileset via NFS and go to an NFS client and use nfs4_setfacl instead. Thanks, Nate Falk IBM Spectrum Scale Level 2 Support Software Defined Infrastructure, IBM Systems IBM E-mail:nfalk at us.ibm.com Find me on:LinkedIn: https://www.linkedin.com/in/nathan-falk-078ba5125 Twitter: https://twitter.com/natefalk922 From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 03/27/2019 12:53 PM Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Jonathan, Thanks. 
We have done a very similar thing when we?re dealing with a situation where: 1) all files and directories in the fileset are starting out with the same existing ACL, and 2) all need the same modification made to them. Unfortunately, in this situation item 2 is true, but item 1 is _not_. That?s what?s making this one a bit thorny? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu- (615)875-9633 On Mar 27, 2019, at 11:33 AM, Fosburgh,Jonathan wrote: I misunderstood you. Pretty much what we've been doing is maintaining "ACL template" files based on how our filesystem hierarchy is set up. Basically, fileset foo has a foo.acl file that contains what the ACL is supposed to be. If we need to change the ACL, we modify that file with the new ACL and then pass it through a simple (and expensive, I'm sure) script. This wouldn't be necessary if in heritance flowed down on existing files and directories. If you have CIFS access, you can also use Windows to do this, but it is MUCH slower. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 From: gpfsug-discuss-bounces at spectrumscale.org< gpfsug-discuss-bounces at spectrumscale.org> on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 11:19:03 AM To: gpfsug main discussion list Subject: [EXT] Re: [gpfsug-discuss] Adding to an existing GPFS ACL WARNING:This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi Jonathan, Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. Am I missing something? Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu- (615)875-9633 On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan wrote: Try mmeditacl. -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 From: gpfsug-discuss-bounces at spectrumscale.org< gpfsug-discuss-bounces at spectrumscale.org> on behalf of Buterbaugh, Kevin L Sent: Wednesday, March 27, 2019 10:59:17 AM To: gpfsug main discussion list Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL WARNING:This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. Hi All, First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. 
setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I?m the one writing the script! Thanks? Kevin ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu- (615)875-9633 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. 
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7C06b6070313d74610e17208d6b2d34b57%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636893017903174312&sdata=OX51kSL5fs8CqW9u0y7MK1omYGqkx%2F3K%2Bwvn9iKjFM8%3D&reserved=0 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=uic-29lyJ5TCiTRi0FyznYhKJx5I7Vzu80WyYuZ4_iM&m=ivmdoowntUbUm9ifHIf9wdvGUMfmSn_5krX1obsqqkU&s=3VRVobm0YuPyznasor5EQsdASSWQHckCwSfoY6FBg3I&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: 18498113.jpg Type: image/jpeg Size: 518 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: 18442256.jpg Type: image/jpeg Size: 638 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: 18963353.gif Type: image/gif Size: 1851 bytes Desc: not available URL: From INDULISB at uk.ibm.com Wed Mar 27 18:31:24 2019 From: INDULISB at uk.ibm.com (Indulis Bernsteins1) Date: Wed, 27 Mar 2019 18:31:24 +0000 Subject: [gpfsug-discuss] GPFS v5: Blocksizes and subblocks In-Reply-To: References: Message-ID: I'm the author of the presentation. I'll bow to Tomer's knowledge about how the internals of Spectrum Scale (GPFS) work. I've been working with GPFS since V1.3 so it was a bit of a shock to think I had a fundamental misunderstanding. In this case both viewpoints are actually equivalent because of the way Spectrum Scale works. Both ways of visualising what happens work in exactly the same way from a "user perspective". The 2 actions of allocating an NSD into a filesystem, and also allocating it into a storage pool occur as part of the same single atomic transaction. An NSD is either in both a filesystem and a storage pool, or it is in neither. You can visualise one part of the operation first- "allocate NSD into filesystem"- and then second part of the operation is"allocate into System storage pool within the filesystem" (Stephen's perspective). Or you can visualise the actions happening the other way around "allocate NSD into System storage pool within the cluster", then "allocate into filesystem" (Indulis' perspective). The output of mmdf always made me think of it in this way. Because the 2 transactions on the NSD- allocate to filesystem and allocate to storage pool- are atomic, and there is a 1:1 mapping in each operation, who cares? I can take the viewpoint that the NSD goes into a cluster-wide System pool, or someone else can take the view that there is a System pool per filesystem. 
There is no external way to distinguish which is right or wrong. The "visual and mental models" are different but it makes no nevermind in terms of how things work. Though now that I have had to think about it, it is simpler to visualise each filesystem having its own System pool, and the fact that Tomer says this is how it works internally is a good reason to change the visualisation as well :-D Regards, Indulis Bernsteins Systems Architect IBM New Generation Storage Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Wed Mar 27 18:47:24 2019 From: ulmer at ulmer.org (Stephen Ulmer) Date: Wed, 27 Mar 2019 14:47:24 -0400 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: <131058852bb14b529e7fa2bf6244b837@mdanderson.org> Message-ID: mmeditacl passes a temporary file containing the ACLs to $EDITOR. You can write $EDITOR if you want. :) -- Stephen > On Mar 27, 2019, at 12:19 PM, Buterbaugh, Kevin L > wrote: > > Hi Jonathan, > > Thanks for the response. I did look at mmeditacl, but unless I?m missing something it?s interactive (kind of like mmedquota is by default). If I had only a handful of files / directories to modify that would be fine, but in this case there are thousands of ACL?s that need modifying. > > Am I missing something? Thanks? > > Kevin > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 > >> On Mar 27, 2019, at 11:02 AM, Fosburgh,Jonathan > wrote: >> >> Try mmeditacl. >> >> -- >> Jonathan Fosburgh >> Principal Application Systems Analyst >> IT Operations Storage Team >> The University of Texas MD Anderson Cancer Center >> (713) 745-9346 >> >> From: gpfsug-discuss-bounces at spectrumscale.org > on behalf of Buterbaugh, Kevin L > >> Sent: Wednesday, March 27, 2019 10:59:17 AM >> To: gpfsug main discussion list >> Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL >> >> WARNING: This email originated from outside of MD Anderson. Please validate the sender's email address before clicking on links or attachments as they may not be safe. >> Hi All, >> >> First off, I have very limited experience with GPFS ACL?s, so please forgive me if I?m missing something obvious here. AFAIK, this is the first time we?ve hit something like this? >> >> We have a fileset where all the files / directories have GPFS NFSv4 ACL?s set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACL?s on different files / directories. Now we have the need to add to the existing ACL?s ? another group needs access. Unlike regular Unix / Linux ACL?s where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I?m not seeing where GPFS has a similar command ? i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That?s obviously problematic in this scenario. >> >> So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? 
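Picking up Stephen's $EDITOR suggestion above: mmeditacl hands the temporary ACL file to whatever $EDITOR names, so a tiny non-interactive "editor" can append an ACE fragment instead of opening vi. A rough sketch, with all paths and file names assumed:

#!/bin/bash
# Hedged sketch, saved as e.g. /usr/local/bin/append-ace.sh.
# mmeditacl invokes $EDITOR with the temporary ACL file as the argument;
# /root/acls/extra.ace is an assumed fragment holding the new group ACE.
cat /root/acls/extra.ace >> "$1"

It would then be driven per file with something like EDITOR=/usr/local/bin/append-ace.sh mmeditacl -k nfs4 <file>, looped over a find listing.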
That seems very cumbersome and error prone, especially if I?m the one writing the script! >> >> Thanks? >> >> Kevin >> ? >> Kevin Buterbaugh - Senior System Administrator >> Vanderbilt University - Advanced Computing Center for Research and Education >> Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 >> >> The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> https://nam04.safelinks.protection.outlook.com/?url=http%3A%2F%2Fgpfsug.org%2Fmailman%2Flistinfo%2Fgpfsug-discuss&data=02%7C01%7CKevin.Buterbaugh%40vanderbilt.edu%7Cb2040f23087c4aac0b4908d6b2cf11ed%7Cba5a7f39e3be4ab3b45067fa80faecad%7C0%7C1%7C636892999763011551&sdata=pXhLlRfQuJ4bKfib4bQBlWY4OP5WoZh1YQ%2Bjne2ycEY%3D&reserved=0 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan.buzzard at strath.ac.uk Wed Mar 27 22:58:15 2019 From: jonathan.buzzard at strath.ac.uk (Jonathan Buzzard) Date: Wed, 27 Mar 2019 22:58:15 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: Message-ID: On 27/03/2019 15:59, Buterbaugh, Kevin L wrote: [SNIP] > So am I missing something? Nope you are not missing anything. Setting NFSv4 ACL's on GPFS on *LINUX* has always been a steaming pile of Brontosaurus droppings. I have been on about since 2011... Search the mailing list archives. > ?Is there an easier solution than writing a > script which recurses over the fileset, gets the existing ACL with > mmgetacl and outputs that to a file, edits that file to add in the new > group, and passes that as input to mmputacl? ?That seems very cumbersome > and error prone, especially if I?m the one writing the script! > The best option is to get yourself a pSeries machine, install AIX and GPFS and use the native AIX ACL command to set the ACL's. This works because AIX has a mechanism for passing NFSv4 ACL's through it's VFS interface. The RichACL kernel patches for Linux to give it the same functionality went nowhere. Noting that the XFS and JFS file systems, internally have NFSv4 ACL support. The next best option is to export it as an NSFv4 file system and use a Linux/FreeBSD machine to set the ACL's (a Mac might even work). Expect performance to not be great. The next best option is to do an SMB export, mount it on Linux and use setcifsacl or map it on Windows and use cacls command. Some experimentation on working out exactly how NFSv4 ACLS get mapped to Windows ACLS would be advisable before a mass apply though. I don't think it is possible to set all NFSv4 ACL options using this method. 
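As a rough illustration of the NFS-export route mentioned a little earlier (not the SMB one), assuming the fileset is exported and mounted over NFSv4 on a client with nfs4-acl-tools installed; the mount point, group, domain, and permission letters are all assumptions to adjust:

# Hedged sketch: from an NFSv4 client, add a read/execute group ACE
# to everything under an exported fileset.
find /mnt/fset -exec nfs4_setfacl -a "A:g:examplegrp@yourdomain:rxtncy" {} \;

Directories that should pass the entry on to new content would also want the d and f inheritance flags in the flags field, and as noted above this is unlikely to be quick on a large tree.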
Probably the best option, but which is not publicly available is to use my modified version of the Linux nfs4_setacl command :-) You just modify nfs4_acl_for_path.c and nfs4_set_acl.c so they read/write the GPFS ACL struct and convert between the GPFS representation and the internal data structure used by the nfs4-acl-tools to hold NFSv4 ACL's. However I have not put it any where public because the GPFS API documentation is incomplete when it comes to ACL's. Consequently I can't be sure it is safe so I am not releasing it. I have two questions that I would like answering before I make it public. I will ask them for the third time, in hopes someone at IBM is actually listening. 1. What's the purpose of a special flag to indicate that it is smbd setting the ACL? Does this tie in with the undocumented "mmchfs -k samba" feature? 2. There is a whole bunch of stuff in the documentation about v4.1 ACL's. How does one trigger that. All I seem to be able to do is get POSIX and v4 ACL's. Do you get v4.1 ACL's if you set the file system to "Samba" ACL's or am I missing something. The other option is to write a script. Personally I would use Perl/Python rather than a shell script as it would be easier to read the result of mmgetacl into a buffer, append the extra bits and write it out again with mmputacl. It is horribly slow however if you have millions of files to iterate over. Trust me back in 2011 I had Perl scripts for setting ACL's. The final option though not quick would be for IBM to actually implement a mmsetfacl command. Surely it would not be too hard to take the code from AIX and modify the bits that set ACL's to use the GPFS API. Alternatively take the FreeBSD ACL commands and use them as a starting point. However I would not hold your breath for IBM if you expect them to fix the situation. JAB. -- Jonathan A. Buzzard Tel: +44141-5483420 HPC System Administrator, ARCHIE-WeSt. University of Strathclyde, John Anderson Building, Glasgow. G4 0NG From A.Wolf-Reber at de.ibm.com Thu Mar 28 08:24:52 2019 From: A.Wolf-Reber at de.ibm.com (Alexander Wolf) Date: Thu, 28 Mar 2019 08:24:52 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.15537580138880.png Type: image/png Size: 1134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.15537580138881.png Type: image/png Size: 6645 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.15537580138882.png Type: image/png Size: 1134 bytes Desc: not available URL: From alvise.dorigo at psi.ch Thu Mar 28 09:01:21 2019 From: alvise.dorigo at psi.ch (Dorigo Alvise (PSI)) Date: Thu, 28 Mar 2019 09:01:21 +0000 Subject: [gpfsug-discuss] Getting which files are store fully in inodes Message-ID: <83A6EEB0EC738F459A39439733AE804526844181@MBX214.d.ethz.ch> Hello, to get the list (and size) of files that fit into inodes what I do, using a policy, is listing "online" (not evicted) files that have zero allocated KB. Is this correct or there could be some exception I'm missing ? Does it exists a smarter/faster way ? thanks, Alvise -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From robert.horton at icr.ac.uk Thu Mar 28 11:21:20 2019 From: robert.horton at icr.ac.uk (Robert Horton) Date: Thu, 28 Mar 2019 11:21:20 +0000 Subject: [gpfsug-discuss] mmlsquota output In-Reply-To: References: <3c78ad05d319cdb56839a3e12407d645febbe255.camel@qmul.ac.uk> , <245fe541e001b27016ea13287cee72e930330977.camel@qmul.ac.uk> Message-ID: On Wed, 2019-03-27 at 11:51 +0000, Alexander Wolf wrote: The requirements for GUI & ReST API aren't actually that dramatic. It boils down to three things: 1) CCR. This is part of the base package but you need to migrate you config from server based to CCR which comes with the added benefit that your cluster config is now truly HA. 2) mmhealth/mmsysmonc. This comes with the base package as well. 3) mmperfmon. This comes with the pm-sensor packages that need to be distributed accross all nodes. And the pm-collector that sits on the GUI node (in large configuration you might want to have more than one collector). So from a package point of view it is basically just the pm-sensor package that needs to be installed all accross your cluster. It's possibly worth point out you don't actually *need* the mmhealth/mmperfmon stuff to make the GUI work if you just want the basic API functionality (for managing filesets/quotas etc) - although obviously the actual GUI won't look as attractive in that case. Rob -- Robert Horton | Research Data Storage Lead The Institute of Cancer Research | 237 Fulham Road | London | SW3 6JB T +44 (0)20 7153 5350 | E robert.horton at icr.ac.uk | W www.icr.ac.uk | Twitter @ICR_London Facebook: www.facebook.com/theinstituteofcancerresearch Making the discoveries that defeat cancer The Institute of Cancer Research: Royal Cancer Hospital, a charitable Company Limited by Guarantee, Registered in England under Company No. 534147 with its Registered Office at 123 Old Brompton Road, London SW7 3RP. This e-mail message is confidential and for use by the addressee only. If the message is received by anyone other than the addressee, please return the message to the sender by replying to it and then delete the message from your computer and network. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jfosburg at mdanderson.org Thu Mar 28 11:55:05 2019 From: jfosburg at mdanderson.org (Fosburgh,Jonathan) Date: Thu, 28 Mar 2019 11:55:05 +0000 Subject: [gpfsug-discuss] [EXT] Re: Adding to an existing GPFS ACL In-Reply-To: References: , Message-ID: Sometimes, you just need the right channels in order to get IBM to implement changes.... -- Jonathan Fosburgh Principal Application Systems Analyst IT Operations Storage Team The University of Texas MD Anderson Cancer Center (713) 745-9346 [1553012336789_download] The final option though not quick would be for IBM to actually implement a mmsetfacl command. Surely it would not be too hard to take the code from AIX and modify the bits that set ACL's to use the GPFS API. Alternatively take the FreeBSD ACL commands and use them as a starting point. However I would not hold your breath for IBM if you expect them to fix the situation. The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. 
If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.weiser at de.ibm.com Thu Mar 28 13:08:58 2019 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Thu, 28 Mar 2019 14:08:58 +0100 Subject: [gpfsug-discuss] Getting which files are store fully in inodes In-Reply-To: <83A6EEB0EC738F459A39439733AE804526844181@MBX214.d.ethz.ch> References: <83A6EEB0EC738F459A39439733AE804526844181@MBX214.d.ethz.ch> Message-ID: An HTML attachment was scrubbed... URL: From janfrode at tanso.net Thu Mar 28 14:06:40 2019 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 28 Mar 2019 15:06:40 +0100 Subject: [gpfsug-discuss] Getting which files are store fully in inodes In-Reply-To: <83A6EEB0EC738F459A39439733AE804526844181@MBX214.d.ethz.ch> References: <83A6EEB0EC738F459A39439733AE804526844181@MBX214.d.ethz.ch> Message-ID: I've been looking for a good way of listing this as well. Could you please share your policy ? -jf On Thu, Mar 28, 2019 at 1:52 PM Dorigo Alvise (PSI) wrote: > Hello, > to get the list (and size) of files that fit into inodes what I do, using > a policy, is listing "online" (not evicted) files that have zero allocated > KB. > Is this correct or there could be some exception I'm missing ? > Does it exists a smarter/faster way ? > > thanks, > > Alvise > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Thu Mar 28 15:09:51 2019 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 28 Mar 2019 10:09:51 -0500 Subject: [gpfsug-discuss] Getting which files are store fully in inodes In-Reply-To: References: <83A6EEB0EC738F459A39439733AE804526844181@MBX214.d.ethz.ch> Message-ID: This will select files that have some data, but it must all be in the inode, because no (part of) any data block has been assigned ... WHERE KB_ALLOCATED=0 AND FILE_SIZE>0 From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 03/28/2019 10:08 AM Subject: Re: [gpfsug-discuss] Getting which files are store fully in inodes Sent by: gpfsug-discuss-bounces at spectrumscale.org I've been looking for a good way of listing this as well. Could you please share your policy ? -jf On Thu, Mar 28, 2019 at 1:52 PM Dorigo Alvise (PSI) wrote: Hello, to get the list (and size) of files that fit into inodes what I do, using a policy, is listing "online" (not evicted) files that have zero allocated KB. Is this correct or there could be some exception I'm missing ? Does it exists a smarter/faster way ? 
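For reference, Marc's clause drops into a LIST rule roughly like this; the rule, list, and path names are invented, and the defer/list-file invocation is just one common way to drive it:

# Hedged sketch: list files whose data is held entirely in the inode.
cat > /tmp/inode.pol <<'EOF'
RULE EXTERNAL LIST 'inodefiles' EXEC ''
RULE 'inodeResident' LIST 'inodefiles'
     SHOW(VARCHAR(FILE_SIZE))
     WHERE KB_ALLOCATED = 0 AND FILE_SIZE > 0
EOF

# Scan the file system (or a subtree) and only write the match list.
mmapplypolicy /gpfs/fs1 -P /tmp/inode.pol -I defer -f /tmp/inodefiles

As Marc adds below, an HSM or HSM-like manager that migrates files would call for extra tests on the file's migration state before trusting the zero-allocation check alone.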
thanks, Alvise _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=cvpnBBH0j41aQy0RPiG2xRL_M8mTc1izuQD3_PmtjZ8&m=FJOFQY-uW5quZzSVAkmAGgRYQM6vt1fxlIgTdGe3QkE&s=IQr4_65VYJTwHvgili5gUV-d6ieys7IhsLBq5Aofg0U&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Thu Mar 28 15:23:03 2019 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 28 Mar 2019 10:23:03 -0500 Subject: [gpfsug-discuss] Getting which files are store fully in inodes In-Reply-To: References: <83A6EEB0EC738F459A39439733AE804526844181@MBX214.d.ethz.ch> Message-ID: ... WHERE KB_ALLOCATED=0 AND FILE_SIZE>0 Oh, if you are also working with an HSM or HSM-like manager that can migrate files -- then you might have to add some additional tests... -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Thu Mar 28 16:56:31 2019 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 28 Mar 2019 16:56:31 +0000 Subject: [gpfsug-discuss] Adding to an existing GPFS ACL In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: From L.R.Sudbery at bham.ac.uk Thu Mar 28 17:43:03 2019 From: L.R.Sudbery at bham.ac.uk (Luke Sudbery) Date: Thu, 28 Mar 2019 17:43:03 +0000 Subject: [gpfsug-discuss] Filesystem descriptor discs for GNR Message-ID: We have a 2 site Lenovo DSS-G based filesystem, which with (some of the data) replicated across the 2 sites. We'd like to a 3rd filesystem descriptor disk so we can lose one site and still have filesystem descriptor quorum. But this seems incompatible unless we make new vdisks - which we can't do on non native raid servers. We've added a new NSD which now has a free disc, but can't add that disk as descOnly. Adding the new descriptor disk to the system pool says: mmadddisk: A storage pool may not contain both vdisk NSDs and non-vdisk NSDs. Adding the new disk to a new system_desc pool says: mmadddisk: Disk usage descOnly is incompatible with storage pool system_desc. Tried adding config to specify the system_desc pool is for metadataOnly and it still says it's incompatible. There is no mention of descOnly disks here: https://www.ibm.com/support/knowledgecenter/en/SSFKCN_4.1.0/com.ibm.cluster.gpfs.v4r1.gpfs200.doc/bl1adv_planning.htm Is it possible add a non-vdisk descOnly NSD to a dss-g/GNR solution? Cheers, Luke -- Luke Sudbery Architecture, Infrastructure and Systems Advanced Research Computing, IT Services Room 103, Computer Centre G5, Elms Road Please note I don't work on Monday and work from home on Friday. From janfrode at tanso.net Thu Mar 28 18:33:17 2019 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 28 Mar 2019 19:33:17 +0100 Subject: [gpfsug-discuss] Filesystem descriptor discs for GNR In-Reply-To: References: Message-ID: There seems to be some changes or bug here.. But try usage=dataOnly pool=neverused failureGroup=xx.. and it should have the same function as long as you never place anything in this pool. -jf tor. 28. mar. 2019 kl. 18:43 skrev Luke Sudbery : > We have a 2 site Lenovo DSS-G based filesystem, which with (some of the > data) replicated across the 2 sites. 
We'd like to a 3rd filesystem > descriptor disk so we can lose one site and still have filesystem > descriptor quorum. But this seems incompatible unless we make new vdisks - > which we can't do on non native raid servers. > > We've added a new NSD which now has a free disc, but can't add that disk > as descOnly. > > Adding the new descriptor disk to the system pool says: > mmadddisk: A storage pool may not contain both vdisk NSDs and non-vdisk > NSDs. > > Adding the new disk to a new system_desc pool says: > mmadddisk: Disk usage descOnly is incompatible with storage pool > system_desc. > > Tried adding config to specify the system_desc pool is for metadataOnly > and it still says it's incompatible. > > There is no mention of descOnly disks here: > https://www.ibm.com/support/knowledgecenter/en/SSFKCN_4.1.0/com.ibm.cluster.gpfs.v4r1.gpfs200.doc/bl1adv_planning.htm > > Is it possible add a non-vdisk descOnly NSD to a dss-g/GNR solution? > > Cheers, > > Luke > > -- > Luke Sudbery > Architecture, Infrastructure and Systems > Advanced Research Computing, IT Services > Room 103, Computer Centre G5, Elms Road > > Please note I don't work on Monday and work from home on Friday. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From scale at us.ibm.com Thu Mar 28 20:08:40 2019 From: scale at us.ibm.com (IBM Spectrum Scale) Date: Thu, 28 Mar 2019 15:08:40 -0500 Subject: [gpfsug-discuss] Filesystem descriptor discs for GNR In-Reply-To: References: Message-ID: This is a known issue. The workaround is to use --force-nsd-mismatch option. Just make sure that the failure group is different from those used by the vdisk NSDs Regards, The Spectrum Scale (GPFS) team ------------------------------------------------------------------------------------------------------------------ If you feel that your question can benefit other users of Spectrum Scale (GPFS), then please post it to the public IBM developerWroks Forum at https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479. If your query concerns a potential software error in Spectrum Scale (GPFS) and you have an IBM software maintenance contract please contact 1-800-237-5511 in the United States or your local IBM Service Center in other countries. The forum is informally monitored as time permits and should not be used for priority messages to the Spectrum Scale (GPFS) team. From: Luke Sudbery To: "gpfsug-discuss at spectrumscale.org" Date: 03/28/2019 01:45 PM Subject: [gpfsug-discuss] Filesystem descriptor discs for GNR Sent by: gpfsug-discuss-bounces at spectrumscale.org We have a 2 site Lenovo DSS-G based filesystem, which with (some of the data) replicated across the 2 sites. We'd like to a 3rd filesystem descriptor disk so we can lose one site and still have filesystem descriptor quorum. But this seems incompatible unless we make new vdisks - which we can't do on non native raid servers. We've added a new NSD which now has a free disc, but can't add that disk as descOnly. Adding the new descriptor disk to the system pool says: mmadddisk: A storage pool may not contain both vdisk NSDs and non-vdisk NSDs. Adding the new disk to a new system_desc pool says: mmadddisk: Disk usage descOnly is incompatible with storage pool system_desc. 
From L.R.Sudbery at bham.ac.uk Thu Mar 28 23:01:24 2019
From: L.R.Sudbery at bham.ac.uk (Luke Sudbery)
Date: Thu, 28 Mar 2019 23:01:24 +0000
Subject: [gpfsug-discuss] Filesystem descriptor discs for GNR
In-Reply-To: References: Message-ID:

Thanks, that worked.

Will this be addressed in a future update?

Cheers,

Luke

--
Luke Sudbery
Architecture, Infrastructure and Systems
Advanced Research Computing, IT Services
Room 103, Computer Centre G5, Elms Road

Please note I don't work on Monday and work from home on Friday.

From: Truong Vu On Behalf Of scale at us.ibm.com
Sent: 28 March 2019 20:09
To: gpfsug main discussion list ; Luke Sudbery (IT Research Support)
Subject: Re: [gpfsug-discuss] Filesystem descriptor discs for GNR
From L.R.Sudbery at bham.ac.uk Thu Mar 28 23:03:52 2019
From: L.R.Sudbery at bham.ac.uk (Luke Sudbery)
Date: Thu, 28 Mar 2019 23:03:52 +0000
Subject: [gpfsug-discuss] Filesystem descriptor discs for GNR
In-Reply-To: References: Message-ID:

Thanks - I haven't tried this, but the docs for mmadddisk actually say "Only the system storage pool can contain metadataOnly, dataAndMetadata, or descOnly disks."

Truong's --force-nsd-mismatch option worked.

Cheers,

Luke

--
Luke Sudbery
Architecture, Infrastructure and Systems
Advanced Research Computing, IT Services
Room 103, Computer Centre G5, Elms Road

Please note I don't work on Monday and work from home on Friday.

From: gpfsug-discuss-bounces at spectrumscale.org On Behalf Of janfrode at tanso.net
Sent: 28 March 2019 18:33
To: gpfsug main discussion list
Subject: Re: [gpfsug-discuss] Filesystem descriptor discs for GNR
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From ckerner at illinois.edu Fri Mar 29 08:05:15 2019
From: ckerner at illinois.edu (Kerner, Chad A)
Date: Fri, 29 Mar 2019 08:05:15 +0000
Subject: [gpfsug-discuss] Adding to an existing GPFS ACL
In-Reply-To: References: Message-ID:

I got this code cleaned up a little bit and posted the initial version out to https://github.com/ckerner/ssacl.git . There are detailed examples in the README, but I listed a few quick ones below. I will be merging in the default ACL code, recursion, and backup/restoration of ACL branches hopefully over the next few days.

Usage Examples:
- List the ACLs on a file
  > ssacl --list /data/acl/testfile
- Set the ACL to the contents of a specified ACL file
  > ssacl --set -f acl.testfile /data/acl/testfile
- Add a user ACL to a file
  > ssacl --add -u ckerner -a='rwx-' /data/acl/testfile
- Add a group ACL to a file
  > ssacl --add -g nfsnobody -a='r-x-' /data/acl/testfile
- Clear the ACLs on a file, leaving the permissions alone
  > ssacl --clear /data/acl/testfile
- Clear the ACLs on a file and reset the permissions to 760
  > ssacl --clear -U=rwxc --GID=r-x- -O=---- /data/acl/testfile
- Delete a user ACL from a file
  > ssacl --del -u ckerner /data/acl/testfile
- Delete a group ACL from a file
  > ssacl --del -g nfsnobody /data/acl/testfile

Chad

--
Chad Kerner - ckerner at illinois.edu
Senior Storage Engineer, Storage Enabling Technologies
National Center for Supercomputing Applications
University of Illinois, Urbana-Champaign
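Until the recursion support Chad mentions is merged, one interim approach is to drive ssacl from find. This is only a sketch - the fileset path, group name and access string are placeholders, and it assumes ssacl handles directories the same way it handles files:

    # add a read/execute entry for an extra group to everything under a fileset;
    # try it on a small test subtree first
    find /gpfs/fs0/somefileset -exec ssacl --add -g newgroup -a='r-x-' {} \;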
From: "Kerner, Chad A"
Date: Wednesday, March 27, 2019 at 11:53 AM
To: gpfsug main discussion list
Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL

I have a python module that I am nearing the completion of for a project that wraps all of that. It also contains another python script for the easy manipulation of the ACLs from the command line. Once I have that wrapped up, hopefully this week, I would be happy to share.

Chad

--
Chad Kerner - ckerner at illinois.edu
Senior Storage Engineer, Storage Enabling Technologies
National Center for Supercomputing Applications
University of Illinois, Urbana-Champaign

From: on behalf of "Fosburgh,Jonathan"
Reply-To: gpfsug main discussion list
Date: Wednesday, March 27, 2019 at 11:13 AM
To: gpfsug main discussion list
Subject: Re: [gpfsug-discuss] Adding to an existing GPFS ACL

Try mmeditacl.

--
Jonathan Fosburgh
Principal Application Systems Analyst
IT Operations Storage Team
The University of Texas MD Anderson Cancer Center
(713) 745-9346

From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Buterbaugh, Kevin L
Sent: Wednesday, March 27, 2019 10:59:17 AM
To: gpfsug main discussion list
Subject: [EXT] [gpfsug-discuss] Adding to an existing GPFS ACL

Hi All,

First off, I have very limited experience with GPFS ACLs, so please forgive me if I'm missing something obvious here. AFAIK, this is the first time we've hit something like this... We have a fileset where all the files / directories have GPFS NFSv4 ACLs set on them. However, unlike most of our filesets where the same ACL is applied to every file / directory in the share, this one has different ACLs on different files / directories. Now we have the need to add to the existing ACLs - another group needs access.

Unlike regular Unix / Linux ACLs where setfacl can be used to just add to an ACL (i.e. setfacl -R g:group_name:rwx), I'm not seeing where GPFS has a similar command - i.e. mmputacl seems to expect the _entire_ new ACL to be supplied via either manual entry or an input file. That's obviously problematic in this scenario.

So am I missing something? Is there an easier solution than writing a script which recurses over the fileset, gets the existing ACL with mmgetacl and outputs that to a file, edits that file to add in the new group, and passes that as input to mmputacl? That seems very cumbersome and error prone, especially if I'm the one writing the script!

Thanks...

Kevin

Kevin Buterbaugh - Senior System Administrator
Vanderbilt University - Advanced Computing Center for Research and Education
Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From Robert.Oesterlin at nuance.com Fri Mar 29 12:22:07 2019
From: Robert.Oesterlin at nuance.com (Oesterlin, Robert)
Date: Fri, 29 Mar 2019 12:22:07 +0000
Subject: [gpfsug-discuss] IBM ESS: Error on ConnectX-4 card, "Power budget Exceeded"
Message-ID: <1737B795-61D9-4A65-9219-4BB3CEBB0D93@nuance.com>

Anyone come across this? Or know what I might look at to fix it? I did find a reference online for ConnectX-5 cards, but nothing for X-4. The SFP (Cisco SFP-10G-SR) is certified by Mellanox to work.

[ 3.807332] mlx5_core 0000:01:00.1: Port module event[error]: module 1, Cable error, Power budget exceeded
[ 7.454895] mlx5_core 0002:01:00.1: Port module event[error]: module 1, Cable error, Power budget exceeded
[ 1585.090315] mlx5_core 0002:01:00.0: Port module event[error]: module 0, Cable error, Power budget exceeded
[ 1610.688397] mlx5_core 0000:01:00.0: Port module event[error]: module 0, Cable error, Power budget exceeded

Bob Oesterlin
Sr Principal Storage Engineer, Nuance
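One low-level check that may help narrow this down is to read the transceiver EEPROM and compare what the module reports against what the port will supply. This is only a sketch: the PCI address comes from the log lines above, the interface name (enp1s0f1) is invented, and the exact fields shown depend on the module type:

    # find the interface name behind one of the complaining PCI functions
    ls /sys/bus/pci/devices/0000:01:00.1/net/

    # dump the module EEPROM (identifier, vendor, part number, power class if reported)
    ethtool -m enp1s0f1 | egrep -i 'identifier|vendor|part|power'

    # watch for the error again after reseating or swapping the optic/adapter
    dmesg | grep -i 'power budget'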
From jonathan.buzzard at strath.ac.uk Fri Mar 29 12:50:35 2019
From: jonathan.buzzard at strath.ac.uk (Jonathan Buzzard)
Date: Fri, 29 Mar 2019 12:50:35 +0000
Subject: [gpfsug-discuss] IBM ESS: Error on ConnectX-4 card, "Power budget Exceeded"
In-Reply-To: <1737B795-61D9-4A65-9219-4BB3CEBB0D93@nuance.com>
References: <1737B795-61D9-4A65-9219-4BB3CEBB0D93@nuance.com>
Message-ID: <9160c3539a8c9206376b350a1f2f5325eb9cd022.camel@strath.ac.uk>

On Fri, 2019-03-29 at 12:22 +0000, Oesterlin, Robert wrote:
> Anyone come across this? Or know what I might look at to fix it? I
> did find a reference online for ConnectX-5 cards, but nothing for X-4.
> The SFP (Cisco SFP-10G-SR) is certified by Mellanox to work.
>
> [ 3.807332] mlx5_core 0000:01:00.1: Port module event[error]:
> module 1, Cable error, Power budget exceeded
> [ 7.454895] mlx5_core 0002:01:00.1: Port module event[error]:
> module 1, Cable error, Power budget exceeded
> [ 1585.090315] mlx5_core 0002:01:00.0: Port module event[error]:
> module 0, Cable error, Power budget exceeded
> [ 1610.688397] mlx5_core 0000:01:00.0: Port module event[error]:
> module 0, Cable error, Power budget exceeded

My understanding is that the ConnectX-4 cards were QSFP28, not SFP? Well, at least ours are, and all the documentation says the same.

Presumably you have some sort of adaptor to make that work; could that be the issue?

JAB.

--
Jonathan A. Buzzard                         Tel: +44141-5483420
HPC System Administrator, ARCHIE-WeSt.
University of Strathclyde, John Anderson Building, Glasgow. G4 0NG

From S.J.Thompson at bham.ac.uk Fri Mar 29 12:54:10 2019
From: S.J.Thompson at bham.ac.uk (Simon Thompson)
Date: Fri, 29 Mar 2019 12:54:10 +0000
Subject: [gpfsug-discuss] IBM ESS: Error on ConnectX-4 card, "Power budget Exceeded"
In-Reply-To: <9160c3539a8c9206376b350a1f2f5325eb9cd022.camel@strath.ac.uk>
References: <1737B795-61D9-4A65-9219-4BB3CEBB0D93@nuance.com>, <9160c3539a8c9206376b350a1f2f5325eb9cd022.camel@strath.ac.uk>
Message-ID:

You mean like the Mellanox QSA?
https://store.mellanox.com/products/mellanox-mam1q00a-qsa-sp-single-pack-mam1q00a-qsa-ethernet-cable-adapter-40gb-s-to-10gb-s-qsfp-to-sfp.html

We use hundreds of these in CX-4 cards. But not with the SFP+ Bob mentioned, we normally use breakout cables from the SN2100 switches.

Simon

_______________________________________
From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Jonathan Buzzard [jonathan.buzzard at strath.ac.uk]
Sent: 29 March 2019 12:50
To: gpfsug main discussion list
Subject: Re: [gpfsug-discuss] IBM ESS: Error on ConnectX-4 card, "Power budget Exceeded"
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From duersch at us.ibm.com Fri Mar 29 14:54:08 2019
From: duersch at us.ibm.com (Steve Duersch)
Date: Fri, 29 Mar 2019 09:54:08 -0500
Subject: [gpfsug-discuss] Filesystem descriptor discs for GNR
In-Reply-To: References: Message-ID:

Yes, this will be fixed in the next ESS release.

Steve Duersch
Spectrum Scale/ESS
IBM Poughkeepsie, New York
From Mark.Bush at siriuscom.com Fri Mar 29 16:10:57 2019
From: Mark.Bush at siriuscom.com (Mark Bush)
Date: Fri, 29 Mar 2019 16:10:57 +0000
Subject: [gpfsug-discuss] A net new cluster
Message-ID:

So I have an ESS today and it's nearing end of life (just our own timeline/depreciation etc.) and I will be purchasing a new ESS. I'm working through the logistics of this.
Here is my thinking so far: This is just a big Data Lake and not an HPC environment Option A Purchase new ESS and set it up as a new cluster Remote mount old cluster and copy data (rsync or AFM) Option B Purchase new ESS and set it up to join the old cluster same filesystems Evacuate old NSDs and expel Old nodes Option C Purchase new ESS and set it up as a new cluster Point all new data to new cluster and age out old cluster over time (leverage TCT or Archive) Any/All feedback welcome Mark This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, notify us immediately by telephone and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: From cblack at nygenome.org Fri Mar 29 16:37:04 2019 From: cblack at nygenome.org (Christopher Black) Date: Fri, 29 Mar 2019 16:37:04 +0000 Subject: [gpfsug-discuss] A net new cluster Message-ID: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org> I suggest option A. We are facing a similar transition and are going with a new cluster and then 4.x cluster to 5.x cluster migration of existing data. An extra wrinkle for us is we are going to join some of the old hardware to the new cluster once it is free of serving current data. Main reasoning of the new cluster for us is to be able to make a fully V19+ filesystem with sub-block allocation. Our understanding from talking to IBM is that there is no way to upgrade a pool to be SBA-compatible, nor is it advisable to try to create a new pool or filesystem in same cluster and then migrate (partially because migrating between filesystems within a cluster with afm would require going through nfs stack afaik). Option B would be much less work and work fine if you are not concerned with things like sub-block allocation that can?t be gained easily with an in-place upgrade. Best, Chris From: on behalf of Mark Bush Reply-To: gpfsug main discussion list Date: Friday, March 29, 2019 at 12:11 PM To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] A net new cluster So I have an ESS today and it?s nearing end of life (just our own timeline/depreciation etc) and I will be purchasing a new ESS. I?m working through the logistics of this. 
Here is my thinking so far: This is just a big Data Lake and not an HPC environment Option A Purchase new ESS and set it up as a new cluster Remote mount old cluster and copy data (rsync or AFM) Option B Purchase new ESS and set it up to join the old cluster same filesystems Evacuate old NSDs and expel Old nodes Option C Purchase new ESS and set it up as a new cluster Point all new data to new cluster and age out old cluster over time (leverage TCT or Archive) Any/All feedback welcome Mark This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this communication in error, notify us immediately by telephone and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. ________________________________ This message is for the recipient?s use only, and may contain confidential, privileged or protected information. Any unauthorized use or dissemination of this communication is prohibited. If you received this message in error, please immediately notify the sender and destroy all copies of this message. The recipient should check this email and any attachments for the presence of viruses, as we accept no liability for any damage caused by any virus transmitted by this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From cowan at bnl.gov Fri Mar 29 17:03:25 2019 From: cowan at bnl.gov (Matt Cowan) Date: Fri, 29 Mar 2019 13:03:25 -0400 (EDT) Subject: [gpfsug-discuss] A net new cluster In-Reply-To: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org> References: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org> Message-ID: On Fri, 29 Mar 2019, Christopher Black wrote: ... > Main reasoning of the new cluster for us is to be able to make a fully V19+ filesystem with > sub-block allocation. > > Our understanding from talking to IBM is that there is no way to upgrade a pool to be > SBA-compatible, nor is it advisable to try to create a new pool or filesystem in same > cluster and then migrate (partially because migrating between filesystems within a cluster > with afm would require going through nfs stack afaik). ... Could you just use policy to migrate to the new pool? no afm required. "partially because"... what are the other reasons? From cblack at nygenome.org Fri Mar 29 17:45:40 2019 From: cblack at nygenome.org (Christopher Black) Date: Fri, 29 Mar 2019 17:45:40 +0000 Subject: [gpfsug-discuss] A net new cluster In-Reply-To: References: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org> Message-ID: <3C834EA5-1235-4FD3-832A-015A52DDF0AC@nygenome.org> Our understanding is there is no supported way to create an SBA-enabled pool other than as part of a filesystem that is already at a sufficient 5.x level (we've heard this a couple times in discussions with multiple IBMers). The other reasons for us include the perceived difficulty of in-place ESS upgrades compared to reprovisioning them as fresh building blocks in a new cluster. 
We've complained about how labor-intensive ESS upgrades are before - last time we did an upgrade there were over 15 manual steps per ESS I/O node (many with 10+ minute waits between them and a non-automated check step), in addition to the 25+ steps on the management server. This becomes a multi-week project when you have 20+ ESS I/O nodes and can only do 2-3 a day with careful babysitting. We'd be very interested in hearing the ESS upgrade experiences of other large sites, but perhaps this is diluting the original purpose of the thread.

Best,
Chris

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From makaplan at us.ibm.com Fri Mar 29 19:04:53 2019
From: makaplan at us.ibm.com (Marc A Kaplan)
Date: Fri, 29 Mar 2019 14:04:53 -0500
Subject: [gpfsug-discuss] A net new cluster
In-Reply-To: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org>
References: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org>
Message-ID:

I don't know the particulars of the case in question, nor much about ESS rules... But for a vanilla Spectrum Scale cluster:

1) There is nothing wrong or ill-advised about upgrading software and then creating a new version 5.x file system... keeping any older file systems in place.

2) I thought AFM was improved years ago to support GPFS native access -- it need not go through the NFS stack...?

Whereas you wrote: ... nor is it advisable to try to create a new pool or filesystem in same cluster and then migrate (partially because migrating between filesystems within a cluster with afm would require going through nfs stack afaik) ...

-------------- next part --------------
An HTML attachment was scrubbed...
URL: From cblack at nygenome.org Fri Mar 29 19:13:19 2019 From: cblack at nygenome.org (Christopher Black) Date: Fri, 29 Mar 2019 19:13:19 +0000 Subject: [gpfsug-discuss] A net new cluster In-Reply-To: References: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org> Message-ID: <0A0CFBD4-35C3-4334-B3FA-7DD7DD4AF7E9@nygenome.org> I was under the impression that AFM could not move between filesystems in the same cluster without going through NFS, but perhaps that is outdated. We?ve only used it in the past to move data between clusters. Could someone with more experience with AFM within a cluster comment? Our goal is to keep the same namespace/path for the users (and ideally keep the same filesystem name) by switching all clients to point to new cluster after a subset of (active) data had been migrated. Best, Chris From: on behalf of Marc A Kaplan Reply-To: gpfsug main discussion list Date: Friday, March 29, 2019 at 3:05 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] A net new cluster I don't know the particulars of the case in question, nor much about ESS rules... But for a vanilla Spectrum Scale cluster -. 1) There is nothing wrong or ill-advised about upgrading software and then creating a new version 5.x file system... keeping any older file systems in place. 2) I thought AFM was improved years ago to support GPFS native access -- need not go through NFS stack...? Whereas your wrote: ... nor is it advisable to try to create a new pool or filesystem in same cluster and then migrate (partially because migrating between filesystems within a cluster with afm would require going through nfs stack afaik) ... ________________________________ This message is for the recipient?s use only, and may contain confidential, privileged or protected information. Any unauthorized use or dissemination of this communication is prohibited. If you received this message in error, please immediately notify the sender and destroy all copies of this message. The recipient should check this email and any attachments for the presence of viruses, as we accept no liability for any damage caused by any virus transmitted by this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Fri Mar 29 19:58:21 2019 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Fri, 29 Mar 2019 14:58:21 -0500 Subject: [gpfsug-discuss] A net new cluster In-Reply-To: <0A0CFBD4-35C3-4334-B3FA-7DD7DD4AF7E9@nygenome.org> References: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org> <0A0CFBD4-35C3-4334-B3FA-7DD7DD4AF7E9@nygenome.org> Message-ID: If one googles "GPFS AFM Migration" you'll find several IBM presentations, white papers and docs on the subject. Also, I thought one can run AFM between two file systems, both file systems in the same cluster. Yes I'm saying local cluster == remote cluster == same cluster. I thought I did that some years ago, just as an exercise to set up AFM and I only had one cluster conveniently available... An expert will confirm or deny. -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Fri Mar 29 20:29:03 2019 From: S.J.Thompson at bham.ac.uk (Simon Thompson) Date: Fri, 29 Mar 2019 20:29:03 +0000 Subject: [gpfsug-discuss] A net new cluster In-Reply-To: References: <7F92D137-07D4-4136-9182-9C5E165704FE@nygenome.org> <0A0CFBD4-35C3-4334-B3FA-7DD7DD4AF7E9@nygenome.org>, Message-ID: I heard that works. But didn't think it was supported for production use. 
But I guess data migration isn't really production in that sense. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of makaplan at us.ibm.com [makaplan at us.ibm.com] Sent: 29 March 2019 19:58 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] A net new cluster If one googles "GPFS AFM Migration" you'll find several IBM presentations, white papers and docs on the subject. Also, I thought one can run AFM between two file systems, both file systems in the same cluster. Yes I'm saying local cluster == remote cluster == same cluster. I thought I did that some years ago, just as an exercise to set up AFM and I only had one cluster conveniently available... An expert will confirm or deny.