Commit 0179f82

Added info about Myriad new filesystem and migration
1 parent 6ef5a19 commit 0179f82

1 file changed

+95
-2
lines changed

mkdocs-project-dir/docs/Status_page.md

Lines changed: 95 additions & 2 deletions
@@ -641,7 +641,6 @@ This page outlines that status of each of the machines managed by the Research C
641641

642642
(This did not happen, the certificate was renewed in time).
643643

644-
#### Latest on Myriad
645644

646645
2025-03-05 - **Myriad new filesystem update**
647646

@@ -688,6 +687,100 @@ This page outlines that status of each of the machines managed by the Research C
688687
Further details on this to come. A similar process will take place when your Myriad user account
689688
expires.
690689

690+
#### Latest on Myriad
691+
692+
- 2025-03-31 - **Myriad new filesystem on Mon 7 April**
693+
694+
We are replacing Myriad's filesystem with the new one on Monday 7 April.
695+
696+
From 9am there will be a maintenance period during which you will not be able to log in while we switch over to the new filesystem and do final checks. We expect to give you access again later on 7 April, but if it takes longer you may not have access until Tuesday 8 April.
697+
698+
Only the data on ACFS will be backed up. Please note that the data on the new filesystem will not be backed up,
699+
not even data under `/home`.
700+
701+
**After the maintenance, you will have the following storage locations:**
702+
703+
- `/home/username`: your new home directory on the new filesystem; not backed up
704+
- `/home/username/Scratch`: we are keeping this as a directory in your home for convenience. It is now part of the same space, with the same rules and quota as the rest of your home directory, and is still not backed up.
705+
- `/home/username/oldhome`: a symbolic link to `/old_lustre/home/username`, your old home directory on the old filesystem; read-only (no changes to data possible)
706+
- `/home/username/oldscratch`: a symbolic link to `/old_lustre/scratch/username`, your old scratch directory on the old filesystem; read-only (no changes to data possible)
707+
- `/home/username/ACFS`: a symbolic link to your ACFS space where you can put data you want to be backed up; unaffected by this change
708+
709+
**Quotas**
710+
711+
You will have one quota of 1T on Myriad for your home by default. If you applied for a quota increase or renewal after 15 Feb 2025, we will set this quota on the new filesystem for you (this includes increases you applied for specifically for the new filesystem). If your quota increase dates from before 15 Feb 2025, it will not be recreated on the new filesystem straight away and you will need to apply for it again.
712+
713+
**New terms for quota increases and when data is deleted**
714+
715+
If you have a Myriad quota increase, it will have an expiry date, a maximum of one year after application. You will be sent a reminder one month before your quota expires, then further reminders two weeks before, one week before, on the expiry day, two weeks after and one month after. One month after a quota has expired, if you have not contacted us, we will move all the contents of your Myriad home and Scratch to another location. We will keep your data there for a further two months, to give you more time to contact us to reapply for quota or retrieve the data, and will then delete all of it. Please consider this when applying for your quota increase, especially if you expect to be away from UCL for a period of time.
716+
717+
This is to prevent Myriad's filesystem from filling up with data from large quota increases that have expired and were never cleaned up. As long as you stay in contact with us and reapply for your quota increases, this should allow us to keep granting them to you.
718+
**What you need to do**
719+
720+
**Step 1: Move your data**
721+
722+
After we tell you the new filesystem is available and you can log in, you will need to log in using your UCL
723+
password, as any ssh keys you might have set up will not be there on the new filesystem. You can then copy your
724+
`.ssh` directory from `/old_lustre/home/username` to your new home.
725+
726+
Commands you may wish to use for copying:
727+
728+
- Copy your old .ssh directory into your new home (~) recursively and preserve permissions:
729+
- `cp -rp ~/oldhome/.ssh ~`
730+
- Use rsync archive mode (recursively copy directories, copy symlinks without resolving, and preserve permissions, ownership and modification times) to copy your old .ssh directory into your new home:
731+
- `rsync -a ~/oldhome/.ssh ~`
732+
733+
`cp` only works locally, so you can use it for any filesystem that is directly mounted on Myriad (old_lustre, ACFS, RDSS, new filesystem). `rsync` can be used locally or between remote systems. If your transfer gets interrupted for any reason, running the same `rsync` command again resumes the copy without re-copying data that has already been transferred.
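
As a minimal sketch of the re-runnable copy described above, the helper below previews a transfer with `rsync --dry-run` and then performs it in archive mode. The function name `copy_dir` and the `data` directory in the usage comment are hypothetical placeholders, not something provided on Myriad.

```shell
# copy_dir <source-dir> <destination-dir>
# Hypothetical helper: copies <source-dir> itself into <destination-dir>.
copy_dir() {
    local src="$1" dest="$2"
    # Preview what would be transferred without copying anything.
    rsync -a --dry-run --itemize-changes "$src" "$dest" || return 1
    # Do the copy. Running this again after an interruption skips
    # files that have already been transferred unchanged.
    rsync -a "$src" "$dest"
}

# Example usage (placeholder directory name):
# copy_dir ~/oldhome/data ~
```

Because the source path has no trailing slash, `rsync` creates a `data` directory inside the destination rather than copying only its contents.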
734+
735+
If you want to keep your data for future reference, you will need to copy it off old_lustre onto the new filesystem, the ACFS, the RDSS (if you have a project), or other external systems.
736+
737+
If you have large amounts of data (particularly many small files) that you are intending to archive elsewhere, consider creating tar archives straight away instead of copying data recursively first.
738+
739+
- `tar -czvf /home/username/Scratch/myarchive.tar.gz /old_lustre/home/username/data` will (c)reate a g(z)ipped archive (v)erbosely in the specified (f)ilename location on the new filesystem. The contents will be everything in this user's old "data" directory.
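
A possible variation on the command above, as a sketch only: using `-C` to change into the parent directory first means the archive stores paths relative to it (`data/...`) rather than the full old path, and listing the archive afterwards is a quick check that it is readable. The function name `archive_dir` is a hypothetical helper, and the paths in the usage comment are the same placeholders as above.

```shell
# archive_dir <parent-dir> <dir-name> <archive-file>
# Hypothetical helper: archives <parent-dir>/<dir-name> into <archive-file>.
archive_dir() {
    local parent="$1" name="$2" archive="$3"
    # -C changes into the parent directory before archiving, so member
    # names inside the archive start with "<dir-name>/".
    tar -czf "$archive" -C "$parent" "$name" || return 1
    # Read the archive back to confirm it can be listed.
    tar -tzf "$archive" > /dev/null
}

# Example usage (placeholder paths):
# archive_dir /old_lustre/home/username data /home/username/Scratch/myarchive.tar.gz
# To unpack later into an existing directory:
# tar -xzf /home/username/Scratch/myarchive.tar.gz -C /home/username/restored
```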
740+
741+
**Step 2: Release your jobs**
742+
743+
All your jobs will be in held status (`hqw`) so that they do not fail while your data is not there. After you have copied the data that your jobs need to the new filesystem, you can release the hold on your queued jobs.
744+
745+
- `qrls $JOB_ID` will release a specific job ID, and `qrls all` will release all your jobs.
746+
747+
Released array jobs will have the first task in status `qw` and the rest in `hqw`; this is normal.
748+
749+
**FAQ: .bashrc and hidden files**
750+
751+
_Where is my old .bashrc? Why are my jobs failing with module errors now? Why are my python packages not there?_
752+
753+
Your `.bashrc` is at `/old_lustre/home/username/.bashrc`.
754+
755+
It begins with a dot, so it is a hidden file and will only show up with `ls -a` rather than `ls`. You can copy it across into your current home. If you put module load and unload commands in it, your jobs may now be hitting module conflicts, because without your old `.bashrc` only the default modules are loaded.
756+
757+
This also applies to other hidden files or directories you may have, like `.condarc` and `.python3local` where you may have environments defined or packages installed.
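
One cautious way to bring hidden files across is sketched below. The helper copies a single hidden file or directory from the old home and refuses to overwrite anything that already exists in the new one. The function name `restore_dotfile` is hypothetical and not a command provided on Myriad.

```shell
# restore_dotfile <old-home> <new-home> <name>
# Hypothetical helper: copies one hidden file or directory across,
# preserving permissions, without overwriting an existing copy.
restore_dotfile() {
    local old_home="$1" new_home="$2" name="$3"
    if [ -e "$new_home/$name" ]; then
        echo "$new_home/$name already exists, not overwriting" >&2
        return 1
    fi
    cp -rp "$old_home/$name" "$new_home/"
}

# Example usage:
# ls -a ~/oldhome                      # list hidden files in the old home
# restore_dotfile ~/oldhome ~ .bashrc
# restore_dotfile ~/oldhome ~ .python3local
```

If the helper reports that a file already exists, `diff ~/oldhome/.bashrc ~/.bashrc` shows what differs so you can merge the two by hand.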
758+
759+
**Project/shared spaces/hosted datasets**
760+
761+
If you have an existing project or shared space or hosted dataset (in `/lustre/projects` aka `/shared/ucl/depts`) you will need to reapply for this space and we will need to recreate it on the new filesystem. We'll be sending another email separately to people we have listed as the owners of spaces.
762+
763+
The same terms for quota increases and data deletion set out above will apply to project spaces so the data will
764+
be deleted if they expire and are not renewed, after reminders. Where an access group (e.g. `ag-archpc-groupname`,
765+
formerly `lgsh0xx`) has access to the space, we will contact all members of the group so they are still aware if
766+
the original owner of the space has left.
767+
768+
If you currently have an access group named like `lgsh0xx` for your space, as part of reapplying we will be
769+
transferring this to a new access group named `ag-archpc-groupname`. These groups can be updated within half
770+
an hour rather than only overnight.
771+
772+
**Removal of old filesystem**
773+
774+
`/old_lustre` will be available for three months, until 9am on Monday 7 July. It will then be unmounted and you will not be able to access it any longer.
775+
776+
**Myriad at risk for first week**
777+
778+
Myriad should be considered at risk for the first week of running a full workload on the new filesystem: there may be interruptions to service if anything goes wrong or needs retuning.
779+
780+
The new filesystem is GPFS (IBM Storage Scale) and not Lustre, for those who are interested.
781+
782+
Additional FAQs will be added here based on questions we receive.
783+
691784

692785
### Kathleen
693786

@@ -893,7 +986,7 @@ This page outlines that status of each of the machines managed by the Research C
893986
some of the oldest installs and modules that are not being used in jobs. We then intend to prune
894987
this further over time and add newer versions into the Spack stack.
895988

896-
**Documentation links**
989+
**Documentation links**
897990

898991
The SSL certificate for www.rc.ucl.ac.uk is due to expire at midnight on 12 Feb. We're getting a
899992
new one but there might be a gap if it can't be renewed in time. If that happens your browser may
