FASRC Cluster Storage Policy https://docs.rc.fas.harvard.edu/kb/fasrc-cluster-storage-policy/

Cluster storage offered and maintained by FASRC should only be used for research taking place on FASRC clusters.

Examples of data that can be stored on FASRC storage are:

  • Datasets
  • Code
  • Scientific software
  • Research results

Examples of data that should not be stored on FASRC storage include:

  • Clerical or lab administrative data
  • Data related to personnel, grant proposals, business operations, or general lab management
  • Data with personally identifiable or financial information 

FASRC storage filesystems on the Cannon cluster are only approved for Data Security Level 1 (DSL1) and DSL2 research data. DSL3 data must be stored in an approved FASSE cluster project. Research data classified as DSL4 must be stored on an appropriate storage solution that is approved for DSL4 sensitive data.*

*A limited number of DSL4 projects exist in their own isolated environments

If it comes to the attention of the FASRC staff that non-research-related data is being stored on FASRC systems, we will alert the lab’s PI.

To view alternative storage options for administrative data, please refer to the FASRC website.  Additional information is also provided on the Harvard Security website regarding Data Security levels.

Administrative Data Storage Options https://docs.rc.fas.harvard.edu/kb/administrative-data/

FAS Research Computing offers a wide variety of storage offerings designed to help meet the needs of the Harvard research community. However, FASRC storage is intended to house only research data. For other data types, such as administrative data, general lab documentation, or finance information, we recommend one or more of the storage options below. While FASRC does not directly support or manage these tools, they are offered with support from Harvard IT.

Google Drive (Administrative file storage)

  • Store, access, and share administrative files from any device, as data is stored in the cloud
  • My Drive is designed for personal documents to be shared individually. 
  • Shared drive is for collaborative files and folders owned at the team level.
  • Approved for Medium Risk Confidential (L3) data. 
  • Central Administration, GSE, HBS, HDS, HKS, HMS (Quad), and HSPH require local approval for account requests.
  • Google Drive storage request form: This form can be used to request a new personal or shared Google Drive, or request additional storage space for an existing Google Drive. The default Google Shared Drive storage limit is 5 GB. Eligible users may request additional storage using the form. The requests will need to be approved by FAS, as they are currently responsible for the costs.

Dropbox (Administrative file storage)

  • Secure data storage for faculty and research staff
  • Store, sync, and share data files in the cloud
  • Collaborate on documents with added version history

Microsoft 365

  • OneDrive (Administrative file storage)
    • Store work-related administrative files
    • Share with colleagues within and outside of Harvard
    • Users have 2TB of file storage
    • Approved for Medium Risk Confidential (L3) data.
  • Sharepoint (Document management tool)
    • Secure location to store, organize, share, and access information 
    • Default of 25 TBs of expandable storage
    • Approved for Medium Risk Confidential (L3) data.
    • Can be shared externally with non-Harvard faculty and staff. 
    • Storage for Level 4 data is available upon request with restrictions. 

Atlassian Confluence (Wiki web environment)

  • Website allowing users to create, edit, and publish content collaboratively through a web browser. 
  • Access can be given to individuals, groups, to the Harvard community, or to the public. 
  • Each page has its own URL, page history, access restrictions, file attachments, and comments.
Managing file access with ACLs https://docs.rc.fas.harvard.edu/kb/acls-facls/

What are ACLs?

Access Control Lists (ACLs or FACLs) are used to manage granular permissions on individual files or directories (folders). Primarily they are used to give one or more people access to a file or directory independent of the owner or group attached to it. For instance, a user who owns a file can grant another user write access to it without giving the group write access.

While FASRC manages a large number of groups for access to the many storage shares we host, we generally do not micromanage access at the individual level (e.g., ‘this person should have access but not that person’). In some cases involving multiple users, that might be best handled with an additional group, but that adds support overhead. For these individual-access scenarios, an ACL may be the best option when you need to grant someone else more granular permissions on files you own.

Usage

From a login node or other node on the cluster, see man getfacl and man setfacl for full documentation.

Please Note: Setting ACLs on Tier 1 Isilon shares is not currently supported.
Example 1: getfacl

This example shows how to see what FACLs are set.

[harvard_lab]# ls -l
total 12         (the '+' sign indicates that ACLs have been applied)
drwxrwsr-x+ 28 jharvard harvard_lab 4096 Feb 19 20:06 Everyone
drwxrwsr-x+ 7 jharvard harvard_lab 4096 May 9 20:03 Lab
drwxrwsr-x+ 74 jharvard harvard_lab 4096 Oct 10 2023 Users
[harvard_lab]# getfacl .
# file: .      (shows the FACL settings, in this case a group, harvard_lab_admins, has special permissions)
# owner: root
# group: harvard_lab
# flags: -s-
user::rwx
group::r-x
other::r-x
default:user::rwx
default:group::r-x
default:group:harvard_lab_admins:rwx
default:mask::rwx
default:other::r-x
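The default: entries shown above are directory defaults that new files and subdirectories inherit. As a rough sketch of how such an entry could be granted on a directory you own (the group name and the Lab directory are taken from the listing above; adjust to your own group and path):

[harvard_lab]# setfacl -m g:harvard_lab_admins:rwx Lab       (give the group access to the directory itself)
[harvard_lab]# setfacl -d -m g:harvard_lab_admins:rwx Lab    (set a default entry so new files and subdirectories inherit it)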

Example 2: setfacl

This example shows how to allow another user read/write/execute access to a file you own.

[jharvard]$ ls -l test
-rw-r--r--. 1 jharvard harvard_lab 30 May 17 16:41 test
[jharvard]$ setfacl -m u:testuser:rwx test
[jharvard]$ ls -l test
-rw-rwxr--+ 1 jharvard harvard_lab 30 May 17 16:41 test
[jharvard]$ getfacl test
# file: test
# owner: jharvard
# group: harvard_lab
user::rw-
user:testuser:rwx
group::r--
mask::rwx
other::r--
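If you later need to revoke access granted this way, setfacl can also remove entries. A brief sketch using the same file and user as above:

[jharvard]$ setfacl -x u:testuser test     (remove only the named-user entry for testuser)
[jharvard]$ setfacl -b test                (or remove all ACL entries, leaving just the base permissions)
[jharvard]$ getfacl test                   (confirm the change)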

Additional documentation about the use of ACLs can be found in the getfacl and setfacl man pages.

How to read your Storage Service Center bill https://docs.rc.fas.harvard.edu/kb/storage-service-center-bill/

PLEASE NOTE: Billing is NOT based on usage, but on the total allocation. A breakdown by user is provided for those who need to charge back to grants, but the total amount is the same whether the storage is empty or full.

If you have received a monthly storage bill from the FASRC Storage Service Center, this document will help you understand its contents. Please bear in mind that the bill may show more information than you expected; however, some labs need these details to route charges to the correct projects or billing codes.

Storage allocations are billed monthly and are based on the total size of the allocation(s). A lab with a single 16TB allocation on Tier 0, for example, has a total allocation cost of $800 per year [at the time of this writing]. That lab will pay 1/12 of that total amount each month, so approximately $66.66 each month*.

Additionally, the billing system provides a breakdown of usage by user. This includes any user with data in your storage and may include disabled users (see note below). Please note that while this breakdown shows a dollar amount per user and an attached 33-digit billing code, those values are solely for your information. They are useful for labs who need to charge back or split costs for projects/grants and/or determine who is using what percentage of their allocation.

Please Note: FASRC does not police lab storage and, as such, we do not remove data from your storage unless asked to by a PI or lab manager. Each lab is responsible for the management and cleanup of its stored data. If you have difficulty removing items from your storage, please contact FASRC for assistance.

* – We recognize that there will be rounding issues, and FASRC will endeavor to round down so that the cost of an allocation does not exceed the quoted yearly cost.


EXAMPLE BILL

The example below shows an allocation of 16TB of Tier 0 storage at $50 per TB per year
Total cost per year = $800

$4.16 per TB/month unit cost
Total billed per month = ~$66.56
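The monthly figure follows from rounding the per-TB rate down before multiplying:

$50 per TB/year ÷ 12 months = $4.1666… → rounded down to $4.16 per TB/month
16 TB × $4.16 per TB/month = $66.56 per month (slightly less than $800 ÷ 12 ≈ $66.67)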

PLEASE NOTE: Billing is NOT based on usage, but on the total allocation. A breakdown by user is provided for those who need to charge back to grants, but the total amount is the same whether the storage is empty or full.


RC Storage billing is ready for John Harvard Lab from Research Computing Storage for 7/2022.

This is your monthly bill for FASRC storage service center allocation(s). This bill represents 1/12 of your yearly allocation cost as it currently stands.

  • The cutoff day for billing changes is the 15th of each month. Any changes made to allocation after that will be reflected on the following month’s bill.
  • The monthly charge for RC Storage is based on the storage tier and the total size of the allocation, not the amount of used space. As a result, deleting files will not reduce your monthly cost unless the allocation size is also reduced.
  • In order to comply with grant management rules, the total charge for an allocation is applied to users in proportion to how much they have used. That distribution is described in the table below. If there are files owned by users who are no longer affiliated with the lab, it is the lab’s responsibility to remove those files, change their ownership, or move them to other storage.
  • For billing questions, to change billing codes, or any other queries, email RC Storage Billing

The following link to the FIINE system billing records listing can be used to approve records or make adjustments. If you do not approve or respond within 3 business days, the records will be considered approved.
[A LINK TO THIS BILL IN FIINE]

Please respond to RC Storage Billing if you have issues or questions.

For a detailed description and how to read your FASRC storage bill, please see: https://docs.rc.fas.harvard.edu/kb/storage-service-center-bill/

Total Monthly Charge  $66.56

The total charge is distributed across users who own files, in proportion to their usage, as shown below.

User Storage Product Account Description Charge

1. John Harvard lustre/tier0 370-XXXXX-XXXX-XXXXX-XXXXXX-XXXX-XXXXX $49.92 for 75% of 16TB of holylfs05/tier0 at $4.16 per TB

2. Jill Harvard lustre/tier0 370-XXXXX-XXXX-XXXXX-XXXXXX-XXXX-XXXXX $16.64 for 25% of 16TB of holylfs05/tier0 at $4.16 per TB
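For reference, each user’s line is simply their share of the allocation multiplied by the unit cost:

John: 75% of 16TB = 12 TB × $4.16 per TB/month = $49.92
Jill: 25% of 16TB = 4 TB × $4.16 per TB/month = $16.64
Total: $49.92 + $16.64 = $66.56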

Home directory full https://docs.rc.fas.harvard.edu/kb/home-directory-full/

If you receive an error that your home directory is full (“no space left on device [your home directory path]”) or an email saying you are over your 100GB home directory quota (there is also a 95GB soft quota that triggers notifications), you will need to remove files to get back under quota. Ordinarily you would just use rm to remove some files and reduce your usage.

However, a situation may arise where you are not just at quota but over it, and when trying to remove files using rm you receive the error:

rm: cannot remove ‘{somefilename}’: No space left on device

NOTE: Any time you are deleting files, it is important that you check to ensure you enter the correct filename. A good rule of thumb is to use the full path to a file (instead of relative path) or cd to the directory containing the file first. Also, be extra cautious when using wildcards like * .


WORKAROUND

A workaround is to identify one or more larger files to remove and reduce their size to zero bytes. Once enough space is recovered to get you under quota, you should be able to use rm again. To do this on files you’ve identified for removal, use the truncate command:

truncate -s 0 FILENAME
To truncate a single file down to zero bytes.

or

truncate -s 0 FILE1 FILE2 FILE3
To truncate multiple files down to zero bytes

Example:

truncate -s 0 ~/Jobfiles/August/job12653287.out
It’s always safer to use the full path to a file.
~ as used here is a Unix shortcut for the path to your home directory.

Alternatively, you can empty a large file by redirecting /dev/null into it with cat:

cat /dev/null > ~/mybig.file

/dev/null is a special Unix device that is always zero bytes in size
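To identify large files that are good candidates for truncation, a command along these lines can help (the 1G size threshold is just an example; adjust as needed):

find ~ -type f -size +1G -exec ls -lh {} + 2>/dev/null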

CHECKING USED SPACE

You can check to see your current total home directory (shortcut “~”) usage using the du command (plus the summary, total, and human-readable options) like so:

[jharvard@holylogin01 ~]$ du -sch ~
80G .
80G total

If you are on a login node, you can also view your computed quota directly like so:

[jharvard@holylogin01 ~]$ df -h .
Filesystem                      Size Used Avail Use% Mounted on
rcstorenfs:/ifs/rc_homes/home13 95G  80G  15G   85%  /n/home13

This shows that the user jharvard has used 80GB out of 100GB. The 95GB shown as the Size is called a soft quota. That is the threshold at which the system will notify you that you are going over.

Please bear in mind that, due to the size of our home directory filesystems, neither notification nor quota re-calculation is instantaneous. Both happen at some point during a 24-hour period. So if you manage to go over quota before the next calculation is done, you won’t receive a soft-quota notice. Similarly, it may take some time for your actual usage and computed quota to match again.

To find which files or directories are using the most space:

[jharvard@holylogin01 ~]$ cd ~
[jharvard@holylogin01 ~]$ du -h --max-depth=1 .
384K ./.config
232K ./Test
2.0G ./spack
...

This shows a listing of directories and their sizes; you can repeat the command further down the directory tree to find files to delete.

What To Do If du and df Are Different

If you find that df says you are at quota while du shows a lower number, you may have sparse files which are not accounted for properly by du (but are accounted for by the filesystem’s quota check).

To check for this, use the --apparent-size flag with du to show the logical size and find the culprit(s):

cd ~

du -ch --apparent-size --max-depth=1 .

This will show the logical size of directories and should point you to the cause.

Clearing Disk Space
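Before cleaning anything, it can help to see which of the hidden directories discussed below is actually taking up space (the paths listed are the usual suspects; some may not exist in your account):

du -sh ~/.local ~/.conda ~/.cache ~/.singularity 2>/dev/null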

.local

This hidden folder is located in your $HOME. It typically grows in size when pip install is executed outside of a conda/mamba or Python virtual environment, for example while in a Jupyter or interactive session. See the warning on pip installs. Such installations get placed in your ~/.local, which can fill up $HOME.

In order to manage ~/.local, do the following:

  1. Make sure that there are no jobs currently running under your profile by executing: squeue -u <username>
  2. Rename/turn-off .local folder: mv ~/.local ~/.local.off

.conda

Conda/Mamba environments can be quite bulky depending on the number and type of packages installed in them, and should be stored in your PI’s $LAB directory. See Mamba environments in a desired location. However, if such environments are created in the default location, $HOME/.conda, then the storage size of ~/.conda can be managed as follows:

  1. Remove unused packages and clear caches of Conda/Mamba:
    module load python
    source activate <your-environment>
    conda clean --all 
    This deletes only the unused packages in your ~/.conda/pkgs directory.
  2. Remove unused conda/mamba environments:
    module load python
    conda info -e
    conda env remove --name <your-environment>

.cache

This directory, located in ~/.cache, can grow in size with the general use of the cluster, Open OnDemand, or VSCode. In order to manage this space, do the following:

  1. Make sure that there are no jobs currently running under your profile by executing: squeue -u <username>
  2. Remove the folder: rm -r ~/.cache

.singularity

This folder, located in ~/.singularity, typically grows in size when a container is pulled to the cluster using Singularity. You can manage the size of this folder by cleaning its corresponding cache: singularity cache clean all

To keep the ~/.singularity folder from filling up, you can set a temporary directory and redirect the cache location while pulling a container. For example: export SINGULARITY_TMPDIR=/tmp/

Then, pull the container using Singularity as usual.
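As a hedged example, both SINGULARITY_TMPDIR and the standard SINGULARITY_CACHEDIR environment variable can be pointed away from $HOME before pulling; the lab scratch path below is illustrative, not a required location:

export SINGULARITY_TMPDIR=/tmp
export SINGULARITY_CACHEDIR=/n/netscratch/jharvard_lab/Lab/jharvard/singularity_cache
mkdir -p $SINGULARITY_CACHEDIR
singularity pull docker://ubuntu:22.04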

Note: It is best to run the above commands from a compute node in an interactive session, as login nodes are not performant and are meant for lightweight activities only. When checking squeue for running jobs, you can ignore the interactive session you are using to manage your $HOME size and proceed with turning off ~/.local or deleting ~/.cache.

New England Research Cloud (NERC) https://docs.rc.fas.harvard.edu/kb/nerc/

NERC, the New England Research Cloud, is operated by the Harvard University Research Computing (URC) and Boston University Research Computing Services groups and is part of the MOC-Alliance. NERC is a self-service cloud service available to many institutions in New England. Research groups can build out their own virtual machines (OpenStack) and data storage (NESE Ceph). The number and diversity of researchers in need of extensive computational capabilities is expanding, and the type of computation needed is shifting to require tools and elasticity that are best provided by cloud-native technologies, rather than (or in addition to) traditional high-performance computing.

Users and labs seeking cloud computing resources are encouraged to make use of NERC. While FASRC does not directly provide NERC services, FASRC users and labs are free to use NERC for their cloud computing needs.

NERC Links and Documentation

Need help or have questions?

Current NERC users or those with questions about the service should contact NERC via their online help system or by emailing help@nerc.mghpcc.org

Storage Service Center https://docs.rc.fas.harvard.edu/kb/storage-service-center/

This page provides the information needed for requesting and managing data storage allocations, and for billing. It is essential that you review the Storage Billing FAQ and Data Storage workflow pages. Below we provide information on the three software applications we use to help PIs (Coldfront), finance managers (FIINE), and lab/data managers (Starfish) perform their roles.

For more information about the storage options and their features, please visit our Data Storage page. Please feel free to reach out to us at rchelp@rc.fas.harvard.edu with any questions.

Storage Options and Cost

  • Active Lab Storage (Tier 0)
    • Description: High-performance Lustre
    • Cost per TB/month (rounded down): $4.16 ($50/yr)
    • Snapshot:** No
    • Disaster Recovery:* No
    • Available for new allocations: Yes
    • Maximum size per share: N/A
  • Active Lab Storage (Tier 1)
    • Description: Enterprise Isilon
    • Cost per TB/month (rounded down): $20.83 ($250/yr)
    • Snapshot:** Yes
    • Disaster Recovery:* Yes
    • Available for new allocations: Yes
    • Maximum size per share: N/A
  • Active Lab Storage (Tier 2)
    • Description: NFS Storage
    • Cost per TB/month (rounded down): $8.33 ($100/yr)
    • Snapshot:** No
    • Disaster Recovery:* Yes
    • Available for new allocations: Yes
    • Maximum size per share: 306 TB
  • Long-term Storage (Tape)
    • Description: Tape
    • Cost per TB/month (rounded down): $0.416 ($5/yr)
    • Snapshot:** No
    • Disaster Recovery:* No
    • Available for new allocations: Yes
    • Maximum size per share: 20TB per tape (19.48TB usable)

* Disaster Recovery (aka DR) means that the entire share can be restored in the event of hardware failure or other ‘disaster’. This DR copy is not accessible to the end user and is not suitable for recovering individual files that have been accidentally deleted.

** Snapshot means that a snapshot of the filesystem is taken periodically and retained for up to 7 days, and can be accessed by the end user to recover individual files that are accidentally deleted.

Important notes:

  • If you require more than approximately 100TB in a single allocation, please contact us first to discuss, or drop by Office Hours
  • Billing is through the FIINE system. See below for how to get access to FIINE
  • Billing is done monthly
  • The cutoff day for billing is the 15th of each month. Any changes made to an allocation after that will be reflected in the next month’s bill
  • The service center needs a 33-digit billing code to provide the service. It is an internal service, so we cannot create POs for billing
  • For help reading your current bill, see How to read your Storage Service Center bill
  • For billing questions and queries, email billing@rc.fas.harvard.edu

Request or Manage Storage Allocations

  • To request an allocation or manage an existing one, the PI (or a previously designated storage manager) should log into Coldfront
  • If you cannot access Coldfront, or you are a PI who would like to designate a storage manager for your lab, please contact FASRC

Lead Time for New Tape (Tier 3) Allocations

There is a minimum setup time of roughly 2 weeks for new tape allocations. This timeframe assumes we receive the completed tape setup from our service partner NESE without delay. Delays there are beyond our control and could increase lead time. Please note that any storage changes made after the 15th of the month will be reflected in the following month’s billing.

Billing for Allocations

Charges for storage allocations are billed monthly. Expense code(s) can be applied to each allocation and can be sub-divided among multiple billing codes.
See our Service Center FAQ for answers to common questions.

See also How to read your Storage Service Center bill

To manage billing for an existing allocation, see the instructions for expense code management and billing record review in FIINE:
https://ifx.rc.fas.harvard.edu/docs/user/fiine.html


Starfish – Data Management

Starfish – Scans the different storage servers to provide a view of usage details, metadata, and project-based tagging. View our Starfish documentation for more details about Starfish and examples of querying the data.

Coldfront – Lab and Allocation Management

Coldfront – Provides a view of PI projects and allocations. New allocations and updates to existing allocations can be requested using Coldfront. View the Coldfront article for more details about Coldfront and its use.

FIINE – FAS Instrument Invoicing Environment

FIINE – For lab/finance administrators to manage expense codes per project/user and view invoices.


FAQ – Storage Service Center

Since storage has grown tenfold in the past 5 years, hosting individual small-capacity storage server deployments has become unsustainable to manage. These individual server systems do not easily allow data shares to grow. Due to their small volume, many systems run above 85% utilization, which degrades performance.

Many systems also run beyond their original maintenance contract, which causes issues in sourcing parts for repairs; older systems (>5yr) increase the risk of catastrophic data loss. Some systems were purchased by PIs without a provision for backup systems, which has led to confusion about which data shares should have backups. Our prior backup methodology does not scale to these larger systems with hundreds of millions of files. Given this history, revamping our storage service offerings allows FASRC to maintain the equipment lifecycle and to project the overall growth in data capacity, datacenter space, and the professional staffing needed to maintain your research data assets safely.

Prior to the establishment of a Storage Service Center, we only offered a single NFS filesystem for your lab share; you now have the choice of four storage offerings to meet your technology needs. The tiers of service clearly define what type of backup your data will have. You only pay for the allocation capacity you need, rather than having to guess at the time of a server purchase and have the excess go unused.

Over time, you can request an increase to your allocation size. You will receive monthly reports on utilization from each tier to help you plan for future data needs. Some of our tiers will also have web-based data management tools that allow you to query different aspects of your data, tag your data, and see visual representations of your data.

Unlike the compute cluster, where resources are reserved and released, data is allocated to storage long-term. In addition, storage needs across various research domains are drastically different. Therefore, in the FY19 federal rate setting, FAS decided to remove the portion of FASRC dedicated to maintaining storage from the facilities part of the F&A. This allows FAS to run a Storage Service Center with costs that are allowable on federal awards.
Information about the storage offerings can be found on our Storage Services page and Storage Service Center document. Requests for storage allocations can be made through Coldfront (FASRC VPN required). Please keep in mind that large requests (>100 TB) might not all be available at the time of request; a smaller increase will be applied as we add more capacity in the coming months.
Yes, you can have allocations in different storage options to meet your needs and budget.

We have worked with RAS on two allocation methods to charge data storage to your grants: (1) the per-user allocation method, and (2) the per-project allocation method.

Per-user allocation method: You will be supplied a usage report by user for each tier. You can use the % of data associated with an individual as their share of the cost and apply the same cost distribution as their % effort on grants.

Example 1: PI has a 10 TB allocation on Tier 1 which researchers John and Jill use. The monthly bill for 10 TB of Tier 1 is $208.30 (at $20.83/TB/mo). The usage report shows 8 TB of total usage, where John’s usage is 60% and Jill’s is 40%. So the data charges associated with John are $124.98 and with Jill are $83.32. John is funded 50% on the NSF project and 50% on the NIH project, thus $62.49 should be allocated to each grant. Jill is funded 100% on the NSF project, thus $83.32 should be allocated to her NSF grant.

This method allows faculty to manage their data structures independently of specific projects, as multiple projects will be using some of the same data. Keep in mind that as researchers leave, there needs to be a plan for their data, as it will continue to appear in the usage reports.

Per-project allocation method: If you request a project-specific report, you will have a direct mapping of the data used by that project and can allocate the full cost to the cost distribution from its grants.

Example 2: PI requests a new 5 TB allocation on Tier 1 for an NSF-funded project. 10 users share this data. The monthly bill would include a Tier 1 charge of $104.15 (5 TB at $20.83/TB/mo). The entire $104.15 would be charged to the NSF grant.

This allows a very straightforward assignment between data and funding source. Reuse of the active parts of this data will need to be assigned to future projects.

Example 3: The above PI also has a 100 TB allocation on Tier 0 used for multiple projects with multiple funding sources. The usage report for Tier 0 would be provided per user as in Example 1 above, and the % effort allocation method would be used for Tier 0, while the Example 2 approach would be used for the new project on Tier 1.

As is common with other Science Operations Core Facilities, once funding sources have been established for bills, we will continue to direct bill those funds until the PI updates these distributions. For the first few months billing will be manual via email until the new Science Operations LIMS billing system is complete.

We suggest that a data management plan be established at the beginning of a project, so that a full data lifecycle can be mapped to the phases of your data. This helps identify data that will need to be kept long-term from the start, and helps mitigate data being orphaned when students and postdocs move on. If research data is being used again in a subsequent project, you should allocate funds to carry this data forward to the new project. As per federal regulations, you cannot pay for storage in advance. The Tier 3 tape service provides a location to deposit data longer term (7 years), which can meet many funding requirements.

Billing will be handled by Science Operations Core Facilities. You will be billed monthly for the TB allocation of space for each tier. Groups will have 2-3 business days to review the invoices before the charges are assessed via internal billing journals. By default, we will also provide you a usage report by user. A usage report per project is available by request and is best set up for new projects with new allocations.
It is your and your finance admin's responsibility to update or verify your 33-digit billing code for monthly billing in the FIINE system. If no other billing codes are designated, your start-up fund will be used. We are here to help you navigate these decisions: Contact FASRC

For billing inquiries or issues, please email billing@rc.fas.harvard.edu

For general storage issues, questions, or tier changes, please contact rchelp@rc.fas.harvard.edu

We have moved away from owned servers. Very few exceptions will be made. If circumstances warrant one, the request will be reviewed by the University Research Computing Officer, Sr. Director of Science Operations and Administrative Dean of Science. One possible exception is when storage must be adjacent to an instrument where data collection rates are beyond the capacity of 1 Gbps Ethernet (100 MB/s) for extended periods (days).

We will maintain existing physical servers while under warranty, which is typically 5-6 years from their purchase date. We will need a data migration plan to the appropriate tiers a few months prior to decommissioning the server.

Over FY22 we will be migrating whole filesystems at a time into the storage service center. All new space requests will be allocated on newly deployed storage in one of the Tiers.

Most owned storage servers have already been phased out.

Information about the storage offerings can be found on our Storage Services page and Storage Service Center document.

Requests for storage allocations changes can be made through Coldfront (FASRC VPN required). Select your project after logging in and you will find a "Request Allocation Change" button beside each allocation listing.

We ask that you plan ahead for future needs rather than repeatedly adding small increments. Please limit your allocation change requests to no more than once every 60 days for a particular allocation. We have a billing cutoff date of the 15th of the month, so changes requested after that date will not be reflected on your bill until the next billing cycle.

Mounting Storage on Desktop or Laptop https://docs.rc.fas.harvard.edu/kb/mounting-storage/

Some resources, mainly home directories, can be mounted on your local computer via Samba (aka SMB or CIFS). A few other shares have also been made mountable where deemed necessary, on a share-by-share basis.

Please note that most file systems, including lab directories on holylabs, cannot be mounted on your desktop.

  • Scratch – Scratch space (/n/netscratch) cannot be mounted in this manner. It is only available on the cluster. If you need to transfer data to/from scratch, you can use an SFTP or SCP client to connect to the cluster and then change to /n/netscratch/[your lab’s space] . You can also use Globus for large external transfers.
  • Active Lab Storage (Tier 0) and shares whose name begin with holy generally cannot be mounted. If you need to transfer data to/from such shares, you can use an SFTP or SCP client to connect to the cluster and then change to the path of your lab share. You can also use Globus for large external transfers.
  • Active Lab Storage (Tier 1) and (Tier 2) shares can be exported via Samba if a valid need exists. Otherwise, if you need to transfer data to/from such shares, you can use an SFTP or SCP client to connect to the cluster and then change to the path of your lab share . You can also use Globus for large external transfers.
  • Long-term storage (tape) cannot be mounted and is only accessible via Globus.


Connect to the VPN

If using a wireless connection, cluster storage must be routed through a VPN connection. If on a wired connection inside Harvard, the VPN client is not required. If you don’t already have it set up, follow the VPN setup instructions.
NOTE: If you have set up custom DNS on your computer, this may cause issues connecting to shares.

Find the filesystem path (if not known)

If you already know the path, skip to instructions for your operating system below.

Mounting your HOME DIRECTORY

If you have cluster access, you can mount your home directory as a drive. You can find the path to your home directory by using ssh to log in to the cluster. Use cd ~ to go to your home directory (on a Unix-like system, the ~ character is a shortcut for ‘my home directory’). Then type pwd to show where your home directory resides.

[jharvard@boslogin02 ~]$ cd ~
[jharvard@boslogin02 ~]$ pwd
/n/home08/jharvard

The home08 part is what we need in this example in order to construct the full path to your home directory. Since all home directories are mounted from the same server, we don’t need to figure that part out. The path you will need for connecting is therefore the combination of the server name, rcstore.rc.fas.harvard.edu, followed by the word homes to signify that it is a home directory, the sub-folder your home directory resides in (home08 in this example), and your RC username.

For this example, this would result in:
For Windows \\rcstore.rc.fas.harvard.edu\homes\home08\jharvard
For Mac OSX smb://rcstore.rc.fas.harvard.edu/homes/home08/jharvard

Mounting a LAB SHARE

First, it’s important to note that most lab shares are not mountable. Also, cluster-only filesystems such as scratch (netscratch, holyXXXX, or local scratch) are never mountable. If you need to transfer data to/from such shares, you can use an SFTP or SCP/Rsync, or use Globus for large external transfers.

If you don’t already know the path to your lab’s share and believe it should be mountable, asking a lab-mate or your PI for the path is the quickest option. If your lab-mates do not know and you believe your share is mountable, please contact FASRC.

You can also try and see if your lab’s share is mounted on our Samba cluster using the instructions further down the page, but with one of the following paths:
For Windows \\smbip.rc.fas.harvard.edu\ (browse for your lab’s name, see below)
For Mac OSX smb://rcstore.rc.fas.harvard.edu/ (browse for your lab’s name, see below)

If found, you can use the path shown there to mount your lab’s share.

 

Operating System-Specific Instructions

Macs use Connect to Server

If you’re using a Mac, go to a Finder window (or click on the desktop) and choose Go > Connect to Server from the menu. In the server address box, enter the server and path combination as described above, prepended with the smb:// protocol specifier (please note that Macs use “/” where Windows uses “\”). Using the example information above, the value might be smb://rcstore.rc.fas.harvard.edu/homes/home08/jharvard to mount the home directory of user jharvard. If you are mounting a lab share path, enter that instead (example: smb://smbip.rc.fas.harvard.edu/jharvard_lab). If you’ve selected the proper volume, you should get a login prompt. Use your FASRC credentials here, and note that you must include the rc\ domain specifier at the beginning of your user name (for example, rc\jharvard).

PCs use Map Network Drive

You can connect to shared storage on a Windows PC by using the Map Network Drive button in a file explorer window (click the yellow folder icon in the taskbar).

Select This PC in the left-hand pane.

Click Computer, which will present a drop-down menu, and then from that menu click Map network drive.

In the Map Network Drive utility, select a free Drive letter.
Then enter the combination of fileserver address and path in the Folder field.
For the example, in the home directory described above the path would be \\rcstore.rc.fas.harvard.edu\homes\home08\jharvard.
If you are mounting a lab share path, enter that instead (example: \\smbip.rc.fas.harvard.edu\jharvard_lab).
Make sure Connect using different credentials is checked.
Click Finish to continue.
Optional: If you want this drive to reconnect every time you log on to your computer, check Reconnect at sign-in. Just bear in mind that it will not reconnect if you are not on the VPN or your normal campus wired jack.


The reason you must check Connect using different credentials in the Map Network Drive box is that your PC has a local account (and a local ‘domain’) and it will default to that if you do not specify another username and domain. If you don’t select this checkbox and attempt to connect, it will try to authenticate with your local PC information and after three failed attempts will result in a lockout (FYI: Don’t worry, lockouts expire automatically in about 5 minutes).

When you are prompted to Enter Network Credentials, prepend your FASRC username with RC\ to specify you are connecting to the RC domain with your username.
Example: RC\jharvard means ‘Connect to the server and path I entered above as RC domain user jharvard’.

This will prompt you for your password. On Linux, shares can instead be mounted from the command line; if you get an error message about a read-only filesystem, it may be because mount.cifs is not installed on your system, and mounts made this way must be reissued every time you boot your computer. Some users prefer using smbclient to connect to Samba/SMB/CIFS shares; this is an optional package you will need to install on your own.
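As a rough sketch, connecting to the home-directory example above with smbclient might look like this (the share layout follows the rcstore example earlier on this page; your home path will differ):

smbclient //rcstore.rc.fas.harvard.edu/homes -U 'RC\jharvard' -D home08/jharvard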

SFTP/Filezilla 

If you are unable to mount your lab storage using one of the above methods, or your lab’s share is simply not available via Samba, you always have the option of using SFTP. This is especially useful if you need to maintain a different VPN connection and cannot connect to our VPN. SFTP to a login node does not require a FASRC/FAS VPN connection as login nodes use two-factor authentication.

We recommend FileZilla as a reliable, cross-platform SFTP client. Note that SFTP uses SSH and our two-factor authentication, so you will need to ensure you have OpenAuth set up, and that you have cluster access and a home directory. If you are unsure, SSH to a login node first. If you need to request cluster access, see our doc on adding groups/access.
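If you prefer the command line over FileZilla, scp or rsync over SSH works the same way. A brief sketch (the hostname is a FASRC login address and the destination path is illustrative; substitute your own lab path):

scp -r ./results jharvard@login.rc.fas.harvard.edu:/n/netscratch/jharvard_lab/Lab/jharvard/
rsync -avP ./results jharvard@login.rc.fas.harvard.edu:/n/netscratch/jharvard_lab/Lab/jharvard/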

Scratch https://docs.rc.fas.harvard.edu/kb/policy-scratch/

RC maintains a large, shared temporary scratch filesystem for general use by high input/output jobs at /n/netscratch.

Scratch Policy

Each lab is allotted 50TB of scratch space for use in its jobs. This is temporary high-performance space, and files older than 90 days will be deleted through a periodic purge process. This purge can run at any time, especially if scratch is getting full, and is also often run at the start of the month during our monthly maintenance period.

There is no charge to labs for netscratch, but please note that it is intended as volatile, temporary scratch space for transient data and is not backed up. If your lab has concerns or needs regarding scratch space or usage, please contact FASRC to discuss.

Modifying file times (via touch or another process) when initially placing data in scratch is allowed; however, doing so subsequently to avoid deletion is an abuse of the filesystem and will result in administrative action from FASRC. To reiterate, you may initially modify the file date(s) on new data so that they are not in the past, but you should not modify them further. If you have longer-term needs, please contact us to discuss options.


Networked, shared netscratch

The cluster has storage built specifically for high-performance temporary use. You can create your own folder inside the folder of your lab group. If that doesn’t exist or you do not have write access, contact us.
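For example, creating a personal working folder inside your lab group’s netscratch folder might look like this (the lab and user names are illustrative):

mkdir -p /n/netscratch/jharvard_lab/Lab/jharvard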

IMPORTANT: netscratch is temporary scratch space and has a strict retention policy.

Size limit: 4 PB total; 50TB max per group; 100M inodes
Availability: All cluster nodes. Cannot be mounted on desktops/laptops.
Backup: NOT backed up
Retention policy: 90-day retention policy. Deletions are run during the cluster maintenance window.
Performance: High. Appropriate for I/O-intensive jobs

/n/netscratch is short-term, volatile, shared scratch space for large data analysis projects.

The /n/netscratch filesystem is managed by the VAST parallel file system and provides excellent performance for HPC environments. This file system can be used for data intensive computation, but must be considered a temporary store. Files are not backed up and will be removed after 90 days. There is a 50TB total usage limit per group.

Large data analysis jobs that would fill your 100 GB of home space can be run from this volume. Once analysis has been completed, however, data you wish to retain must be moved elsewhere (lab storage, etc.). The retention policy will remove data from scratch storage after 90 days.


Local (per node), shared scratch storage

Each node contains a disk partition, /scratch, also known as local scratch, which is useful for storing large temporary files created while an application is running.

IMPORTANT: Local scratch is highly volatile and should not be expected to persist beyond job duration.

Size limit: Variable (200-300GB total typical). See actual limits per partition.
Availability: Node only. Cannot be mounted on desktops/laptops.
Backup: Not backed up
Retention policy: Not retained – highly volatile
Performance: High. Suited for limited I/O-intensive jobs

The /scratch volumes are directly connected (and therefore fast) temporary storage local to each compute node. Many high-performance computing applications use temporary files that go to /tmp by default. On the cluster we have pointed /tmp to /scratch. Network-attached storage, like home directories, is slow compared to disks directly connected to the compute node. If you can direct your application to use /scratch for temporary files, you can gain significant performance improvements and ensure that large files can be supported.

Though there are /scratch directories available to each compute node, they are not the same volume. The storage is specific to the host and is not shared. For details on the /scratch size available on the host belonging to a given partition, see the last column of the table on Slurm Partitions. Files written to /scratch from holy2a18206, for example, are only visible on that host. /scratch should only be used for temporary files written and removed during the running of a process. Although a ‘scratch cleaner’ does run hourly, we ask that at the end of your job you delete the files that you’ve created.

$SCRATCH VARIABLE

A global variable called $SCRATCH exists on the FASRC Cannon and FASSE clusters which allows scripts and jobs to point to a specific directory in scratch regardless of any changes to the name or path of the top-level scratch filesystem. This variable currently points to /n/netscratch so, for example, one could use the path $SCRATCH/jharvard_lab/Lab/jsmith in a job script. This will have the added benefit of allowing us to change scratch systems at any time without your having to modify your jobs/scripts.
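A minimal sketch of how $SCRATCH might be used in a job script (the partition, time, paths, input file, and program name are all illustrative, not prescribed values):

#!/bin/bash
#SBATCH -p test
#SBATCH -t 00:30:00
#SBATCH -c 1

# work in a job-specific folder under the lab's scratch space
WORKDIR=$SCRATCH/jharvard_lab/Lab/jharvard/$SLURM_JOB_ID
mkdir -p $WORKDIR
cp input.dat $WORKDIR/
cd $WORKDIR
./my_analysis input.dat > results.out

# copy results back to lab storage before the 90-day purge removes them
cp results.out /n/holylabs/LABS/jharvard_lab/Lab/jharvard/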

Home and Lab directories https://docs.rc.fas.harvard.edu/kb/cluster-storage/

Please see the Data Storage page on our main website for information on other storage options and for clarification on any unfamiliar terms.

This page describes the resources which are available to each user account and lab, and is a guide for day-to-day usage.

See also our Introduction to FASRC Cluster Storage video


Home Directories

Every user whose account has cluster access receives a 100 GB home directory. Your initial working directory upon login is your home directory. This location is for your use in storing everyday data for analysis, scripts, documentation, etc. This is also where files such as your .bashrc reside. Home directory paths look like /n/homeNN/XXXX, where homeNN is home01 through home15 and XXXX is your login. For example, user jharvard’s home directory might be /n/home12/jharvard. You can also reach your home directory using the Unix shortcut ~, as in: cd ~

  • Size Limit: 100GB (hard limit)
  • Availability: All cluster nodes. Can be mounted on desktops and laptops
  • Backup: Daily snapshots. Retained for 2 weeks
  • Retention policy: indefinite
  • Performance: Moderate. Not appropriate for I/O intensive or large numbers of jobs
  • Cost: Provided with each user account

Your home volume has good performance for most simple tasks. However, I/O intensive or large numbers of jobs should not be processed in home directories. Widespread computation against home directories would result in poor performance for all users. For these types of tasks, the scratch filesystem is better suited.

Home directories are private to your account and will follow you even if you change labs, but they are not suitable for storing HRCI/Level 3 or above data. Storing such data there is a violation of Harvard security policies.

Your home directory is exported from the disk arrays using CIFS/SMB file protocols and so can be mounted as a ‘shared drive’ on your desktop or laptop. Please see this help document for step-by-step instructions.

Home directories are backed up into a directory called .snapshot in your home. This directory will not appear in directory listings. You can cd or ls this directory specifically to make it visible. Contained herein are copies of your home directory in date specific subdirectories. Hourly, daily, weekly snapshots can be found. To restore older files, simply copy them from the correct .snapshot subdirectory. NOTE: If you delete your entire home directory, you will also delete the snapshots. This is not recoverable.
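For example, restoring an accidentally deleted file from a snapshot might look like this (the snapshot directory name and filename are placeholders; run ls ~/.snapshot to see the names actually available):

ls ~/.snapshot                                        # list available snapshot directories
cp ~/.snapshot/<snapshot-name>/myscript.py ~/         # copy the older version back into your home directory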

The 100 GB quota is enforced with a combination of a soft quota warning at 95GB and a hard quota stop at 100 GB. Hitting quota during processing of large data sets can result in file write/read failures or segmentation faults. You can check your usage using the df command: df -h ~ (where ~ is the unix shortcut for ‘home’)

TIP: If you are trying to determine usage, you might try using du -h -d 1 ~ to see the usage by sub-directory, or du -ax . | sort -n -r | head -n 20 to get a sorted list of the top 20 largest.

When attempting to log in while your home directory is over quota, you will often see an error about the .Xauthority file:

/usr/bin/xauth: error in locking authority file .Xauthority

Logging into NX or another virtual service will also fail, as the service cannot write to your home directory.

When at or over quota, you will need to remove unneeded files. Home directory quotas are global and cannot be increased for individuals. You may be able to use lab or scratch space to assist with copying or moving files from your home directory to free up space.



Lab Directories

Each lab which uses the cluster receives a 4 TB lab directory (as of 2022 – these will reside in /n/holylabs/LABS). This location is for each lab group’s use in storing everyday data for analysis, scripts, documentation, etc. Each such lab will have a directory on our high-performance scratch filesystem (see below).

  • Size Limit: 4TB (hard limit), 1 million files
  • Availability: All cluster nodes. Cannot be mounted on desktops and laptops
  • Backup: Highly redundant, no backups
  • Retention policy: Duration of the lab group
  • Performance: Moderate. Not appropriate for I/O intensive or large numbers of jobs
  • Cost: Provided with each lab group

Lab directories have good performance for most simple tasks. However, I/O intensive or large numbers of jobs should not be processed in lab directories. Widespread computation against lab directories would result in poor performance for all users. For these types of tasks, the scratch filesystem is better suited.

This lab directory is owned by the lab’s PI and is intended only to be used for research data on the cluster. Research storage should not be used for administrative files and data.

Lab directories are not suitable for storing HRCI/Level 3 or above data. Storing such data there is a violation of Harvard security policies.

The 4 TB quota is enforced with a combination of a soft quota warning and a hard quota stop at 4 TB. Hitting quota during processing of large data sets can result in file write/read failures or segmentation faults. If your lab requires additional storage, see our Data Storage page for a list of available storage options.
