Changes between Initial Version and Version 1 of Workshops/IntroToHpc2015/FileTransfer


Timestamp:
10/12/15 15:52:25 (9 years ago)
Author:
pdejesus

= File Transfer =

== Transferring Files ==
=== Linux and Mac Command Line ===
You may transfer files between your workstation and the cluster on the command line using the ''scp'' command. This command behaves much like the basic Linux ''cp'' command, except that either the source or the destination may be a remote address. The syntax is as follows:
{{{
scp source_file destination_file
}}}

The following command copies the file ''testfile'' from the ''/home/remoteuser/'' directory on the remote server ''cypress1.tulane.edu'' to the current working directory on your workstation, which is represented by a period (".").
{{{
user@localhost> scp remoteuser@cypress1.tulane.edu:/home/remoteuser/testfile .
}}}

To copy a directory along with all of its contents, add the ''-r'' (recursive) flag. The following command copies the ''simdata'' directory and all of its contents to your local machine.
{{{
user@localhost> scp -r remoteuser@cypress1.tulane.edu:/home/remoteuser/simdata .
}}}
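
Both examples above copy from the cluster to the workstation. Going the other way only swaps the arguments: the local path becomes the source and the remote address becomes the destination. The following is a minimal sketch, assuming a hypothetical local file ''results.tar.gz'' and reusing the placeholder account ''remoteuser''. The command is printed rather than run, so it can be checked before use.

```shell
# Placeholder account and host, taken from the examples above.
remote="remoteuser@cypress1.tulane.edu"
# Hypothetical local file to upload.
local_file="results.tar.gz"

# Upload: the local path is the source, the remote address is the destination.
upload_cmd="scp ${local_file} ${remote}:/home/remoteuser/"

# Print the command for review; paste it into the terminal to run it.
echo "${upload_cmd}"
```

To upload a whole directory, the same ''-r'' flag applies.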

=== Graphical Software ===
There are many graphical file transfer clients available. The following three are popular and fairly intuitive. Be sure to set each one to connect to the cluster using the SSH File Transfer Protocol (SFTP).

''FileZilla'' is available for Windows, macOS, and Linux. Be careful when downloading and installing it, as the hosting site, SourceForge, has begun to bundle bloatware with its downloads.
[[https://filezilla-project.org/|FileZilla]]

''Fetch'' is a full-featured file transfer client for Mac and is free to the academic community. [[http://fetchsoftworks.com/|Fetch]]

''WinSCP'' is a free Windows client. [[http://winscp.net/eng/download.php|WinSCP]]
== Example ==
[[cypress/FileTransfer/Example|Let's try out FileZilla]]

= Storage on Cypress =
Every Cypress user has two locations in which to store data: a small, high-security, low-performance personal home directory and a large, secure, group-shared Lustre directory.

== Storage: home directory ==

Your home directory on Cypress is intended to store customized source code, binaries, scripts, analyzed results, manuscripts, and other small but important files. This directory is limited to 10 GB (10,000 MB) and is backed up. To view your current quota and usage, run the command:

{{{
quota -f /home
}}}
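
As a rough cross-check against the 10 GB limit, disk usage can also be estimated with the standard ''du'' tool. This is only a sketch: ''home_usage_percent'' is a hypothetical helper name, the limit is taken as 10,000 MB as stated above, and ''du'' measures blocks on disk in 1024-byte units, so its figure is approximate and may differ from what ''quota'' reports.

```shell
# Hypothetical helper: approximate percentage of the 10 GB home quota
# used by a directory (defaults to $HOME, or "." if HOME is unset).
home_usage_percent() {
    dir="${1:-${HOME:-.}}"
    # Limit taken as 10,000 MB, expressed here in kilobytes.
    limit_kb=$((10000 * 1000))
    # du -sk prints "<kilobytes><TAB><path>"; keep only the first field.
    used_kb=$(du -sk "$dir" 2>/dev/null | cut -f1)
    echo $(( ${used_kb:-0} * 100 / limit_kb ))
}

# Example: report approximate usage for the home directory.
echo "Home directory usage: about $(home_usage_percent)% of 10 GB"
```

The ''quota'' command remains the authoritative source for the enforced limit.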

Please do not use your home directory to perform simulations with heavy I/O (file read/write) usage. Instead, please use your group's Lustre project directory.

== Storage: Lustre group project directory ==

Cypress has a 699 TB Lustre filesystem available for use by active jobs, and for sharing large files for active jobs within a research group. The Lustre filesystem has 2 Object Storage Servers (OSSs) that provide file I/O for 24 Object Storage Targets (OSTs). The Lustre filesystem is available to compute nodes via the 40 Gigabit Ethernet network. The default stripe count is set to 1.

Allocations on this filesystem are provided per project/research group. Each group is given a space allocation of 1 TB and an inode allocation of 1 million (i.e., up to 1 million files or directories) on the Lustre filesystem. If you need additional disk space to run your computations, your PI may request a quota adjustment. To request a quota adjustment, please provide details and an estimate of the disk space used/required by your computations. Allocations are based on demonstrated need and available resources.
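
Because the inode allocation counts every file and directory, it can be useful to see how many entries a directory tree contains before a job writes many small files. The following is a minimal sketch using the standard ''find'' and ''wc'' tools. ''count_inodes'' is a hypothetical helper name, and the demonstration runs on a throwaway temporary directory rather than on Lustre.

```shell
# Hypothetical helper: count files and directories under a path,
# including the top-level directory itself.
count_inodes() {
    find "$1" | wc -l | tr -d ' '
}

# Demonstration on a temporary directory holding three empty files.
demo_dir=$(mktemp -d)
touch "$demo_dir/a.dat" "$demo_dir/b.dat" "$demo_dir/c.dat"
demo_count=$(count_inodes "$demo_dir")
echo "entries: $demo_count"
rm -r "$demo_dir"
```

On Cypress, the same helper could be pointed at a subfolder of the group's project directory. The count is only an estimate for one subtree; the ''lfs quota'' commands in this section report the figures that are actually enforced.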

The Lustre filesystem is not for bulk or archival storage of data. The filesystem is configured for redundancy and hardware fault tolerance, but it is not backed up. If you require space for bulk or archival storage, please contact us, and we will review the available options.

Your group's Lustre project directory will be at:

{{{
/lustre/project/<your-group-name>
}}}

''your-group-name'' is your Linux group name, as returned by the command ''id -gn''. Your group is free to organize its project directory as desired, but it is recommended to create separate subfolders for different sets of data or for different groups of simulations.
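
The path and the suggested layout can be put together as follows. This is a sketch only: the subfolder names ''raw_data'' and ''run_01'' are hypothetical, and the base directory defaults to a temporary location so the example can be tried anywhere. On Cypress, ''base'' would be set to ''/lustre/project''.

```shell
# Base directory: a temporary stand-in here; on Cypress use /lustre/project.
base="${base:-$(mktemp -d)}"
# Linux group name; fall back to the numeric ID if the name cannot be resolved.
group=$(id -gn 2>/dev/null || id -g)
project_dir="$base/$group"

# Hypothetical subfolders: one per data set or group of simulations.
mkdir -p "$project_dir/raw_data" "$project_dir/run_01"
ls "$project_dir"
```

Keeping each data set or simulation campaign in its own subfolder also makes it easier to see which part of the project consumes the group's quota.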

To view your group's current usage and quota, run the command:

{{{
lfs quota -g `id -gn` /lustre
}}}

To view your own usage, you can run:

{{{
lfs quota -u `id -un` /lustre
}}}

== High Performance Data Transfer ==
For high-speed transfer of large files (1 GB or larger), Cypress is currently equipped with the data transfer tool '''bbcp'''. The use of bbcp is beyond the scope of this workshop, but an excellent treatment can be found at http://pcbunn.cithep.caltech.edu/bbcp/using_bbcp.htm

=== Next Section ===
[[cypress/WorkingOnClusters|Working on a Unix System]]