Using Bioarchive

From Carl R. Woese Institute for Genomic Biology - University of Illinois Urbana-Champaign
Revision as of 10:33, 24 February 2021 by Danield (talk | contribs) (Get S3 Keys)
Jump to navigation Jump to search

This page is a work in progress while we are getting the archive service into production[edit]

This documentation will change frequently as we learn more about the archive, and what users need to access it. This is our attempt to get the most information out to users as quickly as possible.

Request An Account[edit]

For now, email help@igb.illinois.edu. The owners and primary users of Strongbox Archive will have accounts created for them automatically as we transfer data over to the new archive.

Change Password[edit]

Once we get the data transferred, or once you request an account, you will need to change your password to ensure that access to your data is secure.

Get S3 Keys[edit]

Although a bit confusing, the main credentials to access your data are the S3 keys that are assigned to your account.

Option A - Get a Biocluster Account[edit]

The tools to access the archive are already installed on Biocluster. To get an account on Biocluster please fill out the form at https://www.igb.illinois.edu/content/biocluster-account-form

Option B - Install Eon Browser[edit]

You can download and install the Eon Browser GUI for the archive on your local system by downloading the software at https://developer.spectralogic.com/clients/

Option C - Install Command Line Tools[edit]

You can download and install the command line tools for the archive on your local system by downloading the software at<a class="external free" href="https://developer.spectralogic.com/clients/" rel="nofollow" data-mce-href="https://developer.spectralogic.com/clients/">https://developer.spectralogic.com/clients/</a>​

Using Eon Browser[edit]

The Eon Browser is the simplest method of transferring data into and out of the archive. CNRG expects that most users will use the EON Browser. The Spectralogic User Guide for the Eon Browser is at https://developer.spectralogic.com/wp-content/uploads/2018/11/90990126_C_BlackPearlEonBrowser-UserGuide.pdf

Using Command Line Tools[edit]

It is important that if you are using the archive on cluster you substitute 172.16.28.54 in as the name of the archive. This will give you the fastest access possible to the archive. If you are not using the cluster it is important that you use bioarchive.igb.illinois.edu as the name of the archive as you will not have access to the 172.16.28.0/22 network. The Spectralogic user guide for the command line tools is at https://developer.spectralogic.com/java-command-line-interface-cli-reference-and-examples/

Upload A Directory[edit]

ds3_java_cli -a S3_Access_ID -k S3_Secret_Key -b Bucket_Name -e bioarchive.igb.illinois.edu --insecure -c put_bulk -d Directory_Name

  • S3_Access_ID is the S3 access id for your account, you can get this by logging into the archive web interface
  • S3_Secret_Key is the S3 secret key for your account, you can get this by logging into the archive web interface
  • Bucket_Name is the S3 bucket you want to put your data in
  • bioarchive.igb.illinois.edu is the dns name of the archive. If you are using in biocluster, you should use 172.16.28.54 instead
  • insecure does not check the certificate of the service
  • put_bulks tells the program to upload everything inside the directory given
  • Directory_Name is the directory to upload, note this will upload everything in this directory, not the directory itself