View on GitHub

archiving_workflow

Development of this documentation was generously funded by the ARC Centre of Excellence for the Dynamics of Language (CoEDL): www.dynamicsoflanguage.edu.au

Getting Started with PARADISEC

This page is dedicated to helping you begin the seemingly overwhelming process of getting your materials organised and eventually archived with PARADISEC (Pacific and Regional Archive for Digital Sources in Engangered Cultures).

STEP 1: Sign up as a user with PARADISEC

Signing up with PARADISEC means you become a PARADISEC user. As a user you can access data in an open collections after agreeing to PARADISEC’s conditions of access, or you may send a request to access data that has restrictions on viewing by following instructions provided by the collector on their catalog page.

You also need to sign up with PARADISEC if you wish to deposit material to the archive.

The process of signing up is simple- go to the PARASDISEC catalog web page: http://catalog.paradisec.org.au/, and click on Sign up in the upper right‐hand corner. Once you sign up, you get an automated message informing you to check your email to confirm your account,

“A message with a confirmation link has been sent to your email address. Please open the link to activate your account”

Click on the link in your email to confirm.

Congratulations, you are now a PARADISEC user! It is recommended that you browse the collections held in PARADISEC so that you can investigate the different ways collections can be organised. Below are a some examples:

STEP 2: Contact PARADISEC to help you create a new collection

Only PARADISEC administrators can create a collection in the catalog. When you are ready for us to create a collection, contact the PARADISEC administration at admin@paradisec.org.au or the CoEDL Data Manager. To begin with you will need to provide us with the following information:

Preliminary exploration of your files

Before you contact PARADISEC, take a look at your files and determine how you want your collection(s) presented. Do you need one collection? Or more than one? This is the time for you to reflect upon how you want your material organised in the archive.

Information to provide to PARADISEC

Once you have decided upon which files you would like to archive, come up with a name and description for your collection. As for the Collection ID, PARADISEC administrators can help you come up with this- the collection ID should be a unique identifier (could be your initials in all caps and a digit, or the ISO language code).

Example collection IDs: JM1, JCM4, OE1, MEU1, etc.
The digit is required and it distinguishes your collection from that of someone with the same initials (John Mansfield, Julia Miller, etc), but also allows for multiple collections, should you require, e.g. JM1, JM2… JM85
.

STEP 3: Organise materials to be sent to collection

PARADISEC’s metadata spreadsheet

PARADISEC has created a spreadsheet that allows us to automatically create the catalog metadata for items in your collection. You can acces the spreadsheet here. (If the .xlsx file does not download automatically, then right-click on the link to save the file). Please do not alter the structure of the spreadsheet, as this will prevent the auto-upload process for us.

Once your collection has been created, and has been populated with the metadata for your items, you can edit you collection information directly via the catalog pages. You can add details such as:

PARADISEC’s file naming conventions

Now comes the thrilling task of making sure your files are named appropriately. Getting your file names in order should come after you have created your collection and after you have settled on the organisation of your items (recording events) within your collection.

ANATOMY OF A PARADISEC FILE NAME
Say you have, for example, a file that you named 20150908‐wordlist‐01.wav while in the field. This in‐field naming convention includes the date in ISO format (YYYYMMDD). The (‐wordlist) marks this as being from a session involving wordlist elicitation. The (‐01) indicates that this is the first track of the session. You will have your own conventions.

The name 20150908‐wordlist‐01.wav does not fit with the PARADISEC naming conventions. First of all, there needs to be a collection ID followed by a hyphen, and then only two more parts separated by hyphens: CollectionID-ItemID‐ContentFile.

  1. CollectionID is your PARADISEC collection ID and it typically has the shape of your initials in capital letters and a number. This ID can be determined by you, but must be confirmed as available for use by the CoEDL Data Manager or a PARADISEC administrator. This ID is created when you set up a collection in PARADISEC. Mine could be JCM1

  2. ItemID is your PARADISEC item ID. An item could be a cassette tape, or now in the born-digital age, it is typically an recording event containing all related files from that session- audio, video, transcriptions). You must use a terse description here. You may use an underscore ( _ ) if you need a separator; hyphens ( ‐ ) are prohibited within your Item ID as they are reserved PARADISEC operators. You could have speaker initials in your item name (JB), description of the task (wordlist), field site (BIMA).

  3. ContentFile is the final necessary part to a file name. This part of the name allows you to enumerate files of the same format, i.e. photos with 001, 002, etc, or multiple tracks of a single audio or video session. You can distinguish type of microphone used for an audio track in this part of the file name, e/g/ -LM (lapel mic) or -HM (for head mounted mic). So if your original file was named 20150908‐wordlist‐01.wav, you could try something like this: JCM01‐20150908_wordlist‐01.wav

Note: Even if your item contains only one file, and thus does not need to be differentiated from other files in the item, you still need to have this third part of the file name. You could use -00, -01, -wordlist01, -original, etc.

TEMPLATE: [CollectionID] - [ItemID] - [ContentFile] . file extension

Coll ID Item ID Content Extension
JCM01 20150908_wordlist 01 .wav

Note: No spaces or special characters are allowed in your file names, and only the two hyphens seen above are allowed. Underscores are fine. File names should not exceed 30 characters, excluding file extensions.

PARADISEC’s deposit form

And lastly, you need to fill out a deposit form that provides PARADISEC with your contact information, access conditions for your collection, including any restrictions. Fill in the form, sign it, scan or photograph it, and email it back to us. We keep this digital record with your collection.

STEP 4: Sending your files to PARADISEC

If you are planing to archive digital files with PARADISEC, and your institution is part of the Australian Access Federation (AAF), you can sign up for a Cloudstor account. Once you do, you get 1TB of free storage available to you instantly. You can also send us OneDrive, Dropbox, Google Drive links. These options are the best means to transfer files to PARADISEC. If you do not use any of those cloud storage options, but would like to transfer your files this way, contact PARADISEC or the CoEDL Data Manager and we can create a Cloudstor folder for you to use for transferring files to us.

If uploading files to a cloud service is not possible due to a lack of computer access or poor internet connections, you can post a hard drive of your materials to PARADISEC at:

PARADISEC
Room 3019, Sydney Conservatorium of Music C41
The University of Sydney
NSW 2006 AUSTRALIA

Or to the CoEDL Data Manager at:

ATTN: CoEDL Data Manager
The ARC Centre of Excellence for the Dynamics of Language
College of Asia Pacific
The Australian National University
Coombs Building, 9 Fellows Road
Acton 2601 ACT AUSTRALIA