- Article Type: General
- Product: Rosetta
- Product Version: 5.0
- Relevant for Installation Type: Local;
Desired Outcome Goal:
To create CSV file with a customized preservation type that can be ingested into Rosetta
1. Go to Producers > Advanced Tools > CSV Templates and click "Add CSV Template" to create a CSV template:
a. Select metadata fields from the right column and double-click or drag them to the left column and click "Save" to save the template.
See "Additional Information" for example csv with basic metadata fields.
2. Go to Producers > Advanced Tools > Content Structures
a. Select the "CSV Loader Converter" under Add Content Structure and click "Add" to create a new content structure for the CSV file.
b. Give the new content structure a name, link it to the appropriate CSV template (drop-down), and choose the appropriate Generate CSV Option (drop-down).
c. Download the newly-created CSV template by clicking on the "Download CSV Template" button and save locally.
d. Click "Save" to save the newly-create content structure.
3. Go to Producers > Advanced Tools > List of Submission Formats and click "Add Submission Format"
It's possible to create either a "Detailed CSV" which requires submission of a csv and zip files or an "NFS Acquiring" which only requires submission of the csv, as long as the "File Original Path" includes a resolvable URL to the source file.
If using Detailed CSV use the default "Detailed CSV" submission format which is not editable (as of Rosetta v4.2.1).
If using NFS Acquiring:
a. Under "Submission Format Details" add the following:
NFS Path: (e.g. /mnt/operational_shared/sipTmpDir/test_load)
Allow Navigation: Yes
Min. Number of Files: 1
4. Go to Producers > Deposit Arrangements > Material Flow List and click "Add Material Flow"
a. Update the following mandatory fields:
Material Flow Definition (manual or automated)
-Name: <add name>
-Select content structure: <add content structure defined in step 2. above>
-Select submission format: <add submission format defined in step 3. above>
-Select Metadata form: <add a metadata form appropriate for CSV ingest>
b. Click "Save" to save the Material Flow
5. Deposit with the material flow and review the log and BIRT reports for results.
Example CSV metadata Fields to include:
SIP: Title (DC)
Collection: Publish Collection [Title (DC) and Is Part Of (DCTERMS) are added automatically]
IE: Title (DC), Creator (DC), Type (DC), Identifier (DC), Identifier - URI (DC), Description (DC), Date (DC), Subject (DC), IE Entity Type
REP: Preservation Type, Usage Type
File: FILE - Identifier (DC) [File Original Name and File Original Path are added automatically]
NOTE: Rosetta Collection Management only displays dc:title in the Title column of the Content tab (not dcterms:title).
How to add File Labels for viewer display
During csv ingest the file names in the "File Original Name" column are added as the file labels by default (e.g. "123.jpg" - complete file name).
If it is preferred to use instead the DC titles as the file labels, in addition to the "File Original Name" column, add two additional columns:
1. File - Title (DC) with the "123.jpg" (complete file name as would be located in the path provided in the "File Original Path")
2. File Label with the file title "Map of Indonesia" or whatever file label that should display in the viewer's table of contents on the left.
Important notes about the CSV file when using the "Detailed CSV" submission format:
a. Make sure the file names listed in the "File Name" column of the CSV match the names of the actual files
b. File original path is: '/' unless the zip contains internal folders
Important notes about the CSV file when using the "NFS Acquiring" submission format:
In v4.2.1 the CSV automated deposit now supports absolute and HTTP paths.
To add a remote path, add the url, starting with “http://” to the 'File Original Path' column including the file name.
Make sure that the 'File Original Name' column is empty.
To add an absolute path add the url to the 'File Original Path' and the file name to 'File Original Name', make sure it starts with '/' ."
Scientific notations and Foreign language characters:
1. If automating CSV creation, the script should assure that non-byte order mark (BOM) UTF-8 encoded CSV is created.
2. For manual CSV creation, use LibreOffice or Notepad++ instead of Excel to save CSV files in non-byte order mark (BOM) UTF-8.
Collection Hierarchy Building Structure:
Object Type Title (DC) Is Part Of (DCTERMS) Publish Collection
Collection collection1 TRUE
Collection collection2 collection1 TRUE
Collection collection3 collection1/collection2 TRUE
Collection collection4 collection1/collection2/collection3 TRUE
Will create the following collection hierarchy in Rosetta's Collection Management: collection1/collection2/collection3/collection4/<IEs>
Refer to the "Depositing SIPs in CSV Structure" section of the Rosetta's Producer Guide:
Refer to the "CSV Content Structure" and "CSV Templates" sections of the Rosetta Staff User's Guide:
For explanation with screen shots see attached file
Please see the relevant documentation for additional information and limitations
CSV Loading Pre-checks
Following these pre-checks will reduce the number of potential errors encountered during loading.
If they are not linked, the producer/producer agent will not display in the Submission Job.
- Article last edited: 2/11/2016