Skip to main content
ExLibris
  • Subscribe by RSS
  • Ex Libris Knowledge Center

    Cleanup Job

    • Product: Rosetta 
    • Product Version: 5+
    • Relevant for Installation Type: Local

    Description

    All cleanup activities are logged /operational_shared/logs/server.log
    Directories that are actively used by the Rosetta application are not cleaned with the Cleanup Job.
    Where possible, technical, business, and performance cleanup is executed in real time.

    To review, change, or schedule the Cleanup Job:

    1. Connect to the Administrative Module and navigate to Scheduled Jobs Management
    2. Click "Update" for the Cleanup Job

    You can schedule the Cleanup Job to run hourly, daily, weekly, monthly, or as a cron (advanced).

    Deletes are made in the DB (HDpDepositActivity and sip_registry)and file system.

     

    Below are the list of Job Parameters that are inactive by default:

     

    1. Clean staff_work_area Directories

    Default parameter: Older than days: 30

    Source Directory: /operational_shared/staff_work_area

    Notes: Cleans all files

     

    2. Clean Delivery Cache

    Default parameter: Older than days: 90

    Notes: Cleans files and thumbnail creation for delivery viewers (in database)

     

    3. Clean Update Metadata Job Directories

    Default parameter: Older than days: 30

    Source Directory: /operational_shared/md_update

    Notes: Cleans files from 'done' subdirectory.

     

    4. Clean Add Derivative Rep Job Directories

    Default parameter: Older than days: 30

    Source Directory: /operational_storage/derivative_copies

    Notes: Cleans only METS + files directories with ‘DONE’ file

     

    5. Clean Old Deposit Jobs

    Default parameter: Older than days: 183

    Source Directory: /deposit_storage/1-1000

    Notes: Clean files and empty parent folders as follows:

    Database: Cleans records from HDpDepositActivity with statuses DECLINED, APPROVED older than the input parameter (e.g. older than 30 days).
    Cleans all DELETED status records and all INCOMPLETE status records older than 7 days.

    File system: Cleans directories and caches.

     

    6. Clean Preservation Alternative Plans Directories

    Default parameter: Older than days: 90

    Source Directory: /operational_shared/operational_export_directory

    Notes: Cleans the Preservation directories and files exported from several places within Rosetta

     

    7. Clean Finished SIPs

    Default parameter: Older than days: 183

    Source Directory: /operational_shared/sipTmpDir

    Notes: Used as a temporary directory for uploaded file when replacing files.

    Clean files and empty parent folders as follows:

    Database: Cleans all records from the sip_registry DB table that have a "FINISHED" status and are older than the input parameter (e.g. older than 30 days).

    SIPs are deleted only after its related deposit_activity is deleted.

    File system: Cleans only SIPs (XMLs + files) that are in the permanent repository.

     

    8. Clean Deleted OAI-PMH Records

    Default parameter: Older than days: 90
    Source Directory: /operational_shared/<location varies depending on submission format definition>
    Notes: It is recommended that you set the OAI deletedRecord policy to transient in the oaiproviderconfig.xml configuration file.

     

    9. Clean Old Events
    Default parameter: Older than days: 365

    All / Exclude / Include Event Types:
    Notes: Can be configured to include / exclude specific event types in the audit event table based on date and event type.

    Removes HFREVENT and HFREVENTKEYS.
     

    10. Clean Process History
    Default parameter: Older than days: 183
    Source Directory: /operational_shared/sipTmpDir
    Notes: Cleans any orphan process automatically (result of delete process)

    Additional Information

    Other directories that are cleaned:

     

    /operational_shared/backoffice/system_jobs (cleans files used for system jobs)

    /operational_shared/bytestream_work (used as a temporary directory for bytestream processing)

    /operational_shared/format_library_downloads (cleans old Format Library downloaded files)

    /operational_shared/tmp/metsTmpDir (cleans only XMLs and files / directories from Move to Permanent that are now in the permanent repository)

    /operational_shared/tmp/ScheduledReportsFiles (cleans BIRT Excel, Word, PDF, etc. report output).

    /operational_shared/tmp/Scripts (cleans files not in use by Rosetta)

    /operational_shared/tmp/storage/destPath (cleans small text files used by the NFS Plugin)

     


    • Article last edited: 29-Aug-2021