Skip to main content
ExLibris

Knowledge Assistant

BETA
 
  • Subscribe by RSS
  • Back
    Rosetta

     

    Ex Libris Knowledge Center
    1. Search site
      Go back to previous article
      1. Sign in
        • Sign in
        • Forgot password
    1. Home
    2. Rosetta
    3. Knowledge Articles
    4. METS encoded UTF-8 with byte order mark may cause the ingest to fail

    METS encoded UTF-8 with byte order mark may cause the ingest to fail

    1. Last updated
    2. Save as PDF
    3. Share
      1. Share
      2. Tweet
      3. Share
    1. Description
    2. Resolution
    3. Additional Information
    • Product: Rosetta
    • Product Version: v5
    • Relevant for Installation Type: Local

     

    Description

    When creating METS and other metadata files for ingest it's important to be aware of what kind of UTF-8 encoding is being used.
     

    Resolution

    METS encoded UTF-8 with byte order mark (BOM) may cause the METS ingest to fail with "unexpected CDATA" error.
     The presence of the extra BOM characters can be detected by using the 'less' command to view the ie.xml which will reveal the following string that preeceeds the header:
       <U+FEFF><mets:mets xmlns:mets="http://www.loc.gov/METS/">
     Therefore all metadata files (e.g. METS, CSV, dc.xml, etc.) need to be encoded UTF-8 without BOM.
     Refer to the Additional Information section below for more information about BOM.

     

    Additional Information

    The byte order mark (BOM) is a Unicode character, U+FEFF BYTE ORDER MARK (BOM), whose appearance as a magic number at the start of a text stream can signal several things to a program consuming the text:
       -What byte order, or endianness, the text stream is stored in;
       -The fact that the text stream is Unicode, to a high level of confidence;
       -Which of several Unicode encodings that text stream is encoded as.
     BOM use is optional, and, if used, appears at the start of the text stream.

     


    • Article last edited: 03-January-2017
    View article in the Exlibris Knowledge Center
    1. Back to top
      • Metadata Profile is not filtered by institution
      • ImageMagick Security Issue CVE-2016–3714 "ImageTragick"
    • Was this article helpful?

    Recommended articles

    1. Article type
      Topic
      Content Type
      Knowledge Article
      Language
      English
      Product
      Rosetta
    2. Tags
      1. METS
      2. UTF-8
    1. © Copyright 2025 Ex Libris Knowledge Center
    2. Powered by CXone Expert ®
    • Term of Use
    • Privacy Policy
    • Contact Us
    2025 Ex Libris. All rights reserved