Skip to main content
ExLibris
  • Subscribe by RSS
  • Ex Libris Knowledge Center

    Ingest XML files fails in some servers

    • Article Type: General
    • Product: Rosetta

    Problem Symptoms:
    Some xml files are going through correctly in one environment and fail format validation in the other.

    The error we are seeing in the logs is this:
    2014-01-03 07:36:17,622 WARN [com.exlibris.dps.repository.plugin.mdExtractor.AbstractJhoveMDExtractorPlugin] (SIP_PROCESSING_QUEUE Queue Job Receiver 315) [] jhove message: schema_reference.4: Failed to read schema document 'http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd', because 1) could not find the document; 2) the document could not be read; 3) the root element of the document is not <xsd:schema>.

    - Both environment, the file validation uses same JHOVE, XML-hul1.4
    - Both environment use the same plugin XML-hul for fmt/101(not 1_10)

    Cause:
    Firewall issues are stopping JHOVE using a validation xsd

    Resolution:
    run from the server:
    >wget http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd

    If you fail to connect, JHOVE will fail identifying the xml format.


    • Article last edited: 2/17/2014