Skip to main content
ExLibris
  • Subscribe by RSS
  • Ex Libris Knowledge Center

    Problems importing zero width non-joiner in Arabic (MARC8_TO_UTF conversion)

    • Article Type: General
    • Product: Aleph
    • Product Version: 18.01

    Description:
    When converting Arabic records from MARC8 to UTF during OCLC import, we receive this error:

    Error: character X"8e" is not defined in marc8_ara_to_unicode.

    X’8E’ is the Zero Width Non-Joiner (ZWNJ) represented in Unicode as X’200C’. The corresponding X’8D’ is the Zero Width Joiner (ZWJ), represented in Unicode as X’200D’.

    The resulting record in ALEPH contains errors.

    Note that this problem does not appear when importing the same record from OCLC specifying UTF-8 encoding instead of MARC8.

    Resolution:


    • Article last edited: 10/8/2013