Skip to main content
ExLibris
  • Subscribe by RSS
  • Ex Libris Knowledge Center

    Generic MARC 21

    This information is not applicable to Primo VE environments. For more details on Primo VE configuration, see Primo VE.
    The following sections describe the mappings used for Generic MARC normalizations.

    Control Section

    Generic MARC 21 Control Section
    Normalized Record Field Source/Content Note
    Source ID
    From data source definitions
     
    Original Source ID
    From data source definitions
     
    Source Record-ID
    From header of source file
     
    Record ID
    Source ID + Source Record-ID
     
    Additional Record-ID
       
    Source Type
     
    Not in use.
    Source Format
    From data source definitions
     
    SourceSystem
    Aleph
     

    Display Section

    General Notes

    • String multiple occurrences with a semicolon unless indicated otherwise. If the source data has a period at the end and it is not the final occurrence, remove the period.
    • Remove the following end punctuation: : , = ; /

    Notes Regarding Subfields and Indicators

    • If no subfields are listed explicitly, data from all non-numeric subfields will be displayed.
    • If a field or a subfield is repeated, all instances should be displayed.
    • Subfields are listed in alphabetical order for the sake of clarity, but should be displayed in the order they are recorded in the source record.
    • If all the subfields or specified non-numeric subfields are taken, numeric subfields are not considered.
    • If a numeric subfield is specifically included, no other numeric subfield will be included.
    • If a numeric subfield is excluded, the mapping will take other numeric subfields.
    • If no indicators are defined, all indicators will be taken.
    880 tags are mapped together with the standard tag (the 880 tags are added first) in the following fields: Contributor, Publisher, Creator, Description, Edition, Subject (for 600, 610, 611, and 630), Relation, and Is part of.
    Generic MARC 21 Display Section
    Display Element Source Note
    Source
    Source from the data source definition
     
    Resource Type
    See mapping below
     
    Title
    If FMT=SE, then use 130 OR 245; otherwise use 245 with the following subfields:
    130 ##adfklmnoprs
    245 ## $$abfgknp
    The 130 was added for serials, because for serials additional information will typically be in 130.
    Uniform title
    130 admnprs
    OR
    240 admnprs
     
    Vernacular title
    880 where $$6=245
    subfields:
    abfgknp
     
    Creator
    100 abcdejqu
    110 abcde
    111 abcdn
    If the creator is derived from 100 and the first indicator is 1 or 2 then all of the text after the comma is taken (there must be a comma) and is displayed before the text that precedes the comma (deleting the comma itself).
    For example:
    Lippe, Ole von der --> Ole von der Lippe
    Van Der Wise, Fred --> Fred Van Der Wise
    Disabled rules do not reverse the author name.
    Contributor
    700, 710, and 711 except for second indicator=2
    With the following subfields:
    700 abcdejqu
    710 abcde
    711 abcdn
    For 700 the first indicator is 1 or 2. Therefore all of the text after the comma is taken (there must be a comma) and is displayed before the text that precedes the comma (deleting the comma itself).
    For example:
    Lippe, Ole von der --> Ole von der Lippe
    Van Der Wise, Fred --> Fred Van Der Wise
    Analytic 7XX field are excluded. They will be added to the description.
    Disabled rules do not reverse the author name.
    Description
    505, 520 $a
    700, 710, and 711 with the second indicator=2 using the following subfields:
    700 abcdemnopst
    710 abcdemnopst
    711 acdenpqst
    Every field is a separate occurrence.
    Edition
    250 $a $b
     
    Publisher
    502 a or 260 a,b or 264 a,b
    And equivalent fields from 880.
     
    Subject
    All 6XX fields
    Strip all numeric subfields.
    Language
    008/35-37;
    041 subfields $$a $d, $e (all occurrences should be taken)
    Validate code against list of ISO 639-2 codes. If the code cannot be translated, leave it as is.
    Physical Format
    300 and 340 fields
    If the 300 field does not end with a period, add it.
    Identifier
    020 $$a – prefix the value with ISBN:
    022 $$a – prefix the value with ISSN:
    024 2# $$a – prefix the value with ISMN
    This mapping is disabled in the out-of-the-box template since the identifiers by default do not display in the Front End.
    Relation
    Prefix the value with Series: 400, 410, 411, 440, 490, 800, 810, 811, 830, 840
    780 (first indicator -1): Prefix the value with Earlier Title:
    785 (first indicator -1): Prefix the value with Later Title:
    Strip subfield $w, $x, $y
    Every field should be a separate occurrence.
    The prefix should be added to $$C and the value to $$V.
    Display constant codes are used:
    series
    earlier_title
    later_title
    Is Part Of
    773
    Strip subfield $w, $x, $y
     
    Creation Date
    260 $c
    OR
    008/07-10
    For a date created from 008, create a date only if it starts with a digit that is not zero and replace missing digits with a question mark. For example:
    19-- > 19??
    19uu > 19??
    Library Level Availability
     
    The Library Level Availability field subfields include:
    $$I Primo Institution
    $$L Primo Library
    $$1 Sublocation
    $$2 Call number
    $$S Availability status
    $$3 No. of items
    $$4 No. of unavailable items
    $$5 multi-volume flag
    $$6 number of loans

    Mapping to Resource Type

    The mapping is based on the format type derived either from LDR positions 6 and 7 or tag and position. Use the following tables to determine the mapping.
    LDR Positions
    Leader pos. 6/7 Record type Format
    a Language material + pos.7= a,c,d,m
    Books
    BK
    a Language material + pos.7= b, i. s
    Continuing Resources
    SE
    c Notated music
    Music
    MU
    d Manuscript notated music
    Music
    MU
    e Cartographic material
    Maps
    MP
    f Manuscript cartographic material
    Maps
    MP
    g Projected medium
    Visual materials
    VM
    i Nonmusical sound recording
    Audio materials
    AM
    j Musical sound recording
    Audio materials
    AM
    k Two-dimensional non-projectable graphic
    Visual materials
    VM
    m Computer file
    Computer files
    CF
    o Kit
    Visual materials
    VM
    p Mixed material
    Mixed materials
    MX
    r Three-dimensional artifact or naturally occurring object
    Visual materials
    VM
    t Manuscript language material
    Books
    BK
    w Rare books
    Used by KORMARC.
    Rare Books RB
    Default
     
    BK
    Record Type Derived from Tag and Position
    Format Based on (tag and position) TYPE Note
    BK
     
    book
    The catch-all for BK if no further information is available is Book
    CF
    008-26
    h
    audio
     
    CF
    008-26
    j
    database
     
    CF
    008-26
    d, e
    text_resource
     
    CF
     
    other
     
    MP
     
    map
    DC defines a map as a type of image.
    AM
     
    audio
     
    MU
     
    score
     
    SE
    008 21
    d,w
    other
     
    SE
    008 21
    L
    text resource
     
    SE
    008 21
    M
    book
     
    SE
     
    journal
     
    VM
    008 33
    I,k,l,n,s,t
    image
     
    VM
    008 33
    F,m,v
    video
     
    VM
     
    other
     
    MX
     
    other
     

    Links

    Links Section
    Type of Link Source Note
    OpenURL
    Based on resource type from display:
    If type=article then:
    $$Topenurl_article
    Otherwise:
    $$Topenurl_journal
    SFX has two sources for Primo: one for articles in which case the data is used and one for journals in which case the date is ignored. There is a different template per source.
    OpenURL_fulltext
    Based on resource type from display:
    If type=article then:
    $$Topenurlfull_article
    Otherwise:
    $$Topenurlfull_journal
    SFX has two sources for Primo: one for articles in which case the data is used and one for journals in which case the date is ignored. There is a different template per source.
    OpenURL_servicetext
       
    Backlink
       
    LinktoHoldings
       
    Linkto Holdings_available
       
    Linkto Holdings_unavailable
       
    Linkto Holdings_doesnotexist
       
    LinktoRequest
       
    LinktoResource
    856 40 $u and 856 41 $u
    Add display text ($$D) from $y + $3 + $z. If not available, then use code: "Online version"
    856 1#, 856 10, and 856 11
    Add display text ($$D) from $y + $3 + $z. If not available then use code: "Online version"
    Validate that the link is to the resource by checking the content of subfield 3.
    Additional links
    856 42 $u.
    Add display text ($$D) from $y + $3 + $z. If not available then use code: "Related online content"
    506 $u $$Dlink to restrictions on access
    538 $u $$Dlink to system details
    540 $u $$D Link to terms governing use and reproduction
    545 $u $$D Link to biographical or historical information
    856 41 $u if $3 is "Sample Text" or "Publisher description"
     
    Thumbnail
    $$Tsyndetics_thumb (disabled)
    $$Tgoogle_thumb
    For Syndetics, this field requires an ISBN.
    For Google, this field requires an OCLC and LCCN.
    linktotoc
    505 $u
    $$Tamazon_toc
    $$Tsyndetics_toc (disabled)
    856 4# $u if $3=Table of Contents
    Create Amazon and Syndetic links only if there is an ISBN.
    linktoabstract
    $$Tsyndetics_abstract
    Add if there is an ISBN (020 $a).
    linktoreview
    520 1# $u
     
    linktofa
    555 0# $u
    Add subfields abcd to $$D
     
    linktouc
    $$Tamazon_uc – add if there is ISBN
    $$Tworldcat_isbn – add if there is ISBN ELSE add
    $$Tworldcat_oclc – if there is OCLC number
     
    linktoexcerpt
    $$Tsyndetics_excerpt
    Add if there is an ISBN.

    Search

    880 tags are mapped together with the standard tag in the following fields: Creator/Contributor, Title, Additional title, Description, Subject (for 600, 610, 611, and 630), TOC, and General (except for identifiers).
    Search Section
    Index Source tag Notes
    Creator/contributor
    100 abcdejqu
    110 abcde
    111 abcdn
    245 c
    505 r
    508 a
    511 a
    700 abcdejqu
    710 abcde
    711 abcdn
    720 a
    800 abcdejqu
    810 abcde
    811 abcdn
     
    100 a
    700 a
    800 a
    For the 100, 700, and 800 fields, if the first indicator is 1 or 2, then take only the second uppercase character in the string, following a comma.
    Title
    If type = Journal:
    245 a
    245 a,b,f,g,n,p
    130 a
    Else:
    245 a,b,f,g,n,p
    For journals up to three exact titles are indexed.
    Additional title
    100 fgklnpt
    110 fgklnpt
    111 fgklnpt
    247 abnp
    400 fklnptv
    410 fklnptv
    411 fklnpstv
    440 anpv
    490 av
    700 fklmnoprst
    710 fklmnoprst
    711 fklnpst
    730 adfklmnoprs
    740 anp
    800 fklmnoprstv
    810 fklmnoprstv
    811 fklnpstv
    830 adfklmnoprstv
    840 adfklmnoprstv
    760,762,765,767,770,772,
    773,774,775,776,777,780,
    785,786,787 subfields st
     
    Alternative Title
    130
    210
    240
    243
    246 abnp
     
    Description
    520 $a
     
    Subject
    6XX fields – Strip all numeric subfields
    Translation of LCC by enrichment
     
    ISBN
    020 az
     
    ISSN
    022 ayz
     
    Resource type
    Resource type from display
     
    Creation date
    008/07-10 and 008/11-14 are digits and not 9999
    260 $c
     
    Full Text
       
    TOC
    505 $a
     
    RecordSource
    Source ID from the control section
    Required to filter out certain sources.
    RecordID
    Record ID from the control section
    Required to retrieve record based on system number.
    General
    260 $b
    502
    511
    508
    518
    521
    534
    586
    0242 az
    0243 az
    027 az
    028 a
     
    Search scope
    From PNX:
    delivery/institution
    control/sourceid (for example the data source is added as a scope)
     
    Restricted search scope
       
    Scope
    Copies from the Search scope and Restricted search scope from the sections above
     

    Sort

    Sort Section
    Sort type DC field
    Creation Date
    008/07-10 OR 260 $c
    Author
    A single author sort key is created from one of the following tags. Subfields are the same as in the display section:
    880/100
    100
    880/110
    110
    880/111
    111
    880700
    700
    880/710
    710
    880/711
    711
    Title
    A single title sort key is created from one of the following:
    880/245
    130 if FMT=SE
    245
    Popularity
     

    Facets

    Facet Section
    Facet Source Note
    Resource type
    Create this based on the Resource type field from display section as follows.
    Book -> books
    Journal -> journals
    Article -> articles
    Text Resource -> books
    Image -> images
    Audio -> media
    Video -> media
    Score -> Scores
    Map -> Maps
    Other -> other
    In some cases, two values should be created, each as a separate field.
    Language
    008/35-37 and 041 subfields a, d, e.
    If the language is not a valid ISO 639 code it should not be created.
    Creator/Contributor_
    100/700 $a
    110/710 $a
    111/711 $a
    The normalized format.
    For 100 and 700, if the first indicator is 1 or 2 then take second upper case character in the string, following a comma.
    7XX except for second indicator 1.
    Topic
    6XX except for 655
    First facet level is all data up to the first occurrence of subfield $$v, x, y or z. Each subfield division (v, x, y or z) constitutes the next level.
    The first facet level might have multiple occurrences in one record; these multiple occurrences should be "de-duplicated."
    Punctuation that is in the field should be retained, except for periods at the end.
    For example:
    <datafield ind1="0" ind2="0" tag="630"/>
    <subfield code="a">Bible.</subfield>
    <subfield code="p">O.T.</subfield>
    <subfield code="p">Pentateuch</subfield>
    <subfield code="x">Sermons.</subfield>
    Should become:
    Bible - O.T. - Pentateuch-Sermons (the hyphen between Pentateuch and Sermons is for the levels).
    Genre
    655 $a
    6XX $v
     
    classification.lcc
    Added by enrichment
     
    Creation Date
    008/07-10 OR 260 $c
    Truncate 260 $c so that it has only 4 digits. If the date cannot be normalized to 4 digits, do not create the facet.
    File size
    Not in use
     
    Collection
       
    Physical format
    Not in use
    Not in use.
    Top-level
    online_resources -- assign if the delivery category is Online Resource, SFX Resource, or MetaLib Resource.
    new – as tagged before load.
    Available in Library map based on availability information in the source record.
     
    Pre-filter
    Based on Resource Type from the display section:
    Book -> books
    Journal -> journals
    Article -> articles
    Text Resource -> books
    Image -> images
    Video -> audio_video
    Audio -> audio_video
    Maps -> maps
    Score -> scores
     
    Related record
       

    Duplicate Record Detection Vector

    Currently two types of record matching vectors exist:
    • T1 – for non-serials
    • T2 – for serials
    The mapping of record to T1 or T2 is based on the format type. The format type is based on the extraction procedure that creates the format (FMT) field from pos. 6 and 7 in the leader.
    • T1 – All formats except for SE
    • T2 – SE

    Vector for T1 - "non-serials"

    Vector for T1
    Field ID Nature of field Content of Field/Source Tag + Subfield Note
    T
    Type
    1
    Created if the format is not SE.
    The following fields are for the candidate selection:
    C1
    UnivID, UnivID_invalid
    010 $a $z
    Take prefix and number and remove any suffixes.
    Multiple occurrences are delimited by a semicolon.
    C2
    ISBN, Invalid_ISBN
    020 $a $z
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    C3
    Short title
    245 $abnp
    Use normalization routine #1
    Exact match on first 20 and last 10 char.
    The result is a single string of 30 characters.
    C4
    Year
    008 7-10
     
    The following fields are for the matching program:
    F1
    UnivID
    010 $a
    Take prefix and number and remove any suffixes
    F2
    UnivID_Invalid
    010 $z
    Take prefix and number and remove any suffixes
    Multiple occurrences are delimited by a semicolon.
    F3
    ISBN
    020 $a
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    F4
    ISBN_Invalid
    020 $z
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    F5
    Short title
    245 $abnp
    Same as C3.
    F6
    Year
    008 7-10
     
    F7
    Full title
    245 $abnp
    Use routine #2 from
     
    F8
    Country of publication
    008 15-17
     
    F9
    Pagination
    300 $$a
     
    F10
    Publisher
    260 $$b
    Use filing routine #3 to normalize
    Take only first occurrence of 260 tag and first occurrence of subfield b.
    F11
    Main entry (author, corporate body, meeting)
    100 $abcdq
    OR
    110 $abcdn
    OR
    111 $abcdenq
    Use normalization routine #3 to normalize
     

    Vector for T2 - "serials"

    Vector for T2
    Field ID Nature of field Content of Field/Source Tag + Subfield Note
    T
    Type
    2
    Created if the format is SE.
    The following fields are for the candidate selection:
    C1
    UnivID, UnivID_invalid
    010 $a $z
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    C2
    ISSN, Invalid_ISSN, cancelled_ISSN
    022 $a $y $z
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    C3
    Short title
    245 $abnp
    Use filing procedure #1
    Exact match on first 25 char.
    The result is a single string of 25 characters.
    C4
    Place of publication
    260 $$a normalized using routine 75
    After applying routine #3 then take only the first string (up to first blank).
    Take only first occurrence of 260 and first occurrence of subfield a.
    The following fields are for the matching program:
    F1
    UnivID
    010 $a
    Use data until a blank character or the end of subfield.
    F2
    UnivID_Invalid
    010 $z
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    F3
    ISSN
    022 $a
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    F4
    ISSN_Invalid
    022 $y
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    F5
    ISSN_Cancelled
    022 $z
    Use data until a blank character or the end of subfield.
    Multiple occurrences are delimited by a semicolon.
    F6
    Year
    008 7-10
     
    F7
    Full title
    245 $abnp
    Use filing routine #2
     
    F8
    Truncated title
    245 $a
    Use normalization routine #2
     
    F9
    Country of publication
    008 15-17
     
    F10
    Place of publication
    260 $$a normalized using routine #3
    After applying routine, take only the first string (up to first blank).
    Take only first occurrence of 260 and first occurrence of subfield a.
    F11
    Main entry (author, corporate body, meeting)
    110 $abcdn
    OR
    111 $abcdenq
    OR
    130 $a adlmnoprst
    Use filing routine #3
     

    FRBRization

    Refer to Normalization Routines for Duplicate Record Detection, for the normalization routines for the author and title parts.
    The key field has two subfields:
    • $$K key part
    • $$A key part type that determines the algorithm
    FRBRization
    Field ID Source (value of $$K for K fields) Key part type (value of $$A for K fields) Note
    T
    Always 1
     
    MARC 21 algorithm
    K1-Kn
    For every record a different number can be created
    100 OR, 110 OR, 111 OR, 700 ADD, 710 ADD, 711 ADD
    A
    Single occurrence of 100, 110, and 111;
    Multiple occurrences of 700, 710, 711, 100, 110.
    Take subfields a, b, c, d, q
    111, 711 - a, b, c, d, n, q
    Do not generate key from 700 or 710 if subfield e = "former owner"
    Kn
    130
    TO
    Subfield a, d, m, n, p, r, s
    Do not generate a key if subfield a or k contains "selections" or "census."
    Kn
    If format is not SE:
    240 ADD
    245 OR
    242 OR
    246 OR
    247 OR
    740 OR
    245 subfield k
    If format is SE:
    240 ADD
    245 OR
    242 OR
    246 OR
    247 OR
    740 OR
    245 subfield k
    T
    240 – Subfields a, d, m, n,p,r, s
    245 – a, b, e, f, g, n, p
    242 – a, b, f, g, n, p
    246 – a, b, f, g, n, p
    247 – a, b, f, g, n, p
    740 – Subfields anp
    Do not generate a part key from 240 if it starts with any of the following: selections, laws, treaties, bills, statutes, Acts, public general acts, acts, rules, works, or census.
    Note: If the format is not a serial (FMT=SE), then the title part keys will be generated from both 240 and 245.

    Delivery and Scoping

    Delivery and Scoping Section
    Delivery Field Source Additional normalization notes
    Institution
     
    Using ILS Institution Codes mapping table.
    Delivery category
     
    Restricted delivery scope
       

    Ranking

    Local mapping required as relevant.
    Ranking Section
    Booster Field Source Additional normalization notes
    booster1
    1 or as added by enrichment program
     
    booster2
     
    Not in use.

    Enrichment

    Local mapping required as relevant.
    Enrichment Section
    Enrichment Field Source Additional normalization notes
    classification.lcc
    050 $a, 090 $a
    All occurrences added to separate fields.
    fulltext
       
    TOC
       
    Abstract
       
    Review
       
    Rank-parent-child
       
    Rank-Number of copies
       
    Rank-Date first copy
       
    Rank-Number of loans
       

    Additional Data

    This includes multiple occurrences in separate fields.
    Additional Data
    Additional data field Source Additional normalization notes
    Author Last
    100 1# OR 100 2# OR 700 1# OR 700 2# $a
    Takes text until first comma.
    Only one occurrence should be created.
    Author First
    100 1# OR 100 2# OR 700 1# OR 700 2# $a
    Takes text after first comma and until first space.
    Only one occurrence should be created.
    Author initials
       
    Author first initial
       
    Author middle initial
       
    Author suffix
       
    Author
    100 abcdejqu
     
    Corporate Author
    110 abcde
    111 abcdn
     
    Additional author
    700 abcdejqu
    710 abcde
    711 abcdn
     
    Series author
    800 abcde
     
    Book Title
    If resource type is not an article or a journal:
    245 abfgknp
    Because the PNX cannot be used in conditions, this is based on LDR and 008.
    Article title
       
    Journal title
    If resource type is Journal:
    245 abfgknp
    Since the PNX cannot be used in conditions, this is based on LDR and 008.
    Short title
    210 a
     
    Additional title
    246 abnp
     
    Series title
    400, 410, 411, 440, 490, 800, 810, 811, 830, 840
    Strip subfield x.
    Date
    008/07-10 or 260 $c
    Normalize to 4 characters.
    RISDate
    260 $c or 008/08-10
     
    Additional Date
       
    Volume
       
    Issue
       
    Part
       
    Season
       
    Quarter
       
    Start page
       
    End page
       
    Pages
       
    Article number
       
    ISSN
    022 a
    Use data up to a blank character or end of subfield.
    eISSN
    776 x
    Use data up to a blank character or end of subfield.
    ISBN
    020 a
    Use data up to a blank character or end of subfield.
    CODEN
    030 a
    Use data up to a blank character or end of subfield.
    SICI
       
    Metadata Format
    If there is a 502 -> dissertation
    Else based on Resource type from display:
    • book -> book
    • journal -> journal
    • article -> journal
    • conference_proceeding & record has ISSN -> journal
    Else -> book
     
    Genre
    The Genre mapping table maps the resource type from the display section of the PNX to the genre that is required by the OpenURL.
    Use Genre mapping table.
    RISType
    Based on Resource type from display:
    If there is a 502 then -> THES
    book -> BOOK
    journal -> JOUR
    map -> MAP
    video -> VIDEO
    audio -> SOUND
    music -> MUSIC
    article -> JOUR
    Else -> GEN
     
    City of Publication
    260 a
     
    Publisher
    260 b
     
    Abstract
    520 ab
     
    Miscellaneous1
       
    Miscellaneous2
       
    Miscellaneous3
       
    OCLC ID
    035 $$a – if text (OCoLC) is present in 035.
    Take all digits following the text OCLC and until space.
    Example:
    035 $$a(OC0LC)814782
    LCCN
    010 $$a
    Take prefix and number.
    DOI
       
    URL
       
    Local fields 1-5
       

    Browse

    The system can create multiple occurrences in separate fields.
    Browse
    Browse field Source Additional normalization notes
    Institution
    PNX: delivery/institution
     
    Author
    All of the following:
    100,110,111,700, 710, 711, 720, 800, 810, 811, and equivalent 880 fields
    $$D (display form) and $$E (normalized form) are created.
    Title
    All of the following:
    130, 210, 240, 243, 245, 246, 247, 440, 490, 730, 740, 830
    And the following using $$t:
    100,110,111, 700, 719, 711,800, 810, 811
    And equivalent fields from 880.
    $$D (display form) and $$E (normalized form) are created.
    Subject
    600, 610, 611, 630, 648, 650, 651, 654, 655
    $$D (display form) and $$E (normalized form) are created.
    Call number
    Rules not added.
     

    Normalization Routines for Duplicate Record Detection

    Certain characters are translated in XML:
    Normalization Routines for Dedup Record Detection
    Special character Special meaning Entity encoding
    >
    Begins a tag.
    >
    <
    Ends a tag.
    <
    Quotation mark.
    "
     
     
    'Apostrophe.
    '
    &
    Ampersand.
    &
    The publishing platform removes all leading and trailing spaces and packs double spaces.

    Normalization Routine #1

    1. Remove non-filing characters
      Drop initial text using non-filing indicator. The non-filing indicator is the second indicator in the following MARC tags: 222, 240, 242, 243, 245, 440, and 830. The second indicator contains a number from 0-9 indicating how many characters to drop. (There are some fields where the non-filing indicator is in the first position: 130, 630, 730, and 740.)
      Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089.
    2. Delete the following characters: '
    3. Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`
    4. Convert characters using the "FILING-KEY-01' character conversion table.
    5. Change characters to lower case.
    6. Remove all spaces.
    7. Take first 10 and last 10 characters.

    Normalization Routine #2

    1. Remove non-filing characters.
      Drop initial text using non-filing indicator. The non-filing indicator is the second indicator in the following MARC tags: 222, 240, 242, 243, 245, 440, and 830. The second indicator contains a number from 0-9 indicating how many characters to drop. (There are some fields where the non-filing indicator is in the first position: 130, 630, 730, and 740.)
    2. Remove all text that appears within <<>> or within the Unicode characters 0088 and 0089.
      For example:
      <datafield ind1="1" ind2="0" tag="245"> <subfield code="a"><<the>> book : its history in England in the middle ages!</the></subfield> </datafield>
      Should become:
      "book: its history in England in the middle ages"
    3. Delete the following characters: '
    4. Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`
    5. Convert characters using the "FILING-KEY-01' character conversion table.
    6. Change characters to lower case.

    Normalization Routine #3

    1. Delete the following characters: '
    2. Change the following characters to blank: !@#$%^&*()_+-={}}[]:";<>?,./~`
    3. Convert characters using the "FILING-KEY-01' character conversion table.
    4. Change characters to lower case.
    Normalization Routines for FRBR
    The publishing platform will delete leading and trailing blanks and remove double spaces.

    Author Part Normalization

    1. Delete characters: | [ ] '
    2. Change characters to space: $~'^%*/\?@.:;<>{}}-()"!¿¡,
    3. Convert characters using the NACO_diacritics character conversion table.
    4. Change characters to lower case.

    Title Part Normalization

    1. Remove non-filing characters.
      Drop initial text using non-filing indicator. The non-filing indicator is the second indicator in the following MARC tags: 222, 240, 242, 243, 245, 440, and 830. The second indicator contains a number from 0-9 indicating how many characters to drop. (There are some fields where the non-filing indicator is in the first position: 130, 630, 730, 740.)
    2. Delete characters: | [ ] '
    3. Change characters to space: $~'^%*/\?@.:;<>{}}-()"!¿¡
    4. Convert characters using the NACO_diacritics character conversion table.
    5. Change characters to lower case.

    Defining the Delivery Category Algorithm

    The following out-of-the-box algorithm is used for MARC 21. It should be possible to distinguish between the following resource types:
    • Physical items (except for microfolm)
    • Microform
    • SFX resources
    • Online resources
    The algorithm is read from top to bottom. Once a record is assigned a category, the algorithm stops.
    When there are several definitions for the same category the priority is given to the "safest" option.
    In the algorithm, priority has been given to online resources based on the assumption that users most often prefer this option. Primo will include a display of the location and availability status of physical items.
    The format is based on the definitions used for each resource type. For more information on these definitions, see LDR Positions.
    Defining the Delivery Category Algorithm
    Condition Delivery Category Note
    If 035=SFX
    SFX Resources
     
    007/00=c and 007/01=r
    Online Resource
     
    If there is a 8564- or 85640 or 85641
    Online Resource
    Add conditions based on $$3 to prevent this category from being assigned if the link is not to the resource (e.g. $$3 is Table of Contents, or Abstract).
    If 007/00=h
    Microform
     
    If FMT=BK or MU or SE or MX and 008/23=a or b or c
    Microform
     
    If FMT=MP VM and 008/29=a or b or c
    Microform
     
    If 245 $$h includes the string micro
    Microform
     
    If not any of the above
    Physical Item
     
    • Was this article helpful?