ExLibris

    Word indexing of 856 subfield u (URL) doesn't work

    • Article Type: General
    • Product: Aleph
    • Product Version: 16.02

    Description:
    We find that when we specify tab_word_breaking routine 35 in tab11_word for the 856 field:

    35 # del_subfield
    35 # abbreviation

    a subfield u, such as the following:

    $$uhttp://afraf.oxfordjournals.org.ezproxy.library.und.edu/

    cannot be found by searching on the term "ezproxy".

    Resolution:
    If you look at this subfield u in util f/1/28 ("Display Word Indexing for a Single Record") or util f/1/2 ("Display/Check Word Building Routines"), you will see that it is treated as one long word. It is therefore not possible to search on the string "ezproxy".

    In contrast, if you run util f/1/2 with "01" as the "Procedure Identifier", you will see that the $$u is broken into separate words, with "ezproxy" being one of them.

    The word indexing routines look for a space to indicate the end of one word and the beginning of another. This subfield u contains no spaces. Word breaking routine 01 has this line:

    01 # to_blank -!@#$%^()_={}[]:";<>,.?/|\

    This causes the periods ("."), along with the other listed characters such as ":" and "/", to be changed to spaces, so each component of the URL is indexed as a separate word.
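
    The effect of the to_blank line can be illustrated with a small Python sketch. This is not Aleph's implementation, only a simulation of the behavior described above: the function name break_words and its to_blank parameter are hypothetical, and the sketch assumes that word breaking amounts to replacing the listed characters with spaces and then splitting on whitespace.

```python
# Characters listed on the to_blank line of routine 01 in this article.
TO_BLANK_CHARS = '-!@#$%^()_={}[]:";<>,.?/|\\'


def break_words(text, to_blank=""):
    """Hypothetical model of word breaking: replace every character in
    to_blank with a space, then split the result on whitespace."""
    table = str.maketrans({ch: " " for ch in to_blank})
    return text.translate(table).split()


url = "http://afraf.oxfordjournals.org.ezproxy.library.und.edu/"

# Without a to_blank step (as with routine 35 above) the URL stays one word.
no_to_blank = break_words(url)

# With the to_blank characters of routine 01, each component becomes a word.
with_to_blank = break_words(url, TO_BLANK_CHARS)

print(no_to_blank)
print(with_to_blank)
```

    In this model the first call yields a single word containing the whole URL, while the second yields a list of words that includes "ezproxy", which matches what util f/1/2 shows for routines 35 and 01 respectively.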


    • Article last edited: 10/8/2013