    Z39_server bottleneck between Aleph and OCLC

    • Product: Aleph
    • Product Version: 20, 21, 22, 23
    • Relevant for Installation Type: Dedicated-Direct, Direct, Local, Total Care

     

    Description

    Queries submitted from WorldCat Local (WCL), which come in directly to port 9991, are timing out with no OPAC data received. When that happens, the query does not appear in the z39.50 log at all, so the problem seems to be on the incoming side. Everything is fine when there is little activity (e.g. Saturday and Sunday), but the problem started again today, Monday.

    Our IT staff report that they can see a few blocked requests on the firewall:
    Sep 8 15:16:31 libprod1 ipmon[454]: [ID 702911 local0.warning] 15:16:31.327347 bge219000 @11016:2 b 132.174.100.234,58625 -> 132.216.30.61,9991 PR tcp len 20 40 -AR IN
    Sep 10 00:20:08 libprod1 ipmon[454]: [ID 702911 local0.warning] 00:20:08.666420 bge219000 @11016:2 b 132.174.100.234,20196 -> 132.216.30.61,9991 PR tcp len 20 40 -AR IN
    Sep 10 00:23:08 libprod1 ipmon[454]: [ID 702911 local0.warning] 00:23:08.413673 bge219000 @11016:2 b 132.174.100.234,20394 -> 132.216.30.61,9991 PR tcp len 20 40 -AR IN
    Sep 10 00:24:13 libprod1 ipmon[454]: [ID 702911 local0.warning] 00:24:13.192987 bge219000 @11016:2 b 132.174.100.234,44390 -> 132.216.30.61,9991 PR tcp len 20 40 -AR IN
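
To judge how often the firewall is dropping traffic to the Z39.50 port, entries like those above can be tallied per source address. The following is a minimal sketch, not part of the original article: the function name `count_blocked` and the regular expression are assumptions based only on the line layout quoted here, and it treats the `b` field after the rule number as the "blocked" action.

```python
import re
from collections import Counter

# Matches the part of an ipmon line that looks like:
#   @11016:2 b 132.174.100.234,58625 -> 132.216.30.61,9991 PR tcp
# The layout is inferred from the quoted log lines only (assumption).
IPMON_RE = re.compile(
    r"@\d+:\d+\s+(?P<action>\S+)\s+"
    r"(?P<src_ip>[\d.]+),(?P<src_port>\d+)\s+->\s+"
    r"(?P<dst_ip>[\d.]+),(?P<dst_port>\d+)\s+PR\s+(?P<proto>\w+)"
)


def count_blocked(lines, port=9991):
    """Count blocked packets per source IP for the given destination port."""
    counts = Counter()
    for line in lines:
        m = IPMON_RE.search(line)
        if m and m.group("action") == "b" and int(m.group("dst_port")) == port:
            counts[m.group("src_ip")] += 1
    return counts


# Two of the lines quoted in this article, used as sample input.
SAMPLE = [
    "Sep 8 15:16:31 libprod1 ipmon[454]: [ID 702911 local0.warning] "
    "15:16:31.327347 bge219000 @11016:2 b 132.174.100.234,58625 -> "
    "132.216.30.61,9991 PR tcp len 20 40 -AR IN",
    "Sep 10 00:20:08 libprod1 ipmon[454]: [ID 702911 local0.warning] "
    "00:20:08.666420 bge219000 @11016:2 b 132.174.100.234,20196 -> "
    "132.216.30.61,9991 PR tcp len 20 40 -AR IN",
]

if __name__ == "__main__":
    for src, n in count_blocked(SAMPLE).most_common():
        print(src, n)
```

In practice the lines would be read from wherever ipmon output is logged on the server; that location is site-specific and not given in this article.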

     

    Resolution

    Our Unix administrator raised the TCP parameter tcp_conn_req_max_q from 128 to 4K. The value can be checked with:

       zlogin blink ndd -get /dev/tcp tcp_conn_req_max_q

    We think that the low value of this OS parameter was the cause of the congestion.
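
The article shows only the `ndd -get` form of the command. As a hedged sketch, the helper below builds both the read and the write form so the value can be checked before and after a change. The function names are hypothetical, the value 4096 is an assumed reading of "4K", and `blink` is taken to be the zone name passed to `zlogin`. Changes made with `ndd -set` are generally not preserved across a reboot, so a permanent change would need to be added to the system's startup configuration.

```python
import subprocess


def ndd_command(param, value=None, zone=None, driver="/dev/tcp"):
    """Build the argv list for reading (value=None) or setting an ndd parameter."""
    if value is None:
        cmd = ["ndd", "-get", driver, param]
    else:
        cmd = ["ndd", "-set", driver, param, str(value)]
    if zone is not None:
        # Run the command inside the named zone, as in the article's example.
        cmd = ["zlogin", zone] + cmd
    return cmd


def run(cmd):
    """Execute the command and return its output (requires root on the server)."""
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout.strip()


if __name__ == "__main__":
    # Print the commands rather than running them; 4096 is an assumed value for "4K".
    print(" ".join(ndd_command("tcp_conn_req_max_q", zone="blink")))
    print(" ".join(ndd_command("tcp_conn_req_max_q", 4096, zone="blink")))
```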

     

    Additional Information

    At our request, OCLC raised its Max sockets setting to 100, but it is unclear whether this helped resolve the problem.

    The z39_server logs showed thousands of processes being started and killed ("Server killing child pid: nnnnn"). These messages are still present even though performance is now much better and CPU usage is much lower, so by themselves they do not seem to indicate a problem.
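
To compare the volume of these messages before and after a change, the kill messages can be extracted from the log. This is a small sketch under assumptions: the function name `killed_child_pids` is hypothetical, the pattern relies only on the message text quoted above, and the sample lines use made-up pids for illustration.

```python
import re

# Matches the message text quoted in the article, followed by a numeric pid.
KILL_RE = re.compile(r"Server killing child pid:\s*(\d+)")


def killed_child_pids(lines):
    """Return the pids named in 'Server killing child pid' messages, in order."""
    pids = []
    for line in lines:
        m = KILL_RE.search(line)
        if m:
            pids.append(int(m.group(1)))
    return pids


# Illustrative input only: the pids and the unrelated line are invented.
SAMPLE = [
    "Server killing child pid: 12345",
    "some unrelated log line",
    "Server killing child pid: 12346",
]

if __name__ == "__main__":
    print(len(killed_child_pids(SAMPLE)), "children killed")
```

As the article notes, a high count alone is not evidence of a problem, so the figure is only useful alongside response times and CPU usage.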

     

     


    • Article last edited: 1-Sep-2017