Below is a list of questions Backstage Library Works posed to David W.
Reser, Senior Cataloging Policy Specialist with the Library of Congress.
David is working with OCLC on the non-Latin in authority references project.
Backstage thought our clientele would be interested in this new development
at the Library of Congress. We invite you to respond with your thoughts
about these changes on this listserv or contact Backstage directly with any
questions or concerns.
Non-Latin Characters in Name Authority Records Questions
1. How will they be distributed, all at once or over a period of time?
ANSWER: For the pre-population to be done by OCLC, they will contribute X
number per day (number still being negotiated with the NACO nodes, but will
likely be 25-30K per day on top of the regular daily files). Can I assume
you subscribe to the weekly distribution file? If so, multiply that number
by about 7. We don't know the total number of records that will be
impacted, but hope that it will take only about a month to do the
pre-population, assuming we can run them all through quickly (LC will be
doing a version upgrade to its system in May, so if the pre-populated
records aren't all done by then, there will be a hiatus for the upgrade).
Once that is complete, regular NACO catalogers can begin adding/editing
records with non-Latin characters, so that will obviously be an ongoing way
of doing business.
2. Will all languages be included? If not, which will be included and
which will be excluded?
ANSWER: All languages that can be fully accommodated by one of the MARC-8
script repertoires (Arabic, Extended Arabic, CJK, Cyrillic, Extended
Cyrillic, Greek, and Hebrew) are possible for the first phase. We plan to
extend beyond the MARC-8 repertoire in a later phase, but no timelines have
been set for that yet.
3. What normalization scheme will you be using for these headings?
Currently NACO normalization is used and it is a scheme for MARC8 encoded
data. What normalization scheme are you planning to use with UTF8 data?
ANSWER: A revised NACO normalization scheme was approved recently, and is
posted at: http://www.loc.gov/catdir/pcc/archive/PCCNormalization_Final.pdf
(people smarter than I about such things assure me it covers the UTF-8
environment).
4. Will there be any special normalization rules followed for these
records?
ANSWER: Since the non-Latin forms will only appear in 4XXs, and since 4XXs
are allowed to conflict (except within the same record), we're not expecting
a major issue here, but we will receive reports from OCLC on those records
that are flagged as normalization errors, just as we do now.
5. Will you also populate the 670 tag with Non-Latin Character data?
ANSWER: The pre-population routines from OCLC will not generate 670
citations, but once NACO members begin adding data themselves after the
pre-population, we expect non-Latin characters in 3 note fields, 667, 670,
and 675 during the initial phase. We'll expand as/if we discover a need,
but thought it would be nice to have a conservative target to start with.
[Although our system doesn't care what fields non-Latin script data is used
in, OCLC will check and notify us if they encounter records outside of the
expected fields.]
6. Will you be making any changes to the bib records that you are
harvesting the data from?
ANSWER: If OCLC is planning to do anything to the bib records, I'm not aware
of it. We expect that the harvesting will kick up a lot of dirt and make it
more noticeable (e.g., typos, incorrect characters)-- I expect this will
cause catalogers to update bibs on an as-needed basis.
7. Will this include NAME/TITLE and Corporate Bodies as well?
ANSWER: We believe that OCLC's pre-population will cover personal names and
corporate bodies tagged as X10s, but don't have a final analysis yet as to
what exactly is covered. Once NACO members can begin adding non-Latin data
after the pre-population, all name authority records are candidates,
including geographic, titles, and name/titles.
8. Are there any plans for Subjects?
ANSWER: We don't have plans to add non-Latin data to LCSH authorities at
this time, but will probably re-evaluate this position from time to time.
You are probably aware that we already distribute MARC Classification
records with non-Latin scripts.
Hope this helps, talk to you Wednesday at 9:00 am eastern.
Dave
John Reese
Product Manager
Backstage Library Works
Voice (800) 391-5210 Ext. 249
Fax (801) 356-8220
Email jreese(a)bslw.com