Monday, August 15, 2011

Additions to the MARC Country and Geographic Area Code Lists

de Südsudan en Southern Sudan ru Южный СуданImage via WikipediaAs the result of the split of Republic of the Sudan (or conventional short form: Sudan) into Sudan and South Sudan in July 2011, new country and geographic area codes are being defined for use in MARC records.

The coding for Republic of the Sudan will remain the current coding for Sudan: country code sj and GAC f-sj.
  1. MARC country code changes

    The new country code is:
    • sd - South Sudan
    South Sudan was previously coded sj for Sudan before August 2011.
  2. MARC geographic area code changes

    The new geographic area codes is:
    • f-sd - South Sudan
    South Sudan was previously coded f-sj for Sudan before August 2011.

Wednesday, August 10, 2011

New Vocabulary Data Added to LC Authorities and Vocabularies Service

A logo of the Unites States Library of Congres...Image via WikipediaLC has announced that new vocabulary data has been added to the LC Authorities and Vocabularies Service.
The Library of Congress is pleased to make available additional vocabularies from its Authorities and Vocabularies web service (ID.LOC.GOV), which provides access to Library of Congress standards and vocabularies as Linked Data. The new dataset is:
  • Library of Congress Name Authority File (LC/NAF)
In addition, the service has been enhanced to provide separate access to the following datasets which have been a part of the LCSH dataset access:
  • Library of Congress Genre/Form Terms
  • Library of Congress Children's Headings
The LC/NAF data are published in RDF using the MADS/RDF and SKOS/RDF vocabularies, as are the other datasets. Individual concepts are accessible at the ID.LOC.GOV web service via a web browser interface or programmatically via content-negotiation. The vocabulary data are available for bulk download in MADS and SKOS RDF (the Name file and main LCSH file will be available by Friday, August 12).

Please explore it for yourself at http://id.loc.gov.

The new datasets join the term and code lists already available through the service:
  • Library of Congress Subject Headings (LCSH)
  • Thesaurus of Graphic Materials
  • MARC Code List for Relators
  • MARC Code List for Countries (which reference their equivalent ISO 3166 codes)
  • MARC Code List for Geographic Areas
  • MARC Code List for Languages (which have been cross referenced with ISO 639-1, 639-2, and 639-5, where appropriate)
  • PREMIS vocabularies for Cryptographic Hash Functions, Preservation Events, and Preservation Level Roles

Tuesday, August 09, 2011

Unshelved

Today's Unshelved has cataloging references.

In other comic news, Rachael Rising by Terry Moore is off to a great start. Very creepy.

Linking to LCSH and LCC

Linking to LCSH and LCC: Controlled Subject Headings and Classification Systems through the Web Barbara Tillett, Libby Dechman, and Loche McLean will be presented on August 16, 2011, at the World Library and Information Congress : 77th IFLA General Conference and Assembly in San Juan.
The Library of Congress is making its controlled vocabularies and classification system freely accessible on the Web. This presentation describes our services related to two of those controlled vocabularies and classification schemes: the Library of Congress Subject Headings and Library of Congress Classification. It also describes the work to link various language versions of LCSH together.

The initial structure for this service uses SKOS, Simple Knowledge Organization System, which “Provides a model for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, folksonomies, and other similar types of controlled vocabulary.

Additionally we are exploring RDF XML as another structure to use for presenting this data. The service can be found at http://id.loc.gov and the LCSH/SKOS is at http://id.loc.gov/authorities.
Thanks to Gary Price for bringing this to my attention.

OCLC Documentation

Online Computer Library CenterImage via WikipediaOCLC's Technical Bulletin 260, “OCLC-MARC Format Update 2011,” is now available.

This Technical Bulletin covers all of the Library of Congress’s MARC 21 Bibliographic, Authority, and Holdings Formats Update No. 12, dated October 2010, elements from other recent MARC 21 Updates whose implementations had been postponed, code list additions and changes published chiefly since May 2010, and other suggestions from WorldCat users and OCLC staff. Many of these elements, including those from MARC 21 Update No. 12, are related to Resource Description and Access (RDA).

New IFLA Publications

Basic Group 1 entities and relations of the FR...Image via WikipediaVarious reports have been posted on the IFLA web site.

ISBD Review Group activities report and meeting reports

ISBD/XML Study Group activities report

A new version of Full ISBD Examples

FRBR Review Group activities report

Tuesday, August 02, 2011

Updated Cover Sheets for MARC Proposals and Discussion Papers

From LC's Network Development and MARC Standards Office.
The cover sheets for the proposals and discussion paper presented at the 2011 Annual meetings of the MARC Advisory Committee have been updated with the results of the discussions. They are available at:

Proposal 2011-02: RDA Production, Publication, Distribution and Manufacture Statements in the MARC 21 Bibliographic Format
(http://www.loc.gov/marc/marbi/2011/2011-02.html)

Proposal 2011-03: Encoding Date of Copyright Notice in the MARC 21 Bibliographic Format
(http://www.loc.gov/marc/marbi/2011/2011-03.html)

Proposal 2011-04: Adapting Field 377 (Associated Language) for Language of Expression in the Authority and Bibliographic Format
(http://www.loc.gov/marc/marbi/2011/2011-04.html)

Proposal 2011-05: Broadening field 373 (Affiliation) for Associated institutions in the MARC 21 Authority Format
(http://www.loc.gov/marc/marbi/2011/2011-05.html)

Proposal 2011-06: RDA Fuller Form of Personal Name Attribute in the MARC 21 Authority Format
(http://www.loc.gov/marc/marbi/2011/2011-06.html)

Proposal 2011-07: Additional Corporate Body Attributes for RDA in the MARC 21 Authority Format
(http://www.loc.gov/marc/marbi/2011/2011-07.html)

Proposal 2011-08: Treatment of Controlled Lists of Terms for Carrier Characteristics in RDA in the MARC 21 Bibliographic Format
(http://www.loc.gov/marc/marbi/2011/2011-08.html)

Proposal 2011-09: Identifying the Source of Thematic Index Numbers in Field 383 in the MARC 21 Authority and Bibliographic Formats
(http://www.loc.gov/marc/marbi/2011/2011-09.html)

Proposal 2011-10: Geographic Codes in Classification Records
(http://www.loc.gov/marc/marbi/2011/2011-10.html)

Proposal 2011-11: Addition of 1st Indicator Value 7 (Other edition specified in $2) in DDC Number Fields in the MARC 21 Bibliographic, Authority and Community Information Formats
(http://www.loc.gov/marc/marbi/2011/2011-11.html)

Discussion Paper 2011-DP05: Additional Means of Identifying Medium of Performance in the MARC21 Bibliographic and Authority Formats
(http://www.loc.gov/marc/marbi/2011/2011-dp05.html)

HotForWords

The latest video podcast by HotForWords is What is going to happen to the libraries? by Marina Orlova. She sees the demise of libraries looming. It is a combination of hearsay and personal experience that needs some correcting. Consider leaving a comment, she has a large following.

First RDA Vocabularies Published

Seen everywhere.
The Joint Steering Committee for Development of RDA (JSC), the DCMI/RDA Task Group (http://dublincore.org/dcmirdataskgroup/), and ALA Publishing (on behalf of the co-publishers of RDA) are pleased to announce that the first group of RDA controlled vocabularies have been reviewed, approved, and their status in the Open Metadata Registry (OMR) changed to ‘published.’

This status change, from ‘new-proposed’ to ‘published’ signals that the final steps have begun in reviewing the work of the DCMI/RDA Task Group and ensuring that the RDA vocabularies (both elements and controlled vocabularies/concepts) are available in a stable form for the builders of applications. Additional reviews of the controlled vocabularies are in progress, with the parties involved expecting to finalize that group before starting on the element vocabularies. Both groups of vocabularies should be complete by the end of 2011, but developers and others should expect to see rolling announcements as reviews are completed.

Alan Danskin, Chair of the Joint Steering Committee, noted, “The RDA vocabularies are a fundamental component of RDA, promoting consistent description and discovery of bibliographic resources. The Committee is committed to publishing and maintaining the content of the RDA vocabularies, synchronized with the text of RDA, in order to support their use by the resource description community and by developers of Semantic Web applications.” JSC is grateful to the Task Group members and to the Open Metadata Registry for making possible the publication of RDA vocabularies as linked open data.

Gordon Dunsire, co-Chair of the DCMI/RDA Task Group, said: “This achievement is a significant waypoint on the collaborative journey initiated by the London meeting in 2007 between representatives of RDA and Semantic Web communities developing and maintaining metadata models covering bibliographic resources. We have learned much about each other along the way, and look forward to continuing to provide a bridge which supports our mutual interests to the benefit of all users of information. The RDA vocabularies represent many decades of library experience and practice which is now shared with the rest of the world.”

Troy Linker, Publisher, ALA Digital Reference, said: "The publishing of the RDA vocabularies in an open registry is an important step forward in making RDA accessible to the international library community and system vendors, and, significantly, beyond the library community to the rest of the Semantic Web. Working with the JSC, the Co-Publishers for RDA are committed to ensuring synchronicity between RDA Toolkit and the registry".

The finished vocabularies can be viewed using the links below. (The links lead to the description of the vocabulary itself, the specific terms can be viewed under the tab for ‘concepts’).
All the RDA vocabularies can be viewed in the OMR by using this page: http://metadataregistry.org/rdabrowse.htm. Those interested in following the work of review and publication of the vocabularies can subscribe to the Registry RSS feeds linked from that page. Questions on the OMR can be conveyed using the ‘Feedback’ link on each Registry page.

Monday, August 01, 2011

No More Star Wars Films?

A logo of the Unites States Library of Congres...Image via WikipediaThe heading Star War films, and all other headings dealing with a series or character might be removed from the lcgft.
The recent separation of the genre/form thesaurus from LCSH provides an opportunity to reexamine the genre/form terms approved during the moving image experiment in order to determine whether they fit into the overall thesaurus, given the trajectory of development.

One ongoing issue has been what may be loosely referred to as “character- and franchise-based terms.” These are authorized terms that include either a character name or the title of a film or television program (e.g., Die Hard films; Dracula television programs; Scooby-Doo television programs; Star Wars films). PSD proposes cancelling all of these terms from the genre/form thesaurus, and has posted a discussion paper explaining the rationale on the Library of Congress’ genre/form web page, http://www.loc.gov/catdir/cpso/genreformgeneral.html. The direct URL for the paper is http://www.loc.gov/catdir/cpso/character_franchise_disposition.pdf. PSD requests input from interested parties before making a final decision.

Go To Hellman

Eric Hellman has been writing some posts I find very interesting over on Go To Hellman.
I think it's important to convert this level of interest into action. The question is, what can be done now to get closer to the vision of ubiquitous interoperable data? My last three posts have explored what libraries might do to better position their presence in search engines and in social networks using schema.org vocabulary and Open Graph Protocol. In these applications, library data enables users to do very specific things on the web- find a library page in a search engine or "Like" a library page in a Facebook. But there's so much more that could be done with the data.

Thursday, July 28, 2011

Tematres 1.4

We have the glad to invite to test the beta version of Tematres 1.4

TemaTres is a web tool to manage, publish and exploit controlled vocabularies and other formals representation of knowledge (thesauri, taxonomies, glossaries, etc).

This release includes the following fixes and improvements:
  • Quality indicators about controlled vocabularies The Quality assurance was improved with reports about the following quality indicators:
  • Free Terms
  • Terms without hierarchical relationships
  • Average number of words per term
  • Terms per N Broader terms
  • Terms per N narrower terms and depth
  • Terms words with not supported prefixes or suffixes
For more details: MARTÍNEZ, A.M.a et al. Concepto, forma y longitud de los términos preferentes del tesauro: una propuesta de indicadores de calidad. Anales de Documentación, 2010, vol. 13, p. 185-195. http://revistas.um.es/analesdoc/article/view/107151

MARTÍNEZ, A.M.a et al. Indicadores para evaluar el vocabulario y la estructura sistemática de un tesauro. I Jornada de Intercambio y Reflexión acerca de la Investigación en Bibliotecología, La Plata, 6-7 de diciembre de 2010. Facultad de Humanidades y Ciencias de la Educación de la Universidad Nacional de La Plata. http://www.jornadabibliotecologia.fahce.unlp.edu.ar/jornada-2010/martine
  • User-defined notes
    Has been added management capabilities to manage and create user-defined notes. (Thanks to Observatorio Estatal de la Discapacidad Spain: http://www.observatoriodeladiscapacidad.es/)
  • Advanced configuration options
    Has been added detailed configuration options available to the administrator of the controlled vocabulary.
  • Import controlled vocabularies
    Now TemaTres can import controlled vocabulary from plain tagged text.
For example:
  • IMS VDEX Scheme (Vocabulary Definition and Exchange) Now with TemaTres you can Display, export and publish terms and controlled vocabularies through VDEX IMS XML schema (Vocabulary Definition and Exchange). http://www.imsglobal.org/vdex/
  • Controlled vocabularies RESTful Services TemaTres have support for web services accessible through a clear and simple syntax. The service support a wide variety of queries and data can be viewed in XML, JSON or SKOS-Core.
Examples
Status:
http://www.vocabularyserver.com/gemet/en/api/

Terms beginning with the letter B:
http://www.vocabularyserver.com/gemet/en/api/letter/b
http://www.vocabularyserver.com/gemet/en/api/letter/b/skos
http://www.vocabularyserver.com/gemet/en/api/letter/b/json

Search Terms:
http://www.vocabularyserver.com/gemet/en/api/search/fish/
http://www.vocabularyserver.com/gemet/en/api/search/fish/json
http://www.vocabularyserver.com/gemet/en/api/search/fish/skos

Vocabulary Data
http://www.vocabularyserver.com/gemet/pt/api/fetchVocabularyData

Top of vocabulary terms
http://www.vocabularyserver.com/gemet/pt/api/fetchTopTerm
  • Minor bugs was solved and was added some minor features
Seen on the Code4Lib email.

Monday, July 25, 2011

Corporate Names

A new paper from HP discusses the problems and an automated solution to distinguishing corporate names. Company Names Matching in the Large Patents Dataset by Timofey Medvedev and Alexander Ulanov, HP Laboratories, HPL-2011-90R1.
This paper addresses the name matching (duplicate detection) problem in the US patent dataset. It contains more then 400K unique company names spellings. In order to solve the matching problem we choose appropriate string similarity measure and clustering approach and estimate their parameters. Finally we apply them to the whole dataset and estimate the positives and negatives rates.

Friday, July 22, 2011

Additions to Source Codes for Vocabularies, Rules, and Schemes

Codes for the following sources were announced June 14, 2011. The codes for these sources have been changed.

The codes should not be used in exchange records until 60 days after the date of this notice to provide implementers time to include newly-defined codes in any validation tables.

Classification Scheme Source Codes

The following source codes have been added to the Classification Scheme Source Codes list for usage in appropriate fields and elements.

Additions:
ubtkl/2 (formerly klasskj)
Klassifikasjonsskjema (Trondheim: Fellesbiblioteket, Kongelige Norske videnskabers selskab, Museet, Norges laererhogskole)
Subject Heading and Term Source Codes

The following sources code have been added to the Subject Heading and Term Source Codes list for usage in appropriate fields and elements.

Additions:
noubomn (formerly nobomn)
University of Oslo Library Thesaurus of Science (Realfagstermer) (Norway: Universitetsbiblioteket i Oslo)
noubojur (formerly noubojor)
Thesaurus of Law (Norway: Universitetsbiblioteket i Oslo)

Tuesday, July 05, 2011

Genre/Form Headings Webinar

OCLC has made available the recording of the webinar held on May 12, 2011 entitled: Genre/Form Headings. This one hour webinar covers the definition of genre/form headings, different sources of genre/form terms, searching in Connexion (authority and bibliographic), and tagging/coding.

Friday, July 01, 2011

SCATNews

The June issue of SCATNews, the newsletter of the IFLA Cataloguing Section, is now available.

Tuesday, June 28, 2011

W3C Library Linked Data Incubator Group Call for Comment

The W3C Library Linked Data Incubator Group has been chartered from May 2010 through August 2011 to prepare a series of reports on the existing and potential use of Linked Data technology for publishing library data. The group is currently preparing:
  • A report which consists of
    • "Benefits"
    • "Vocabularies and Datasets"
    • "Relevant Technologies"
    • "Implementation challenges"
    • "Recommendations"
  • Use Cases, a survey report describing existing projects

  • Vocabularies and Datasets", a survey report
We (LLD XG) invite comments from interested members of the public. Feedback can sent as comments to individual sections posted on our dedicated blog or by email to a public mailing list (public-lld@w3.org, archived at http://lists.w3.org/Archives/Public/public-lld/ ) using descriptive subject lines such as '[COMMENTS] "Benefits" section'

Comments will be especially welcome in the next four weeks (through 22 July). Reviewers should note that as with Wikipedia, the text may be revised and corrected by its editors in response to comments at any time, but that earlier versions of a document may be viewed by clicking on the History tab.

It is anticipated that the three reports will be published in final form by 31 August.

Wednesday, June 22, 2011

Information Standards Quarterly

The Spring 2011 issue of Information Standards Quarterly (ISQ) is now available online in PDF format and is open access. Both the full issue and the individual articles can be downloaded.

This special edition of ISQ presents Views of the E-book Renaissance. Guest content editor, October Ivins of Ivins eContent Solutions has pulled together a broad range of perspectives on what is happening today with e-books and particularly with e-book standards. As she states in her introductory letter: “Our goal for this issue of ISQ is to present an overview of the status of e-books from multiple perspectives—publishers and other content producers, librarians, and the many vendors who support their creation, management, sales, and distribution. Not coincidentally, it also illustrates the scope of the NISO community.”

In our first feature article, Bill Kasdorf (Apex Content Solutions) provides an update on EPUB 3, the new generation of the EPUB specification just issued by the IDPF, and likens it to opening a Pandora’s box, but where “all the creatures bursting out can be made to behave in a civilized way.” This is followed with answers by Marlie Wasserman (Rutgers University Press) to 10 questions on the state of e-book publishing for university presses.

For our in practice section Mollie Pharo and Marcia Learned Au describe the public library experience with e-books from their perspective over the last decade at the Evansville Vanderburgh Public Library. Wendy Allen Sherburne (University of Illinois) provides an opinion piece on drinking the e-book kool-aid in an academic library setting.

Mark Bide (EDItEUR) has written our standards spotlight feature where he discusses the challenge for standards in the e-book supply chain. Michael Gorrel (EBSCO Publishing) provides our member spotlight where he shares with us his company’s plans for merging the recently acquired NetLibrary e-books with the EBSCOhost platform.

In NISO reports, Matt Garrish and Markus Gylling (DAISY Consortium) reveal the evolution of accessible publishing that occurred with the revision of the Z39.86 DAISY standard. Todd Carpenter follows with an announcement of a new NISO Ebook Special Interest Group that is in the process of formation.

As always, our issue concludes with Noteworthy news items such as JSTOR’s foray into e-books, ProQuest’s acquisition of Ebrary, the Project Muse and UPeC partnership to offer e-books, the trial use issuance of the Journal Article Tag Suite standard (Z39.96), and several others. And check our State of the Standards table to see the status of all of NISO’s in development projects for standards or recommended practices.

You can find the complete Table of Contents to the Spring issue of ISQ, with links to the articles and PDF downloads here: http://www.niso.org/publications/isq/2011/v23no2/

--NISO email announcement.

Awards

This past year I had the pleasure of serving on the Texas Library Association's Award Committee. I have two observations after the experience.

First, there are a lot of very talented, hard-working, innovative people in our profession. And not just catalogers, but school librarians, youth, children's, reference, managers all doing amazing things. Librarians really are a special group of people.

Second, that nowhere near enough of us apply for awards. If my experience was typical there should have been many more submissions than we received. We did ask and ask again. One notice I sent out to the cataloging and special library community had almost 400 click-throughs and yet zero submissions. It does take some time to fill in the application but to recognize a special employee or group of employees should be worth the time. Even if they don't win, seeing that application and knowing their work was appreciated is worth something. Having the committee read just how wonderful your staff and library is also has value. You never know when good PR will benefit your institution.

So, next time you see an announcement for an award think about who you know that deserves one and toss their name in the hat.

Tuesday, June 21, 2011

Identifying Entity Types in MARC Records

The report on the RDA/MARC Working Group's discussion of Identifying Entity Types in MARC records is now available.
The RDA/MARC Working Group had discussions of the many points that were brought out in the MARBI discussion of 2011-DP03, Identifying Work, Expression, and Manifestation records in the MARC 21 Bibliographic, Authority, and Holdings formats. They found that while there was some sentiment for defining such a field, there was disagreement in the group and in the community about how the values in the field should be defined. It was pointed out in the discussion that there were various reasons for establishing this marker: to be able to validate that a record for a certain level contained the appropriate fields for that level; to correctly organize displays; and for applications that do not yet exist.