ITALica

Monday, March 2, 2009

CatQC and Shelf-Ready Material: Speeding Collections to Users While Preserving Data Quality, by Michael Jay, et al

Abstract:
Libraries contract with vendors to provide shelf-ready material, but is it really shelf-ready? It arrives with all the physical processing needed for immediate shelving, then lingers in back offices while staff conduct itemby-item checks against the catalog. CatQC, a console application for Microsoft Windows developed at the University of Florida, builds on OCLC services to get material to the shelves and into the hands of users without delay and without sacrificing data quality. Using standard C programming, CatQC identifies problems in MARC record files, often applying complex conditionals, and generates easy-to-use reports that do not require manual item review.

LaneConnex: An Integrated Biomedical Digital Library Interface, Debra S. Ketchell, et al

Abstract:
This paper describes one approach to creating a search application that unlocks heterogeneous content stores and incorporates integrative functionality of Web search engines. LaneConnex is a search interface that identifies journals, books, databases, calculators, bioinformatics tools, help information, and search hits from more than three hundred full-text heterogeneous clinical and bioresearch sources. The user interface is a simple query box. Results are ranked by relevance with options for filtering by content type or expanding to the next most likely set. The system is built using component-oriented programming design. The underlying architecture is built on Apache Cocoon, Java Servlets, XML/XSLT, SQL, and JavaScript. The system has proven reliable in production, reduced user time spent finding information on the site, and maximized the institutional investment in licensed resources.

A Semantic Model of Selective Dissemination of Information for Digital Libraries, by J. M. Morales-del-Castillo, et al

Abstract:
In this paper we present the theoretical and methodological foundations for the development of a multi-agent Selective Dissemination of Information (SDI) service model that applies Semantic Web technologies for specialized digital libraries. These technologies make possible achieving more efficient information management, improving agent–user communication processes, and facilitating accurate access to relevant resources. Other tools used are fuzzy linguistic modelling techniques (which make possible easing the interaction between users and system) and natural language processing (NLP) techniques for semiautomatic thesaurus generation. Also, RSS feeds are used as “current awareness bulletins” to generate personalized bibliographic alerts.

Classification of Library Resources by Subject on the Library Website, by Mathew J. Miles and Scott J. Bergstrom

Abstract:
The number of labels used to organize resources by subject varies greatly among library websites. Some librarians choose very short lists of labels while others choose much longer lists. We conducted a study with 120 students and staff to try to answer the following question: What is the effect of the number of labels in a list on response time to research questions? What we found is that response time increases gradually as the number of the items in the list grow until the list size reaches approximately fifty items. At that point, response time increases significantly. No association between response time and relevance was found.

One Law with Two Outcomes: Comparing the Implementation of CIPA in Public Libraries and Schools, by Paul T. Jaeger and Zheng Yan

Abstract:
Though the Children’s Internet Protection Act (CIPA) established requirements for both public libraries and public schools to adopt filters on all of their computers when they receive certain federal funding, it has not attracted a great amount of research into the effects on libraries and schools and the users of these social institutions. This paper explores the implications of CIPA in terms of its effects on public libraries and public schools, individually and in tandem. Drawing from both library and education research, the paper examines the legal background and basis of CIPA, the current state of Internet access and levels of filtering in public libraries and public schools, the perceived value of CIPA, the perceived consequences of CIPA, the differences in levels of implementation of CIPA in public libraries and public schools, and the reasons for those dramatic differences. After an analysis of these issues within the greater policy context, the paper suggests research questions to help provide more data about the challenges and questions revealed in this analysis.

Monday, October 20, 2008

Update to Monitoring Script

When we last updated the software for our system highlighted in the
Sept. 2008 issue, we noticed that our monitoring script had some issues.
This was related to a regular expression found in the script. I'll spare
the details and just provide a link the new updated script:

http://lance.mckendree.edu/wireless/wirelessScript_v2.txt

jim

Monday, October 13, 2008

MyLibrary is about creating relationships

MyLibrary is more about creating relationships than it is about personalization and customization.

When MyLibrary was made available in 1997/8 the application was all about creating "my pages". Since then it has matured and evolved. First and foremost, it is not a turn-key application like it was previously. Instead, it is more like a digital library framework and toolbox akin to Fedora, just written in Perl. Second, with the advent of object-oriented programming techniques, MyLibrary is about four types of things: 1) resources, 2) patrons, 3) librarians, and 4) facet/term combinations. The first three things are are well-known to us librarians. The fourth is not.

Using facet/term combinations it is possible to create strong relationships between resources, patrons, and librarians. By "cataloging" resources, patrons, and librarians with facet/term combinations it is possible to address things like:

The resources I curate are...
My patrons are...
The resources recommended for me include...
This resource is like that resource...
My librarian is...
People like me include...

Moreover, facet/term combinations are not just about subjects and formats. They can also be about audiences, resource tools, true/false values, academic classes, dates, genres, etc. By "cataloging" resources, patrons, and librarians with these sorts of facet/term combinations it is possible to create relationships between them in a much fuller way than what we are doing now.

Monday, October 6, 2008

Update on PDF accessibility

Dear readers,

from the submitting of the original work for the article, many things have happened in the publishing world. Mainly XML is spreading its domain more and more in many areas, one of them the accessibility fields, of course.

For PDF three remarkable things could change the landscape:

- The inminent approval of PDF/UA standard as an addendum to ISO 32000-2 [1]

- The Adobe Mars project and its downloadable plugin to convert back and forward PDF documents to XML documents => This could open many possibilities to include PDF in XML workflows [2]

- The CWA 15778 report [3], on document processing for accessibility, showing many scenarios in which PDF is included in an accessible multichannel publishing workflow

Adobe is working hard to enhance its flagship format and Adobe 9 includes the optimization of file size for PDF files, a long desired option.

Still, authoring tools fall short in their capacity to process PDF and XML for accessibility. Adobe InDesign is pushing hard against QuarkXPress for its better support to XML.

Other formats in the accessibility world have take their place: DAISY and its component DTBook [4], with free conversion tools from Microsoft Word and Open Office, is also pushing hard.

Will PDF maintain its niche position? the discussion is open.

Mireia Ribera
http://bd.ub.es/pub/ribera

[1] http://pdf.editme.com/PDFUA
[2] http://labs.adobe.com/technologies/mars/
[3] http://www.cen.eu/CENORM/BusinessDomains/BusinessDomains/ISSS/activity/ws-dpa.asp
[4] http://www.daisy.org/

Monday, August 18, 2008

Repurposing "Born MARC" metadata

Just wanted to repeat the question that I posed in my original comment but under its own header:

What opportunities and challenges can we anticipate as we start working with (e.g. manipulating, searching, augmenting, cleaning up, transforming) MARC metadata from an ILS (or from another MARC-based system) outside of that system?

How can we ensure that reusing MARC data is a worthwhile undertaking, and convince those outside the library world of this, if necessary?

Tuesday, July 8, 2008

Metadata to Support Next-Generation Library Resource Discovery: Lessons from the eXtensible Catalog, Phase 1 by Jennifer Bowen

Abstract:
The eXtensible Catalog (XC) Project at the University of Rochester will design and develop a set of open-source applications to provide libraries with an alternative way to reveal their collections to library users. The goals and functional requirements developed for XC reveal generalizable needs for metadata to support a next-generation discovery system. The strategies that the XC Project Team and XC Partner Institutions will use to address these issues can contribute to an agenda for attention and action within the library community to ensure that library metadata will continue to support online resource discovery in the future.

LITA members may read the full text of this article at http://www.ala.org/ala/lita/litapublications/ital/272008/2702jun/bowen.cfm. Discussion of the article will be open from 18 August to 15 September 2008.