|
Found @ Your Repository: Increasing Visibility of IR Content
 |
| Eric Van de Velde |
The number of institutional repositories worldwide has grown exponentially in the last three years. Much progress has been made on setting up and managing a repository but less on increasing the visibility of an institute’s research output. Today, being found is as important, if not more important, as being online. The more visible the content, the more it will be used and shared. Library Connect talked to Eric Van de Velde, Director of Library Information Technology at Caltech, to find out more about their institutional repository, Caltech CODA (
http://coda.library.caltech.edu).
LC: Can you tell us about Caltech’s institutional repository?
Eric Van de Velde: Our earliest repository goes back to 1998 when we put a collection of computer science technical reports online. The real kick-start was the Santa Fe meeting in October 1999, which established the Open Archives Initiative (www.openarchives.org). At the meeting, I was introduced to the Networked Digital Library of Theses and Dissertations (www.ndltd.org).
Upon returning from Santa Fe, we started experimenting with the NDLTD software, and the library started working with Professor Arden Albee, then our Dean of Graduate Studies, on a pilot project for electronic theses. In 2002, we introduced the requirement for Caltech theses to be submitted electronically. Currently our repository houses well over 2,000 theses and is the largest repository collection we have. Caltech produces about 200 theses per year. Our collection includes older theses we scanned. When starting this project, we didn’t intend to scan our archives, but after a flood damaged about 200 circulating theses, we scanned archival copies as a preventative measure. This project went so well that we started scanning older material as time permits.
After the Santa Fe meeting, Stevan Harnad started the EPrints initiative (www.eprints.org), and when the EPrints software became available about a year later, we started using it almost immediately. First, we moved the computer science technical reports. Then, we built a historical archive from the earthquake engineering research laboratory. As the service became more well-known, some faculty became interested in making their research papers, technical reports, books and even conference proceedings available online through the Institutional Repository. There’s also an interesting collection of Caltech oral histories, consisting of the archive department’s interviews with retiring professors and senior administrators.
LC: What’s the take-up of repository services at Caltech?
Van de Velde: We have to get to a situation where faculty consistently submit content to the repository so the library doesn’t have to do all the work. That’s a social change that still needs to happen.
LC: How are you increasing awareness of the repository?
Van de Velde: Lots of informal conversations! The librarians do more formal recruiting, but it’s at a low pitch at the moment. We’re making the service available, mentioning it wherever possible but not hitting people over the head with it. We’re always looking for ways to make the process as easy as possible. My guess is, students graduating now who have put their theses online and seen the response will be faculty in a couple of years, here and across the country. They will want the same interactivity for their research papers. I see this as an evolutionary process, not a revolution. It’s simple, but these changes take a while.
LC: What level of usage are you seeing on materials in the repository?
Van de Velde: Quite a bit actually. Via the library website, we provide usage statistics for all repository collections. I did some studies on thesis usage and discovered an average of six to seven unique accesses per thesis per year. It doesn’t sound like a lot, but compared to use in print or even microfilm, it’s a tremendous increase in visibility.
LC: Can you tell us a bit about the work you’ve been doing with Scirus to increase visibility of your IR’s content?
Van de Velde: The Open Archives Initiative is a three-layered system. First, you have the content providers, such as institutional and disciplinary repositories, but these could also include journals (open-access journals as well as conventional journals). Second is an intermediary layer, provided by the OAI initiaitve, giving a standard interface to the disparate repositories available. The vision of the Santa Fe meeting was that service providers would come along and use this standard intermediate layer to access data underneath. That’s what is starting to happen now with service providers like Scirus.
So, whenever a service provider comes along and wants to access our data we are happy to collaborate to make our collection visible. We want our content out there: the more service providers (i.e. the more entry points to our content), the better.
Our collaboration with Scirus has also provided us with one search interface for the disparate Caltech repositories. Until now you had to search each respository individually.
LC: Would you recommend other libraries explore similar collaborations?
Van de Velde: Yes, my big recommendation for academic libraries is to make their unique materials available through the Open Archives Initiative. Service providers can then make the material visible in a variety of contexts.
LC: What do you see coming down the line for institutional repositories?
Van de Velde: Well, this is certainly only the beginning. Right now the vast majority of theses are text files. The NDLTD recently announced awards for innovative multi-media theses. Once we are truly in a multi-media age, that will open up the realm of what a scientific publication really could be and what institutional repositories will really have to offer.
|