A I always (usually...sometimes...rarely) try to do I will be commenting on the sessions at the 2009 Open Repositories Conference.
Jeremy Frumkin (Arizona) - Global Registries Initiative is a registry of collections of web resources and related services. The mantra of the project is "register locally discover globally". Rather than letting Google do it, or use LinkedData as the registry, they decided to build their own to accommodate specific metadata needs and requirements, such as accessibility via OAI-PMH or other appropriate standards. Vic Lyte talked about the IESR (Information Environment Service Registry), which is a registry of UK resources and services, accessible via web search, web widget, OpenSearch, RSS, OAI-PMH, SRU/W, OpenURL and yes, you know it, Z39.50. I like the multiple access tools/protocols - why can't library systems/databases do the same? Chris Blackall (Australian National Data Service) talked about their effort, which uses the ISO 2146 Registry Services standards and is called ANDS Collection Services Registry. Jeremy also showed a demo of the LibraryFind system doing a federated search across registries - all 3 mentioned here. There is no doubt that this is a step in the right direction, especially since many of these would be inaccessible (dark web) to the standard engines like Google. Now all we have to do is make them even more accessible by moving them into a semantic weby context.
Simeon Warner (Cornell), gave a session called Author Identifiers in Scholarly Repositories which reflects arXiv's approach to this issue. The need for author IDs is familiar to anyone who has looked for papers by a specific author in Google or even the standard databases. The key is to get help from the source (ie. repository), but there are challenges, such as the act that IRs have an institutional boundary and subject repositories have a domain boundary - often research crosses boundaries. Simeon raised some of the challenges with creating an ID repository, including privacy, accuracy, longevity, openness, etc. There are a number of existing author ID services (one of the best is RePEc, which is for economists), including domain specific, national, etc. arXiv's approach is to get authors to claim papers at some point in time by providing useful services around it. One thing they could do with this is provide a list of publications for that author with a CSS to present it in a specific style. This is very similar to what we want to do with our IslandScholar project, providing a widget that lists author pubs and even reformats for SSHRC or NSERC style, for example. They also have built a Facebook application that is fed by the arXiv author ID. A great example of what can be done with the linked data.
Matt Zumwalt (MediaShelf) talked about rapid application development for Fedora repositories. Matt gave an example (live) of building a couple of Ruby-Fedora apps, using the Jewish Women's Archive and a couple of others as examples. The challenge is to bridge the two worlds of agile development vs the "enterprise" style approach which typically has a plethora of big and complex standards and applications. "Eyes on the Horizon (big goals), Feet on the Ground (iterative development)" is one mantra. Matt also related big system design to the DeathStar, which was used before it was finished...and then it blew up. A nice analogy to what can happen with non-agile, big systems :-) Another nice quote re standards: "When they serve you, make standards the assumed, thus hidden convention. When they're harmful, say so." All in all a great talk about how to balance enterprise and agile approaches and what each brings to the table.