Will Linked Data mean an early end for Marc & RDA
For the uninitiated, NGC4LIB is a library focused mailing list which has a reputation for often engaging in massive discussions and disagreements around the minutiae of future cataloguing and library focused metadata practices. They have recently been involved in one of these great debates stimulated by the comments of Sir Tim Berners-Lee in a recent interview. As is often is the case on this list, the debate wandered well off topic in to the realms of FRBR and it’s alternatives before being brought back on topic by Jim Weinheimer, who started the conversation in the first place.
A statement in Jim’s contribution caught my eye:
Implementing linked data, although it would be great, is years and years away from any kind of practical implementation
Implementing linked data is already well underway with many groups across the Globe. For instance there are couple that we at Talis are closely involved with. Following on from Sir Tim’s interview comments, the British Government are currently running a, soon to be opened, closed beta of data.gov.uk. Through this site they are not only opening up data in many forms such as CSV, like their American cousins at data.gov, but they are also starting to encode in RDF and publishing it via the Talis Platform which provides a SPARQL (the query language of the Linked Data web) end point. This approach not only lets anyone download the raw data, but also enables them to query it for whatever they have in mind. If you want a sneak preview of how such data is queried, take a look at some of theses examples. In a similar vein, metadata from BBC programmes and music is being harvested in to Talis Platform stores. Again these are open to anyone to innovate with – check out these screencasts to see some of the early possibilities.
Ah but that is not bibliographic data, I hear someone cry – It’ll never catch on in libraries. I get the impression from some comments on the NGC4LIB list, that it will not be possible for ‘our’ data to participate in this Link Data web until ‘we’ have predicted all possible uses for it, analysed them, and developed a metadata standard to cope with every eventuality. There are already a few examples of the library world engaging with RDF and Linked data, one obvious one being the Library of Congress with LCSH another the National Library of Sweden. Neither of these examples are encoding the kind of detail you would expect in a Marc record, they are using ontology to describe associated concepts such as subjects.
There has been some ontology development towards this larger goal with Bibo (Bibliographic Ontology Specification). Although not there yet, Bibo is good enough to be used in live applications whishing to encode bibliographic data. Such an example is Talis Aspire. Underpinned by the same Platform as the UK Government and BBC Linked Data services, it uses the Bibo ontology to describe resources an an academic context.
Alongside data.gov.uk there is a Google Group conversation taking place. The refreshing part of this conversation is that it is between the producers of the data sets, those developing the way it should be encoded in to RDF, and those who want to consume it. Several times you will see a difference of opinion between those that want to describe the data to it’s fullest, and those that wish to extract the most value from it. “I agree that is a cleaner way of encoding, but can you imagine how complex the query will be to extract what I want!”. This approach is not unusual in the Linked Data world, where producers and consumers get together, pragmatically evolving a way forward. Dataincubator.org is an open place where such pragmatic development and evolution is taking place. Check out examples of a subset of Open Library data. (note this is an example of data, not a user interface).
Another, bibliographic focused, experiment can be found at semanticlibrary.org. From some of the example links on the home page, you can see that building in this way enables very different ways of exploring metadata. People, subjects, publishers, works, editions, series, all being equally valid starting points to explore from.
Doth the bell toll for Marc and RDA?
Not for a long old time – Ontology like Bibo, and the results of work at Dataincubator.org and semanticlibrary.org, may well lead to more open useful, and most importantly linked, access to data previously limited to library search interfaces. That data has to come from somewhere though, and the massive global network of libraries encoding their data using Marc ,and maybe soon RDA, are ideally placed to continue producing rich bibliographic metadata. Metadata to be fed in to Linked Data web in the most appropriate form for that purpose. There will continue to be a place for current cataloguing practices and processes for a significant period -supporting and enabling the bibliographic part of the Linked Data web, not being replaced by it.
No doubt the NGC4LIB conversation on this topic will continue. Regardless of how it progresses, there is a current need and desire for bibliographic data in the linked data web. The people behind that desire, and the innovation to satisfy it, may well have come up with a satisfactory solution, for them, whilst we are still talking.





October 26th, 2009 at 11:43 am
I definitely think the Bibliontology work is showing a huge amount of promise, and it looks a good place to focus effort in terms of moving bibliographic data representations as linked data forward.
It’s probably worth noting that this isn’t always a ‘either/or’ choice. The approach taken by Linked Data/Semantic Web allows for the mixing of vocabularies where appropriate. In a recent discussion about the ‘Place of Publication’ on the BIBO (Bibliontology) group at http://groups.google.com/group/bibliographic-ontology-specification-group/browse_thread/thread/cbd6dedcdbda537b/6a623f9849e2b396?hl=en&lnk=gst&q=rda#6a623f9849e2b396, it is suggested that it may be appropriate to use the RDA:placeOfPublication element (http://metadataregistry.org/schemaprop/show/id/89.html)
December 9th, 2009 at 10:57 am
I just saw this post and want to thank you for showing some great examples of sites using linked data. But I would like to clarify where you quoted me when I stated: “Implementing linked data, although it would be great, is years and years away from any kind of practical implementation.” I guess I should have written that it is years and years away from any kind of practical implementation *by libraries.*
Implementation at such a level simply involves too much change in a field that has always been traditionally conservative. Practically our only way of sharing information is still using ISO2709 records using Z39.50, which we were doing in the 1970s! There is also the concern of genuinely sharing library-created records and information in ways where librarians are not in 100% control. This is an argument that flared up recently with OCLC claiming ownership over library records, and they backed down, although I have not heard of any final decision as yet.
So yes, implementing linked data is being done, and technically speaking, could be done rather soon in the library world as well, but at least I think it simply involves too many changes for librarians and its associated bureaucracy to adopt anytime soon.
December 9th, 2009 at 11:54 am
Hi James,
If we are talking about librarians changing the practices to create bib metadata in a Linked Data (RDF / Bibliontology) way, I think you are correct.
Continuing to catalogue in Marc, but then export via transformations in to a Linked Data SPARQL queryable form, may well be a faster way forward to produce ‘good enough’ data for the wider web.
December 10th, 2009 at 9:43 pm
Karen Coyle has written a Library Technology Report for ALA TechSource, to be published in January 2010, “Understanding the Semantic Web: Bibliographic Data and Metadata. Building on ideas from her presentations and articles, she explains the concepts and possibilities for bibliographic data in a linked data environment.