Nodalities

From Semantic Web to Web of Data
Nodalities

Subscribe

  • Any Podcatcher
  • Any Feed Reader

Categories

Archives

License

Creative Commons License

Archive for the 'Uncategorized' Category

Semantic Yellow Pages…

I was sent a link late last week to a white paper over on ZDNet by some Finnish researchers looking into the possibilities of opening up the services offered by Yellow Pages to the Semantic Web. Basically, the Finns looked at the traditional Yellow Pages service (specifically in the Helsinki area, but also more broadly), and found it lacking in several key ways.

The basic idea behind Yellow Pages is to provide a directory service for the general public, and monetise the advertisements of businesses wanting more prominence in the directory. The way current services work (online, that is) leaves much to be desired when it comes to matching the requirements of both users and businesses. Businesses miss out whenever a potential customer fails to notice their ad (because they found a different listing on a different page), and the user loses whenever their search turns up wrong or incomplete results. The reasons behind this mis-match is essentially technical: search and directory tables don’t provide the flexibility required to match users and businesses together whenever linguistic (homonyms, synonyms or hyponymic…) problems occur or when a service simply fails to produce the best case for the request.

I suggest reading the paper for the more technological perspectives (you will need to join ZDNet.co.uk, though I believe it’s free of charge), but the possibilities unearthed could potentially benefit both users of Yellow Pages services and the Semantic Web community itself.

A person looking for a business or seeking an answer to a problem in his area could benefit from the fact that his Yellow Pages results will become much more focused. If he looks for a “camera shop” because his is broken, he’s much more likely to find a shop running a repair service, for example. Businesses behind the service would also benefit from their advertisements being more well-placed to attract people actually looking for them.

However, a significant benefit from this system, is that the data made available through such a service could be reused by the Semantic Web itself. Imagine developing an application and being able to tie directly into a massive, international business directory? I’m sure your imagination (if you happen to be an imaginative developer) could be enlivened by this idea.

I’d be very interested to hear what you have to say about this kind of service, and what you’d build using this kind of data. Let me know where you’d take it, and what you’d like to see from such a directory.

Zemanta gets personal

I just found out that Zemanta has been updated. I had a bit of a mini-review and a discussion with Jure from Zemanta on my blog in June, and they’ve been ironing out some of the difficult bits since. As a very brief intro, Zemanta is a plugin which suggests related items while you blog. It suggests images and related articles to include in the new post, and handles citations for the images you might include.

The updates are about personalisation. It has added suggestions from your own flickr feed, and from “my friends” (from facebook, twitter and mybloglog) so the recommendations you get are now more relevant to you. I’ve installed the update, and will be having a play with the new service at some point, but it doesn’t work on the Nodalities Blog, so I can’t show you any of its features at the moment.

It’s certainly interesting to see these applications being brought to a usable level, and it’s also cool to see the personal-focus being introduced. Andraz Tori, Zemanta’s CTO, wrote about semantic applications in Nodalities Magazine. There are things the Semantic Web community are working towards, but there are also applicable technologies being developed right now

WWWF

This week saw the launch of the World Wide Web Foundation with a speech by Sir Tim Berners Lee introducing what it will be doing, and what it’s for. The points put forward in the speech seemed centred around three bullets from Berners-Lee’s talk:

  • to advance One Web that is free and open,
  • to expand the Web’s capability and robustness,
  • and to extend the Web’s benefits to all people on the planet.

The Foundation’s strapline is “Humanity Connected”, and seems to be based out of concerns from the way the Web is now.
My first reaction to this was the moderately paradoxical: “Cool! Why?” And a long look at the Foundation’s FAQ’s have brought out more questions.

The Cool part comes—apart from the support from TBL—from the idea that an organisation bringing together top minds from web, business, public sector and educational spaces to focus on the connectivity of mankind could produce some of the most profound, interesting, and useful research ever entered into. One of the FAQ’s sums this up:

Why did this Foundation emerge from W3C and WSRI and why now?
Those organizations have contributed significantly to the vision of the Web as humanity connected, but still more is required to include more people, in particular in underserved communities. The Foundation seeks to extend the benefits of a Web, improved by further research and technology development to all people.

The “Why?” comes from the mission it will involve. Is this a venture in trying to control the web itself (is that ominous?)? While they’re looking into security and other areas of personal and corporate trouble with the Web, are they also going to be looking at ways to eliminate this themselves, or is this more of a “watchdog”?

As you can see, there are many, many questions raised by this new foundation launch, but as with anything involving the W3C I’m sure there will also be a lot of information coming our way soon!

Semantic Web and Startups

Following both DEMO and TC50 the past week, there has certainly been no shortage of startup energy about the blogosphere. TC50’s winner, Yammer, has made an appearance in just about every techblog, some with mixed reviews. Essentially a Twitter set up for the enterprise (it’s a locked-down version of micro-blogging) it feels, instinctively, like a step away from open, linked data and a vibrant web-space. I don’t know that much about it, however, so I’d be happy for someone to explain its benefits to the enterprise and the web itself.

Much more exciting to me, was the launch of fotonauts at TC50 (No, I wasn’t there, but there is a brilliant summary here). The premise is to manage photographs in a much more linked way. The service can take information from Google Maps, Wikipedia and other sources and mash them up with your photos—or, apparently, photos from other usable sources like flickr. These photos are then published with all the extracted metadata to a website without you having to do any of the scraping yourself. I’ve signed up for a beta, but it’s not open yet, so watch this space! (And, fotonauts, if you’re watching: “fill this space!”)

At Demofall, (again online, rather than physically at the conference), I noticed the appearance of Data Essence.  Data Essence, from what I’ve read, is looking to accurately match investors with potential opportunities through semantic web analysis of websites. Although they seem to be aiming in a Semantic Web direction, they’re not hanging about waiting for a revolution. One quote from Amnon Mishor—Data Essence CEO and co-founder—I thought particularly interesting was:

“Clever semantic tagging cannot stand for itself,” Misher said. “There is a need for sophisticated algorithms that utilize the semantic data to intelligently hand pick information tailored to match the users’ interests and profile. Without this ability, any natural language  solution or semantic database will become a not so useful utility, and will not really solve the problem of information overflow.”

hmmm… That sounds like a conversation in itself. What algorithms are necessary, and what applications do you see being/needing developing NEXT. What are the blockers? I’d be particularly interested to hear what other startups and news events you’ve seen lately. Also, what you think of what these Startups mean for Semantic Web technologist (both established and emerging).

Right, I’m running out of HotSpot credit, and I’m loath to pay a well-known mobile broadband supplier any more, so I’ll end there. Over-arching message of this post? Lots of startups, some of them catching media attention (and I don’t know why?), others seem to be sleeping potential giant-killers. Lots of “calls to arms”… what do you see?

Ubiquity

ubiquityA couple days ago, Michael Hausenblas suggested I look at something called Ubiquity, and sent me a link. Because it came in the middle of editing the current version of Nodalities Magazine, I did what I often do with interesting concepts: I opened a new firefox tab and left it there for two days, hoping I would notice it before firefox crashed with all tabs on board. Well, since then, there has certainly been a lot of discussion about Ubiquity—both around the office and on the web. To introduce it, I can’t do much worse than pointing at their video and introduction page, and just say that it’s a Mozilla labs project and a Firefox plug-in.

However, what makes it interesting to me, is that it possibly introduces a new metaphor for interacting with web content—and, vicariously, linked data. The thought process behind it is that whenever we want to “do something” online, we are generally forced into round-about processes. Say, for example, I want to email a friend to tell him about a new restaurant I went to, maybe even invite him to meet me there for lunch. To accomplish this task, I’d typically open up three or four tabs in Firefox, and maybe open ICal and Mail application windows too: I’d google the restaurant, find its phone number and address from yell.com; map its address using maps.google.com or similar; I’d check which date I’m free; and finally email him the info, copying and pasting links and map images between multiple tabs, and—if I’m not using gmail’s web interface—into other applications as well. If you followed that last sentence, you’re doing well: it’s long, and complex (technically complex-compound, but we won’t get pedantic here), and it reflects the process.

Although it’s a beta, and many of its functions are very much less-than-polished, it offers a glimpse of a possible interaction future, with drastically more simple processes to complete tasks. What it creates is the ability to interact with content more directly, so you can select some content and start telling the application to DO stuff TO the content, by typing. So, i can select a physical address and type: “map this” into Ubiquity, and it’ll pull up google maps for that address (at the moment, it’s having trouble with some UK addresses because it’s using google.com and therefore not contextualising through the .co.uk which works better for addresses here). I can then use that information on the same screen. I can “yell florists in birmingham” and have a list of flower vendors in Birmingham from Yell.com (yellow pages service), which I can then drop into an email or whatever.

Very quickly, I ran into a conceptual problem with Ubiquity’s idea of natural-language interaction, however. Their strapline is: “An experiment into connecting the Web with language.” The idea being that you can “tell” the computer to “do something with/to this information” or “command” for something to happen, changing the basic interaction metaphor from a visual click/drag/drop/open-window process to a linguistic “I’m telling the computer what I want and it happens” framework. My immediate reaction was: “This isn’t linguistic, it’s command-line”, and was instantly transported to trying to learn Linux without a technical background, with all the frustration of a non-technical user trying to interact with software using a command-line.

You see, from my perspective as a linguist, I often feel frustrated with the computing community’s view of what language actually is. Without exploring propositionality, conceptual metaphor framework or anything else, it’s sufficient to say that language is both simpler and more complex than anything we’ve got software to emulate yet. What Ubiquity actually is, is a very simplified command-line which is “aware” of the information you’re already interacting with. From that perspective, it seems to work very well, with a more streamlined set of commands and more “natural language” feel to the words you actually type.

The upshot of this is that users have to learn a set of commands to interact with their applications, but that these commands are intended to be transparent in meaning. So, you “map this” or “help” or “add 1PM lunch with Dave”. After reading some of the reasoning behind this from one of the designers, Aza Raskin, I started to appreciate it more and more. The current contrasting model to this “Linguistic Command Line” is menus and windows. Menus and windows are inefficient, if you think about it. You have to select text, or images or whatever and physically move your curser to a menu somewhere in the extreme side of a window on your screen, finding and selecting the command from a drop-down list from which you need to remember the path to each command. The problem accelerates when you incorporate windows and applications into this. So if I were to incorporate some text from one window into another, linking to the original, and maybe dropping in a customised image too; I’d have to open multiple windows, executing menu commands or application-specific keyboard “short-cuts” at each stage.

But, I already know what I want to do with the stuff, right? Why not just activate a single keyboard shortcut and begin typing your instructions to the system: send link to <email>. Ubiquity allows this. In this framework, Firefox becomes a bit of a microcosm of the operating system (with tabs being windows, and sites and web-apps being desktop applications). As you type, it short-lists commands, so you don’t even type the full thing: typing “t r a n” ends up with the translate, so you can skip it and begin typing “to eng”, and it will offer you “translate text to English”.

Now, imagine having this ability with any form of Linked Data? Imagine if that bit of text were automatically recognised as a date, or co-ordinates, or person. Imagine selecting a picture of a restaurant and typing: “invite fred for lunch at 3PM on monday, enter”. The system could automatically know that the picture was of a restaurant (whose profile could include co-ordinates, contact info, and even a hypothetical automatic table-reservation system for invites from the web), that fred is your colleague (whose FoaF profile includes email or instant messenger preferences),  that lunch is an email subject and a social event, and that 3PM on Monday is a date (in your calendar and in Fred’s calendar once the message is sent) which corresponds with your name + su. All of that information is being used in several processes (Copy/paste, lookup restaurant profile, map location, lookup email or IM, create iCal event, create email or IM message, send)  but all you’re really doing is : “inviting fred for lunch at 3PM on Monday.”

This is incredibly intriguing, because it begins to show how some systems can begin to scale up to the immensity of the Web. We, as people, know what we want to accomplish, and if we could just tell our computers that, we’d be much happier. I think this could be a first step, and while I’m not completely convinced with the command-line metaphor, I can see this as a definite step, and a different perspective. My new copy of  Aza’s father’s book the Humane Interface, arrived this morning to supplement this, and I’ll be blogging more about that, if it’s ever returned to my desk.

Person Michael Hausenblas

Right click for SmartMenu shortcuts

TWSW delayed, don’t panic!

Sorry, I’ve had a mass of real-world distractions. Will try and get This Week’s Semantic Web out tomorrow.

Linked Data Planet - Free Conference Passes

A significant slice of the Linked Data/Semantic Web community will be gathering next week in New York City for the first Linked Data Planet conference. The programme is packed full of talks from experts in the field, including keynotes from Tim Berners-Lee, Kingsley Idehen and our very own Ian Davis. As sponsors of the conference we have a small number of free passes to give away. The passes give access to the full conference (not just the exhibition hall and keynotes), but do not include travel, accommodation or meals etc - these will need to be covered by the recipients of the free passes. So, if you can make it to NYC next week and would like to have a chance of winning a free pass to this groundbreaking event, just email Paul Miller (firstname.surname at talis.com) with the words “Linked Data Planet passes” in the subject line. The free passes will be given out on a first-come, first-served basis. Good luck, and hope to see you in New York next week!

Empathic Web

karstadt_connection.jpgLast week I listened to a talk by Shane Hipps, a Porsche “consumer anthropologist” turned Mennonite minister. The speaker, clearly aware of the contradictory nature of his background, made a very interesting observation about the digital age. He essentially said that we are, as a society, experiencing a shift of great magnitude in history which reflects one of the greatest changes humanity has ever experienced: literacy.

Moving from an oral tradition to a literate society—in which letters allow people to commit their thoughts to memory—fundamentally changed the way society and individuals thought. It freed individuals to think on their own without having to commit their ideas to the collective memory of their tribe. It also changed the ability of the social groups to sense the emotional state of its individuals, because they could now exist in an abstract, individual mindset.

So, agrarian societies relied on the community to remember and structure their ideas. The result of this community conceptual framework was an empathic connection between members of the community. Writing, on the other hand, allows individuals to remove themselves from the older framework and commit their thoughts to paper resulting in a loss or shift in the empathy of the culture or society. In a digital age, however, there is a new shift: one of removal to connection at a distance.

This concept: empathy at a distance or a digitally-connected community, made me consider the connections in the Semantic Web. The in’s and out’s of the SemWeb have been argued, discussed, debated, and explored technologically. Many blogs and sites have huge amounts of content devoted to the definitions of SPARQL and RDF. Abstractions have been published discussing the applications of this new technology. Sir Tim Berners-Lee refers to the Semantic Web as ‘The Web done right.’

But, what is being done right? Is the Semantic Web the Web done technologically right? Is it an upgrade to the existing framework or a patch to fix what was wrong? Maybe. But it makes me wonder about looking at this from a sociological or communicative perspective. The Semantic Web, technologically, is important to humanity only so far as it’s a medium for our connections.

So, when we make new semantic connections, and the software is increasingly able to associate us with concepts, people, items and communities (like academic institutions or or organisations); what is actually happening? People are making connections, and committing them not only to their own memories but to a community.

Publishing, you might argue, has been around since not that long after the first scribblings of meaning. But, publishing is one-way and narrow. A message or idea is only committed to the memories and added to the mental repertories of those who actually read the message. The same is true in a digital age (with multi-channels for ‘reading’ such as podcasts and video also) but the distinct difference is the access to concepts and the ease of utilising or ‘consuming’ the material. Firstly, digital goods are infinitely (in practice if not in absolute purist terms) copyable. There is no limit to the number of times you can copy and distribute an electronic text or a podcast so society does not have to wait for an idea to filter through because your dad hasn’t finished their Times crossword. Secondly, the connections made digitally (and more semantically-enriched) are increasingly collaborative. With software doing the heavy lifting in terms of data mining and content distribution, more ideas get to more people in more accessible ways.

Finally, although the Semantic Web is far from complete in application, the glimmers it allows us to see could have huge sociological implications. It’s the human element of the Semantic Web which makes it so exciting and so potentially disruptive. It’s possible that people, finding and synthesising ideas before feeding back their individual perspectives into the community, are increasingly able to connect with people and concepts in a more empathic way; without losing the abstract and logical abilities of the literate age.

Is this a new age? Undoubtedly. What will it look like? I’d say: like you and me—people.

cc.jpg

Image: “karstadt connection” by rastafabi from Flickr—Creative Commons

This Week’s Semantic Web…is on holiday

Well, not exactly. The arrival of conference season has meant 1. there’s been a lot happening in this space and 2. I’ve been traveling (and chatting in the lobby) way too much to be able to sit down and compile this thing. So apologies for the hiatus, but normal service will be resumed as soon as possible. I am thinking hard on how to better automate the info gathering process, without having to write too much code…suggestions welcome. In the meantime, the most significant events in this space generally crop up on Planet RDF, the Semantic Web Interest Group’s IRC Scratchpad and/or the W3C Semantic Web mailing list, and there’s also now Paul’s Semantic Web blog on ZDNet (though as he’s had more or less the same schedule, don’t expect too much activity over there right now either).

First impressions of “Semantic Web for the Working Ontologist”

Semantic Web for the Working Ontologist

Semantic Web for the Working Ontologist Effective Modeling in RDFS and OWL is a new book authored by Dean Allemang and Jim Hendler. I can offer a quick summary by teasing apart the title. For starters, the Semantic Web it discusses is generally in line with the current consensus view of the developer community, though with a lean in the direction of the ‘O’ word. The emphasis in the book is very much on Working and Modeling. It is practically oriented, and while it covers most of the technologies associated with the Semantic Web, its focus is on how to describe things using RDF, RDFS and OWL.

There’s a serious shortage of approachable books in the Semantic Web space - if you check the ESW Wiki list, there are only a handful that aren’t heavy duty academic works. Aside from the issue of convincing publishers there’s a market for such material (a problem that’s no doubt evaporating), there’s the difficult problem of what to write about. In the 2003 book Practical RDF, Shelley Powers used the parable of the Blind Men and an Elephant to suggest how RDF has many different aspects and can mean different things to different people - and RDF is just one Semantic Web technology (though arguably the most important). What’s more the elephant changes over time and is lavishly decorated: while the core standards solidified in 2004, since then we’ve seen various auxiliary specifications come along: the SPARQL query language, Turtle/N3 syntax, RDFa, GRDDL and so on. Ideas on best practices have also developed considerably over the years. This book is scoped to modeling with RDF, RDFS and OWL, and covers that ground admirably.

Allemang and Hendler are known experts, well-versed in the subject matter, but what’s more they have spent considerable time teaching courses on the Semantic Web, and this experience shows. The writing is clear and the book’s full of well-illustrated examples, along with a short but very handy FAQ at the end. The practical side is hinted at in their decision to devote significant space to the SKOS Simple Knowledge Organization System and FOAF Friend of a Friend vocabularies. The syntax used throughout is N3/Turtle, which makes a refreshing change from the eyestrain of RDF/XML.

There aren’t any programming (as in running code) examples, and the coverage of things like HTTP and the use of these technologies on the Web is really confined to illustrated prose. I must admit I was disappointed by the limited coverage of SPARQL, I do think this has relevance to modeling decisions. Given the rise of Linked Data in the wild, I would also have expected maybe a chapter devoted specifically to this approach (the ideas are all there in the text, but they don’t jump out).

On the other hand the coverage of reasoning with Semantic Web languages is excellent, material that can be very hard to get a handle on is here presented in an easily digestable form. Similarly the fundamental theory is explained in simple terms without recourse to arcane notation, and common misconceptions around the Semantic Web are disposed of without malice.

Contents

  1. What is the Semantic Web?
  2. Semantic Modeling
  3. RDF – the Basis of the Semantic Web
  4. Semantic Web Application Architecture
  5. RDF and Inferencing
  6. RDF Schema Language
  7. RDFS-Plus
  8. Using RDFS-Plus in the Wild
  9. Basic OWL
  10. Counting and Sets in OWL
  11. Using OWL in the Wild
  12. Good and Bad Modeling Practices
  13. OWL Levels and Logic
  14. Conclusions
  15. Frequently Asked Questions

RDFS-Plus is RDFS with the addition of some handy bits of OWL (IFPs etc).

In conclusion, this is an approachable book for anyone with interest in the field, and gives excellent coverage of the Semantic side of the Semantic Web, as it pertains to modeling the real world. With the caveat that this is the scope of this book, I’d personally strongly recommend it. I do intend to read this book cover to cover thoroughly, it is insightful writing, and as an occasional OWL user I’ll be keeping it on hand for the recipes.

See also: Henry’s [p]review