Why the term “Linking Open Data” might be misleading
A lot of activities around Linking Open Data (“LOD”) and the associated data sets which are nicely visualised as a “cloud” are going on for quite a while now. It is exciting to see how the rather academic “Semantic Web” and all the work which is associated with this disruptive technology can be transformed now into real business use cases.
What I have obeyed in the last few months, especially in business communities, is the following:
- “Linked Data” sounds interesting for the business people because the phrase creates a lot of associations in a second or two; also the database crowd seems to be attracted by this web-based approach of data integration
- “Web of Data” is somehow misleading because many people think that this will be a new web which replaces something else. Same story with the “Semantic Web”
- “Linking Open Data” sounds dangerous to many companies
For insiders it is clear, that the “openness” of data, especially in commercial settings, can be controlled and has to be controlled in many cases. Which means, it can be one-way or mutual. In some use cases data from companies will be put into the cloud, and can be opened up for many purposes, in other use cases it will stay inside the boundaries. In other scenarios only (open) data from the web will be consumed and linked with corporate data, but no data will be exposed to the world (except the fact, that data was consumed by an entity).
And of course: In many other occasions datasets and repositories will be opened up partly depending on the CCs and the underlying privacy regulations one wants to use.
This makes clear that LOD / Linking Open Data is just one detail of a bigger picture. Since companies (and governments) play a crucial role to develop the whole infrastructure, we need to draw a new picture:

I´ll be happy to have a lively discussion about this topic also at the first Linked Data Camp in Vienna.
Linked Data: Standing on the shoulders of giants
When Mariano Consens explained at this year´s Triplify Challenge @ I-Semantics in Graz how he built together with Oktie Hassanzadeh the winning project “Linked Movie Database“, one important thing became clear to me: Linked Data isn´t just a playground anymore, no: it´s a very efficient way to build useful applications standing on the shoulders of giants. Congratulations to the winners!
Become a Web Expert!
The Semantic Web has evolved constantly over the last few years. Nevertheless, in many cases I have experienced a huge demand for profound knowledge in this area. Many potential end-users of semantic web technologies have quite a few ideas of how to apply semantic web, but still many software-projects will never happen, because of the lack of knowledge, because of the fear of getting trapped by too complex technologies. Obviously it´s not the technology anymore but the awareness and personal knowledge about the semantic web, which is the actual bottleneck for the semantic web getting real.
The Semantic Web Company (SWC) is offering in 12 daily seminars a training-course for persons who want to become a Web expert. Participants get step by step advanced in methods and technologies for semantic projects. Each seminar is a stand-alone-module and can also be booked singularly.
From 27th of Mai to 4th of July 2008 the SWC curriculum will take place. A special focus will be on applications and solutions of semantic technologies to support social processes.
The curriculum will provide profound insights into the topics Semantic Web and Social Software. Therefore the seminars are grouped into three comprehensive modules:
* “Next Step”: Social Web & Social Software
27. – 30. May 2008
* “Advanced Level”: Textmining & Enterprise Search
10. – 13. June 2008
* “Expert View”: Semantic Web & Metadata Management
01. – 4. July 2008
The modules will take place at the Austrian Computer Society in Vienna. The main language in the courses is German. English courses can be provided on demand.
3 Semantic Apps to Watch
As mentioned on Read/WriteWeb there are at least 10 (rather commercial) semantic web applications “around” which claim to use semantic web technologies for different purposes: “10 Semantic Apps to Watch”. (Besides this at least 100 prototypes from various research programms exist in this field).
My “short list” of those 10 apps consists of the following three:
To my opinion these 3 projects have the highest potential to become a “big player” in the next generation web. Instead of “improving” what Google does, they try to fulfill a totally new mission:
Twine isn´t organising the “knowledge of the whole world” (like Google would like to do) they rather focus on the users themselves: Using a semantic graph (including the social graph) for each user, information in a social network will flow in a more efficient way. Information will come to the users instead of searching around. Twine is a combination of many of the well known Web 2.0 applications like Facebook or del.icio.us but will use base technologies from the semantic web and will provide a SPARQL API and a REST API.
Possible Risks:
- It´s still not clear if people will accept personal semantic graphs rather as an advantage or rather as a possible danger for privacy
- Semantic Web database technologies (Triple stores) are still very young. Although some of the existing systems have already proved that they are scalable none of them have been used so far for really big systems.
USP:
Twine is the first company which will combine social tagging, social networking, natural language processing and semantic web on a professional level. So it has the potential to become a very popular service for many people to support their daily business. Sooner or later the same system might be offered also as a very attractive business solution. Nevertheless, twine hasn´t opened its portal for the public so far, so it´s still not clear if all the promises will be held…
Talis is a “domain-agnostic” technology platform which supports developers to build applications on the principles of “mass collaboration”. It is a new breed of a distributed programmatic interface heavily deploying all opportunities the Web of Data may offer. “DNS is used as a robust routing mechanism to connect requests with the closest data or service both for the native platform services, but also for third party data access services.“
Possible Risks:
Talis mission sounds great, and its success depends a lot on how this company will be able to build an ecosystem around its services. My forecast: Talis will be acquired in 2009 by one of the big web companies.
USP:
Talis tries to establish a new way of organizing information flows throughout the Web of Data. Since it relies on open standard protocols like RESTful Web Services a lot of applications will use Talis technologies. Talis as a company has a well founded background since it has been provided services for governmental organizations or libraries for the last 30 years. Some of the people working at Talis rank among the best semantic web thinkers.
(Clearforest was acquired by Reuters) was bought by Thomson. ClearForest’s technology automatically categorizes documents and structures entities contained inside text. The Semantic Web without text extraction algorithms which really work will never take place. And Clearforest really works. Just try it out!
Possible Risks:
Clearforest is well embedded in a giant: Thomson is the world´s largest media company. This is, of course a great opportunity to sell these new kind of semantic solutions to many of the global Top 5000. On the other hand, it might be a risk since “traditional” media companies still tend to forget about the long tail and open APIs.
USP:
Simply spoken, the USP of Clearforest is that the technology works and it can be integrated into existing architectures without being a semantic web expert. It can become one of the cornerstones of an integrated corporate semantic web architecture.
Visiting the cradle of human kind
I spent the last few days in Addis Ababa, Ethiopia. It wasn´t a holiday trip at all, no – it was my first business trip to Africa.
The project I am working for is the “Nile Basin Initiative” (NBI). This partnership was initiated and is led by the riparian states of the Nile River (like Egypt, Ethiopia or Sudan to name just a few). The goal of the NBI is to develop this big, big river in a cooperative manner: The promotion of regional peace and security and the provision of substantial socioeconomic benefits are the most important visions.
For my personal career it was always important what I am working for, so this project is a great challenge and a huge opportunity. Our task is to provide a Web based Information System which will be the knowledge base for all participating stakeholders in the next few years. On top of our system a Decision Support System (DSS) will be implemented, which will help to understand the river system behaviour better and to evaluate alternative development and management schemes.
We spent two days for our initial workshop to discuss the system specification and it turned out quickly that the specialists at NBI are knowledgeable people. We focused not only on “typical” tasks like document management or full-text search but also on a well structured metadata layer on top of standards like Dublin Core. At the end it also turned out that some semantic web methods will be either implemented at the first stage or it is desirable at least for the next development phase. So a moderated search will be there at system´s first stage and later on an associative semantic search will be implemented.
After work was done I spent one day for looking around in Addis Ababa. I visited “Lucy” – the first known individual of partially human type. She lived in Ethiopia 3.2 Million years ago. When I saw her in the museum I thought: If Ethiopia is the cradle of human kind why shouldn´t be our project the cradle for an African Semantic Web, which also helps that the countries in that region start to understand each other a bit better?
Yahoo Researcher Declares Semantic Web Dead – and reborn again…
When Mor Naaman from Yahoo said in a special track on Web 3.0 at WWW2007 that the “Semantic Web” is dead, he obviously tried to attract attention. Nevertheless, in my opinion he is absolutely right – there is no chance to “teach” people to annotate web content in a more sophisticated way than “social tagging” (and I´m pretty sure that also in the future it will always be a small community which will tag their content).
But in one point Mor Naaman missed the point: The “Semantic Web” was always there, under-cover more or less. Living in a tin with a lousy HTML-lid. And inside the tin there has always been enough semantics. There is no need to re-invent the data models, the namespaces, the ontologies (at least for most of the basic “things”) as Naaman proposes in his talk (slide 13). How easily all the existing semantics can be released and mapped against the “Semantic Web” (and suddenly it was born again
) is demonstrated by projects like [1] or [2].
RDF-Data is the Next Intel Inside…
…is the ultimative design pattern for the Semantic Web. In search for high quality data W3C provides a platform called “Linking Open Data on the Semantic Web“. Some basic datasets like geographic or encyclopedic information are already available. The more meshups from that datasets people will build the more data will be available, and so on…
Around two years ago virtually NO data for the semantic web was available, therefore no applications…
The RDF-Web has just started…
Reminder: Call for papers TRIPLE-I
TRIPLE-I
The Innovation Conference for Knowledge Management, New Media Technology and Semantic Technologies
5 – 7 September 2007
Graz, Austria
http://www.triple-i.info
The TRIPLE-I Conference series is a joint venture of the conferences
- I-KNOW – International Conference on Knowledge Management
- I-MEDIA – International Conference on New Media Technology
- I-SEMANTICS – International Conference on Semantic Technology
Regular paper submissions for I-KNOW may include but are not limited to:
- Identifying Relationships between different kinds of Knowledge Entities
- Service-oriented Architectures for Knowledge Management Systems
- Service-based Knowledge Management
- Orchestration of Knowledge Management Services
- Distributed Knowledge Management
- Social Network Analysis
- Knowledge Work Productivity
- Communication and Collaboration in Knowledge Management
- Context and Usage Pattern Identification and Management
- Agile Approaches to Knowledge Management
Regular paper submissions for I-MEDIA may include but are not limited
to:
- Web 2.0 Applications for Content Providers
- Business Models for New Media
- Social Media Platforms
- User-Generated Content
- Cross-Media Content Production and Delivery
- Cross-Media Search and Retrieval
- New Media Services
- Innovative User Interfaces for Media Devices
- The Geospatial Web
- Marketing of Products and Services via New Media
Regular paper submissions for I-SEMANTICS may include but are not limited to:
- Semantic Social Software
- Semantic Wikis and Weblogs
- Semantic Desktop
- Social Tagging and Folksonomies
- Ontology Engineering
- Terminology Management
- Visualisation of Semantic Models
- Interoperability, Integration and Reasoning
- Semantic Web Applications
- Experiences, Studies and Metrices
Deadlines
21 May 2007: Full paper submission (4-8 pages)
18 June 2007: Notification of acceptance
13 July 2007: Final version (8 pages)
5-7 September 2007: TRIPLE-I Conference
Semantic Search for the WWW2007
When I tried to figure out which tendencies, hot topics and trends at the upcoming WWW2007 will be discussed, I first looked up the official website of the conference. I thought to myself – why isn´t there a semantic search implemented for the WWW2007? But the weblog revealed: There is one!
After doing some research with this really well implemented tool, I´m really impressed: Not only by the possibilities the search engine offers ( also searching the archive is there), but also by the relative importance the semantic web has gotten by 2007.




