//
you're reading...
intelligent search, linked data, text extraction

Using DBpedia to generate SKOS thesauri

In recent years, we have constantly discussed the application of thesauri and other knowledge models to improve search. Many people understand that thesaurus based search is in many cases better than search algorithms purely based on statistics. Of course the big contra always was, “the costs are too high to establish a good-enough thesaurus or even a high-quality one”.

Imagine you could generate any thesaurus you would like for nearly any knowledge domain you can think of with quite a good quality! Sounds impossible? Reminds you of all the promises made by text mining software which generates “semantic nets” from scratch?

Here at the Semantic Web Company we have been working on SKOSsy for a while. I will explain what this web service can do for you:

SKOSsy generates SKOS based thesauri in German or in English for a domain you are interested in. SKOSsy extracts data from DBpedia, so it can cover anything which is in DBpedia. Thus, SKOSsy works well whenever a first seed thesaurus should be generated for a certain organisation or project. If you load the automatically generated thesaurus into an editor like PoolParty Thesaurus Manager (PPT) you can start to enrich the knowledge model by additional concepts, relations and links to other LOD sources. But you don´t have to start in the open countryside with your thesaurus project.

With SKOSsy in place custom-tailored text extractors can be produced with low effort. To sum up,

  • SKOSsy makes heavy use of Linked Data sources, especially DBpedia
  • SKOSsy can generate SKOS thesauri for virtually any domain within a few minutes
  • Such thesauri can be improved, curated and extended to one´s individual needs but they serve usually as “good-enough” knowledge models for any semantic search application you like
  • SKOSsy based semantic search usually outperform search algorithms based on statistics since they contain high-quality information about relations, labels and disambiguation
  • SKOSsy works perfectly together with PoolParty product family

Which domains are you interested in? Give it a try!

About these ads

About ablvienna

Bridging the gap between Collective Intelligence, Expert Views and Machine Learning

Discussion

Trackbacks/Pingbacks

  1. Pingback: WordPress plugin to make use of linked data |The Semantic Puzzle - December 11, 2011

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

RSS Semantic Puzzle

  • American Physical Society Taxonomy – Case Study
    Taxonomy Strategies has been working with the American Physical Society (APS) to develop a new faceted classification scheme. The proposed scheme includes several discrete sets of categories called facets whose values can be combined to express concepts such as existing … Continue reading →
  • Why SKOS should be a focal point of your linked data strategy
    The Simple Knowledge Organization System (SKOS) has become one of the ‘sweet spots’ in the linked data ecosystem in recent years. Especially when semantic web technologies are being adapted for the requirements of enterprises or public administration, SKOS has played … Continue reading →
  • SEMANTiCS 2014: Call for Industry Presentations
    SEMANTiCS 2014 will take place in Leipzig (Germany) this year from September 4-5. The International Conference on Semantic Systems will be co-located with several workshops and other meetings, e.g. the 2nd DBpedia community meeting. SEMANTiCS conference (formerly ‘I-Semantics’) focuses on transfer … Continue reading →
  • Online checker for SKOS vocabularies now available
    Create better SKOS vocabularies PoolParty team likes to announce the availability of the new online vocabulary quality checker for SKOS vocabularies. It finds over 20 kinds of potential quality problems in controlled vocabularies that are expressed using SKOS. The service … Continue reading →
  • Linked data based search: Make use of linked data to provide means for complex queries
    Two live demos of PoolParty Semantic Integrator demonstrate new ways to retrieve information based on linked data technologies Linked data graphs can be used to annotate and categorize documents. By transforming text into RDF graphs and linking them with LOD … Continue reading →
  • European Data Forum’s Call for Contributions
    The European Data Forum (EDF) is an annual meeting place for industry, research, policy makers, and community initiatives to discuss the challenges and opportunities of data in Europe, especially in the light of recent developments such as Open Data, Linked … Continue reading →
Follow

Get every new post delivered to your Inbox.

%d bloggers like this: