R. Pirrone, A. Pipitone, G. Russo

Semantic sense extraction from Wikipedia pages

Natural Language Processing

14/05/2010

This paper discusses a modality to access and to organize unstructured contents related to a particular topic coming from the access to Wikipedia pages. The proposed approach is focused on the acquisition of new knowledge from Wikipedia pages and is based on the definition of useful patterns able to extract and identify novel concepts and relations to be added in the knowledge base. We proposes a method that uses information from the wiki page's structure. According to the different part of the page we define different strategies to obtain new concepts or relation between them. We analyze not only structure but text directly to obtain relations and concepts and to extract the type of relations to be incorporated in a domain ontology. The purpose is to use the obtained information in an intelligent tutoring system to improve his capabilities in dialogue management with users.

This article is authored also by Synbrain data scientists and collaborators. READ THE FULL ARTICLE