A Knowledge Extraction and Management Component to Support Spontaneous Participation
Edited Conference Meeting Proceeding
Harnessing spontaneous contributions of citizens on Social Media and networking sites is a major feature of the next generation citizen-led e-Participation paradigm. However, extracting information of interest from Social Media streams is a challenging task and requires support from domain specific language resources such as lexica. This work describes our efforts at developing a Knowledge Extraction and Management component which employs a lexicon for extracting information related to public services in Social Media contents or streams as part of a holistic technology infrastructure for citizen-led e-Participation. Our approach consists of three basic steps – (1) acquisition and refinement of public service catalogues, (2) organization of the public service names into a lexicon based on different semantic similarity measures and (3) development of a dictionary-based Named Entity Recognizer (NER) or “spotter” based on the lexicon. We evaluate the performance of the NER solution supported by contextual information generated by two well-known general-purpose information NER tools (DBpedia Spotlight and Alchemy) on a dataset of tweets. Results show that our strategy to domain specific information extraction from Social Media is effective. We conclude with a scenario on how our approach could be scaled-up to extract other types of information from citizen discussions on Social Media.
14th IFIP Electronic Government (EGOV) and 7th Electronic Participation (ePart) Conference 2015
7th IFIP 8.5 International Conference, ePart 2015 Thessaloniki, Greece, August 30 – September 2, 2015
Digital Object Identifer (DOI):
Open access repository: