Semantic Portal Business & Economics - German Science Foundation (DFG)
 

Abstract

Aim of the project is the optimization of retrieval results in distributed and heterogeneous bibliographical data and full text collections. We will implement a retrieval system which supports the search for economic literature by means of thesaurus concepts. This implementation is done in cooperation with the company SyynX. In this project, we index articles from various magazines in the economic domain with different thesauri and develop appropriate analysis techniques to evaluate the indexing results and the quality of the retrieval system.


Goals

1. Optimization of the subject enquiry in distributed and heterogeneous bibliographical data collections.

How can we reach a retrieval quality with a search engine that is comparable to a subject enquiry in a library catalog with a controlled vocabulary.


2. Development of methods for automatic document indexing.

We evaluate different approaches for automatic document indexing by means of the SyynX technology with repect to an possible improvement of retrieval results.

At the end of the project, we implement an automatic document indexing and retrieval system that is used by the University Library. It is important for us that other libraries can use these results and the output of our system as well, without being forced to use the commercial SyynX technology. The generated metadata and the automatically assigned keywords will be available in a computer readable format that can easily be integrated in other systems.


3. Development of methods to the automated genesis of cross concordances.

We assume that an optimal result cannot always be obtained by automatic indexing. On the one hand, this can be caused by the employed method, but also by the nature of the used thesaurus: certain thematic areas often aren't covered sufficiently or not at all.

In connection with this, we examine, whether the last-named problem can be solved by the use of different thesauri at indexing step. As the economic sciences are an inherently interdisciplinary field of research, it has to be examined, whether the use of additional thesauri (particularly law and social sciences) from related domains can lead to better results.

On the basis of documents indexed with multiple thesauri, we try to create correspondences between these different thesauri that can be used to access contents directly that are indexed by only one thesaurus.


4. Optimization of the enquiry in the magazine archives purchased by the German Science Foundation with national licenses.

With this project we focus on the economic sciences and the indexing of economic journal articles published by Elsevier. This is especially desirable as the German Science Foundation purchased a national license for these journals and all German universities have access to these articles. So this project should improve the usability of this purchased license and contributes to the sustainability of the employed financial resources. As mentioned above, we give access to the automatically assigned keywords in a computer readable form, so that they can be used for example in ECONIS. So the results would be available for the virtual library for economics and business studies (Project EconBiz).


Publications

1.   Kai Eckert, Magnus Pfeffer and Heiner Stuckenschmidt. Semtinel: Interactive Supervision of Automatic Indexing. JCDL '08: Proceedings of the 2008 conference on Digital libraries, June 16-20 2008, Pittsburgh, PA, USA, ACM, New York, 2008, Demo Paper.
2.   Kai Eckert, Magnus Pfeffer and Heiner Stuckenschmidt. Assessing Thesaurus-Based Annotations for Semantic Search Applications. International Journal on Metadata, Semantics and Ontologies, 2008, to appear.
3.   Magnus Pfeffer, Kai Eckert and Heiner Stuckenschmidt. Visual Analysis of Classification Systems and Library Collections. Proceedings of the 12th European Conference on Research and Advanced Technology for Digital Libraries (ECDL), September 14 to September 19, 2008, Aarhus, Denmark, Springer, Heidelberg, 2008, to appear.
4.   Kai Eckert. A methodology for supervised automatic document annotation. JCDL 2008 Doctoral Consortium, June 16 2008, Pittsburgh, PA, USA, 2008, to appear.
5.   Kai Eckert, Heiner Stuckenschmidt and Magnus Pfeffer. Interactive Thesaurus Assessment for Automatic Document Annotation. K-CAP '07: Proceedings of the 4th international conference on Knowledge capture, Whistler, BC, Canada, ACM, New York, NY, USA, 2007.
6.   Kai Eckert. Thesaurus Analysis and Visualization in Semantic Search Applications. University of Mannheim, 2007.
7.   Diana Maynard, Stamatia Dasiopoulou, Stefania Costache, Kai Eckert, Heiner Stuckenschmidt, Martin Dzbor and Siegfried Handschuh. D1.2.2.1.3 Benchmarking of annotation tools. Knowledge Web Project, 2007.