Geiß, Johanna
Abstract
This master's thesis describes the implementation of BoSSE, a search engine based on Latent Semantic Indexing (LSI). Four search types were implemented, which allow searching for documents or terms similar to a given term, query, or document. These search types are evaluated, and the importance of term weighting, the exclusion of non-content words, and the choice of k, the number of dimensions retained in the reduction, is discussed. In addition, the thesis gives an introduction to LSI and an explanation of the Singular Value Decomposition (SVD).
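To illustrate the general idea behind LSI retrieval mentioned in the abstract, here is a minimal sketch using NumPy. It is not BoSSE's code: the toy term-document matrix, the function names (`lsi`, `fold_in_query`, `cosine`), and the choice k = 2 are assumptions made purely for illustration, and raw counts are used without any term weighting. The sketch computes a rank-k SVD, folds a one-term query into the reduced space, and ranks documents by cosine similarity.

```python
import numpy as np

# Hypothetical toy term-document matrix (rows = terms, columns = documents).
# The counts are invented for illustration only.
terms = ["car", "engine", "wheel", "banana", "fruit"]
A = np.array([
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 2, 1],
    [0, 0, 1, 2],
], dtype=float)


def lsi(matrix, k):
    """Return the rank-k factors U_k, s_k, Vt_k of the SVD of `matrix`."""
    U, s, Vt = np.linalg.svd(matrix, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


def fold_in_query(q, U_k, s_k):
    """Project a term-space query into the k-dim space: q^T U_k S_k^{-1}."""
    return (q @ U_k) / s_k


def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


U_k, s_k, Vt_k = lsi(A, k=2)
doc_vectors = Vt_k.T  # one row per document in the reduced space

# Query consisting of the single term "car".
q = np.zeros(len(terms))
q[terms.index("car")] = 1.0
q_k = fold_in_query(q, U_k, s_k)

# Rank documents by similarity to the query, most similar first.
scores = [cosine(q_k, d) for d in doc_vectors]
ranking = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
print(ranking, [round(x, 3) for x in scores])
```

In this toy setup the two documents that share vocabulary with "car" rank first, while the fruit-related documents score near zero. Changing `k` or applying a weighting scheme to `A` before the decomposition changes the reduced space, which is the kind of effect the thesis evaluates.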
Document type: | Master's thesis |
---|---|
Date Deposited: | 30 Aug 2006 08:37 |
Date: | 2006 |
Faculties / Institutes: | Neuphilologische Fakultät > Institut für Computerlinguistik |
DDC-classification: | 004 Data processing Computer science |
Controlled Keywords: | Information Retrieval, Linguistische Datenverarbeitung (linguistic data processing), Semantischer Raum (semantic space), Singulärwertzerlegung (singular value decomposition) |
Uncontrolled Keywords: | Latent Semantic Indexing, LSI, LSA |