Empiric Experiments with Text Representing Centroids

Mario Kubek 1, Thomas Böhme 2, and Herwig Unger 1
1. FernUniversität in Hagen, Lehrgebiet Kommunikationsnetze, Universitätsstr. 27, Hagen, Germany
2. Technische Universität Ilmenau, Institut für Mathematik, Weimarer Straße 25, Ilmenau, Germany
Abstract—Centroid terms are convenient instruments for representing texts, comparing them semantically, and even clustering sets of documents (hierarchically). Their determination depends on their topical and conceptual context, i.e. the dynamically changing knowledge of a user as represented by the co-occurrence graph. This paper discusses important properties of centroids and their applicability to tasks in natural language processing and text mining, and justifies their use with a set of experiments. Based on the results obtained, a new approach to detecting fine-grained similarities between text documents is derived.
 
Index Terms—centroid term, co-occurrence graph, document similarity, text processing

Cite: Mario Kubek, Thomas Böhme, and Herwig Unger, "Empiric Experiments with Text Representing Centroids," Lecture Notes on Information Theory, Vol. 5, No. 1, pp. 23-28, June 2017. doi: 10.18178/lnit.5.1.23-28