Empiric Experiments with Text Representing Centroids
2017-06-30 11:59:00 来源: 评论:0 点击:
Abstract—Centroid terms are comfortable instruments to represent texts, compare them semantically and to even (hierarchically) cluster sets of documents using them. Their determination depends on their topical and conceptual context, i.e. the dynamically changing knowledge of a user represented by the co-occurrence graph. Herein, important properties of centroids as well as their applicability for tasks in natural language processing and text mining shall be discussed and their use justified by a set of experiments. Based on the obtained results, a new approach to detect fine-grained similarities between text documents is derived.
Index Terms—centroid term, co-occurrence graph, document similarity, text processing
Cite: Mario Kubek, Thomas Böhme, and Herwig Unger, "Empiric Experiments with Text Representing Centroids," Lecture Notes on Information Theory, Vol. 5, No. 1, pp. 23-28, June 2017. doi: 10.18178/lnit.5.1.23-28
Cite: Mario Kubek, Thomas Böhme, and Herwig Unger, "Empiric Experiments with Text Representing Centroids," Lecture Notes on Information Theory, Vol. 5, No. 1, pp. 23-28, June 2017. doi: 10.18178/lnit.5.1.23-28
Array
相关热词搜索:
上一篇:Piecewise Union Blend with Adjustable Primitive’s Subsequent Blend Range Parameters in Constructive Geometry
下一篇:A Design Rationale Model for Green Human Computer Interaction Design
分享到:
收藏
评论排行
频道总排行
频道本月排行
- 6Extracting Basic Fighter Maneuv...
- 2A Comprehensive Analysis of Ar...
- 2Analyses of Energy-Aware Geogra...
- 1Data Collection System Using S...
- 1Piecewise Union Blend with Adj...
- 1Empiric Experiments with Text Representing Centroids
- 1A Design Rationale Model for ...
- 1An Algorithm for Computing All...
- 1A Parallel-and-Interactive Schem...