LI Wen-bin, LIU Chun-nian, CHEN Yi-ying. Classifying Text Corpus Based on Information Gain Weight of Feature[J]. Journal of Beijing University of Technology, 2006, 32(5): 456-460.
    Citation: LI Wen-bin, LIU Chun-nian, CHEN Yi-ying. Classifying Text Corpus Based on Information Gain Weight of Feature[J]. Journal of Beijing University of Technology, 2006, 32(5): 456-460.

    Classifying Text Corpus Based on Information Gain Weight of Feature

    • In order to improve the training speed of classifiers without losing their accuracy, three classifying algorithms based on information gain of features are provided in this work. They are IG-C1, IG-C2 and IG-C, which classifies unlabeled text according to features' weight generated in feature selection phase. All these approaches have two characteristics: lower time complexity and simpler implementation. The performance comparison between these algorithms and Naive Bayes, Vector Space Model using retuers 21578 and 20 newsgroup data sets, shows that IG-C algorithm is best one.
    • loading

    Catalog

      Turn off MathJax
      Article Contents

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return