• 综合性科技类中文核心期刊
    • 中国科技论文统计源期刊
    • 中国科学引文数据库来源期刊
    • 中国学术期刊文摘数据库(核心版)来源期刊
    • 中国学术期刊综合评价数据库来源期刊
YANG Zhen, DUAN Li-juan, LAI Ying-xu. Online Public Opinion Hotspot Detection and Analysis Based on Short Text Clustering Using String Distance[J]. Journal of Beijing University of Technology, 2010, 36(5): 669-673.
Citation: YANG Zhen, DUAN Li-juan, LAI Ying-xu. Online Public Opinion Hotspot Detection and Analysis Based on Short Text Clustering Using String Distance[J]. Journal of Beijing University of Technology, 2010, 36(5): 669-673.

Online Public Opinion Hotspot Detection and Analysis Based on Short Text Clustering Using String Distance

More Information
  • Received Date: December 09, 2009
  • Available Online: December 14, 2022
  • The unique language characteristic of short texts has made the performance of traditional natural language processing methods degradation,or even unavailable.Exact representation and calculation of the similarity between short texts are great helpful to content based clustering.That this paper treated each short text as a composition of characters,numbers and punctuation,and a similarity measure based on string similarity was proposed.Then a public opinion hotspot detection and analysis system based on short text hierarchical clustering was built.This method calculated the similarity directly which skipped the feature extraction and representation processing of short text,to a certain extent,and avoided using the sparse feature vectors.Experimental results show the effectiveness of the proposed method.
  • [1]
    中国信息产业商会信息安全产业分会.中国信息安全产业发展白皮书(2005—2010)[EB/OL].[2005-03-11].http:∥www.itsec.gov.cn/webportal/document/baipishu.doc.
    [2]
    龚才春.短本语言计算的关键技术研究[D].北京:中国科学院研究生院计算技术研究所,2008.GONG Cai-Chun.Research on short text language computing[D].Beijing:Institute of Computing Technology,ChineseAcademy of Sciences,2008.(in Chinese)
    [3]
    SCOTTJ.Social network analysis:a handbook[M].2nd Edition.London:Sage,2000:123-145.
    [4]
    车万翔,刘挺,秦兵,等.基于改进编辑距离的中文相似句子检索[J].高技术通讯,2004,14(7):15-20.CHE Wan-xiang,LIU Ting,QIN Bing,et al.Similar Chinese sentence retrieval based on improved edit-distance[J].HighTechnology Letters,2004,14(7):15-20.(in Chinese)
    [5]
    杨震,范科峰,雷建军,等.基于语义的文本流形研究[J].电子学报,2009,37(3):557-561.YANG Zhen,FAN Ke-feng,LEI Jian-jun,et al.Text manifold based on semantic analysis[J].Acta Electronica Sinica,2009,37(3):557-561.(in Chinese)
    [6]
    陈黎飞,姜青山,王声瑞.基于层次划分的最佳聚类数确定方法[J].软件学报,2008,19(1):62-72.CHEN Li-fei,JIANG Qing-shan,WANG Sheng-rui.Ahierarchical method for determining the number of clusters[J].Journalof Software,2008,19(1):62-72.(in Chinese)
    [7]
    BOUGUESSA M,WANG S,SUN H.An objective approach to cluster validation[J].Pattern Recognition Letters,2006,27(13):1419-1430.
    [8]
    马旭,徐蔚然,郭军,等.SMS-2008标注中文短信息库[J].中文信息学报,2009,23(4):22-26.MA Xu,XU Wei-ran,GUO Jun,et al.SMS-2008:an annotated Chinese short messages corpus[J].Journal of ChineseInformation Processing,2009,23(4):22-26.(in Chinese)
  • Related Articles

    [1]JI Qiang, SUN Yanfeng, HU Yongli, YIN Baocai. Review of Clustering With Deep Learning[J]. Journal of Beijing University of Technology, 2021, 47(8): 912-924. DOI: 10.11936/bjutxb2021010013
    [2]GUO Limin, LIN Chunhua, GAO Xu, SU Xing. Efficient Clustering Objects for Spatial Network Using CB-graph[J]. Journal of Beijing University of Technology, 2019, 45(6): 524-533. DOI: 10.11936/bjutxb2017120011
    [3]DU Yongping, DU Xiaoyan, CHEN Shouqin. Relaxed Hierarchy Structure Construction for Text Classification[J]. Journal of Beijing University of Technology, 2017, 43(8): 1175-1181. DOI: 10.11936/bjutxb2016040059
    [4]GUO Xiaodong, FU Tibiao, XU Shuai. Safety Assessment of Timber Ancient Buildings Based on Grey Clustering Analytical Method[J]. Journal of Beijing University of Technology, 2017, 43(5): 780-785. DOI: 10.11936/bjutxb2016060034
    [5]ZHANG Sen, ZHU Mei-ling, HOU Guang-kui. Improvement Fuzzy kernel Clustering Algorithm[J]. Journal of Beijing University of Technology, 2012, 38(9): 1408-1411.
    [6]QIN Ru-xin, TIAN Ying-jie, CHEN Jing, DENG Nai-yang, ZHANG Hai-bin. Data Mining Method of Association Rule for Bi-cluster[J]. Journal of Beijing University of Technology, 2009, 35(4): 561-565.
    [7]LI Yu-jian. Adaptive Clustering Algorithm Based on Minimal Spanning Tree Cutting[J]. Journal of Beijing University of Technology, 2007, 33(3): 331-336.
    [8]LI Yu-jian. Hierarchical Subtrees Agglomerative Clustering Algorithms[J]. Journal of Beijing University of Technology, 2006, 32(5): 442-446.
    [9]Huang Zheng, Chen Zuyin. Evaluation on Results of Several Cluter Methods and Utilization of the Cluster Method of Cut-set[J]. Journal of Beijing University of Technology, 1989, 15(1): 18-23.
    [10]Huang Zheng, Chen Zuyin. Application of Mathematical Morphology to Clustering[J]. Journal of Beijing University of Technology, 1988, 14(4): 1-8.

Catalog

    Article views (21) PDF downloads (9) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return