韩红桂, 鲁树武, 伍小龙, 乔俊飞. 基于改进型SVM的城市污水处理过程异常数据清洗方法[J]. 北京工业大学学报, 2021, 47(9): 1011-1020. DOI: 10.11936/bjutxb2019100014
    引用本文: 韩红桂, 鲁树武, 伍小龙, 乔俊飞. 基于改进型SVM的城市污水处理过程异常数据清洗方法[J]. 北京工业大学学报, 2021, 47(9): 1011-1020. DOI: 10.11936/bjutxb2019100014
    HAN Honggui, LU Shuwu, WU Xiaolong, QIAO Junfei. Abnormal Data Cleaning Method for Municipal Wastewater Treatment Based on Improved Support Vector Machine[J]. Journal of Beijing University of Technology, 2021, 47(9): 1011-1020. DOI: 10.11936/bjutxb2019100014
    Citation: HAN Honggui, LU Shuwu, WU Xiaolong, QIAO Junfei. Abnormal Data Cleaning Method for Municipal Wastewater Treatment Based on Improved Support Vector Machine[J]. Journal of Beijing University of Technology, 2021, 47(9): 1011-1020. DOI: 10.11936/bjutxb2019100014

    基于改进型SVM的城市污水处理过程异常数据清洗方法

    Abnormal Data Cleaning Method for Municipal Wastewater Treatment Based on Improved Support Vector Machine

    • 摘要: 针对城市污水处理过程数据存在噪声和缺失的问题,提出一种基于改进型支持向量机(improved support vector machine,ISVM)的异常数据清洗方法.首先,设计一种基于密度估计的噪声数据检测方法,实现对污水噪声数据甄别与剔除.其次,建立一种基于ISVM的缺失数据补偿模型,对缺失数据进行非线性拟合,获得数据缺失时刻的补偿值.最后,运用粒子群优化(particle swarm optimization,PSO)算法更新ISVM参数,提高缺失数据的补偿精度.将该清洗方法应用于城市污水处理过程中,实验结果表明,基于ISVM的异常数据清洗方法能够实现对异常数据的剔除以及缺失数据的补偿,提高了数据质量.

       

      Abstract: To reduce the impact of data noise and loss in the municipal wastewater treatment processes (WWTP), an abnormal data cleaning method was proposed based on improved support vector machine (ISVM) in this paper. First, a noise data detection method was designed to eliminate the noise data of WWTP by using the density estimation. Then, the proposed ISVM was used to design the data compensation model. This data compensation model can obtain the approximation of missing data by realizing the nonlinear fitting of the missing data. Finally, a particle swarm optimization (PSO) algorithm was adopted to optimize the parameters of ISVM to improve the precision of missing data compensation. This proposed cleaning method was applied to a real municipal WWTP, and the experimental results demonstrate that the proposed method can improve the data quality by eliminating the abnormal data and compensating the missing data.

       

    /

    返回文章
    返回