HAN Honggui, ZHAO Zifan, WU Xiaolong, YANG Shiheng, HE Zheng, ZHAO Nan. Data Cleaning Method for Municipal Wastewater Treatment Based on Improved Random Forest[J]. Journal of Beijing University of Technology, 2021, 47(5): 421-430. DOI: 10.11936/bjutxb2020110034
    Citation: HAN Honggui, ZHAO Zifan, WU Xiaolong, YANG Shiheng, HE Zheng, ZHAO Nan. Data Cleaning Method for Municipal Wastewater Treatment Based on Improved Random Forest[J]. Journal of Beijing University of Technology, 2021, 47(5): 421-430. DOI: 10.11936/bjutxb2020110034

    Data Cleaning Method for Municipal Wastewater Treatment Based on Improved Random Forest

    • To reduce the impact of different types of abnormal data in the municipal wastewater treatment processes, a data cleaning method was proposed in this paper based on improved random forest. First, an anomaly detection model for isolated forest was designed to detect the outlier data. Second, an improved random forest regression model was used to predict the missing data, which improved the random forest to adapt to the mixed type missing data. Third, the detected abnormal data was eliminated. Finally, the improved random forest was used to predict and compensate the missing data of mixed types. This cleaning method was tested through the municipal wastewater treatment data. Results show that the method improves the accuracy of compensation for mixed type missing data.
    • loading

    Catalog

      Turn off MathJax
      Article Contents

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return