面向城市固废焚烧过程的缺失数据填充及应用

    Filling Method of Missing Data for Municipal Solid Waste Incineration Processes With Its Application

    • 摘要: 针对城市固废焚烧(municipal solid waste incineration, MSWI)过程中存在的随机和连续数据缺失问题,提出了一种基于专家经验和约简特征集成模型的填充方法. 首先,将过程数据缺失情况识别为随机分布、时间维度和特征维度缺失3种类型. 接着,基于专家经验对前2种类型进行缺失填充后,面向第3种类型基于分布相似性和互信息相关性为缺失特征预测模型选择建模数据集和约简特征,建立具有互补特性的随机森林、梯度提升决策树和反向传播神经网络子模型对缺失值进行初步预测,利用贝叶斯线性回归(Bayesian linear regression,BLR)构建集成模型以获得最终填充值. 最后,利用填充后的MSWI数据建立基于跨层全连接深度森林回归的二噁英排放浓度软测量模型. 实验结果表明所提方法提高了MSWI过程数据的质量.

       

      Abstract: In view of the problem of random and continuous data missing in municipal solid waste incineration (MSWI) process, a filling method of missing data based on expert experiences and ensemble model of reduced features was proposed. First, according to the lack conditions of process data, the missing data were divided into three types, i.e., missing with random distribution, time dimension and feature dimension. Then, the former two ones were filled based on expert experience, and the distribution similarity and mutual information (MI) correlation were used to select the modeling data and reduce the input feature for the third type one. Sub-models with complementary characteristics based on random forest (RF), gradient boosting decision tree (GBDT) and back propagation neural network (BPNN) were used to predict the preliminary missing values. Furthermore, the fusion model based on Bayesian linear regression (BLR) was used to obtain the final data filling values. Finally, a soft-sensor model of dioxin (DXN) emission concentration based on deep forest regression with cross-layer full connection (DFR-clfc) was established to verify the filling effect. Results show that the proposed method improves the data quality of MSWI process.

       

    /

    返回文章
    返回