Journal of Atmospheric and Environmental Optics ›› 2025, Vol. 20 ›› Issue (2): 176-187.doi: 10.3969/j.issn.1673-6141.2025.02.006

Previous Articles     Next Articles

Estimation method of atmospheric particulate matter mass concentration based on machine learning and multi-source data

YANG Huirong 1,2, HE Nanteng 1,2, BU Lingbing 1,2*, MO Zusi 1,2, FAN Zengchang 1,2, ZHOU Xiaomeng 1,2, SU Xin 1,2   

  1. 1 Key Laboratory for Aerosol-Cloud-Precipitation of China Meteorological Administration, Nanjing University of Information Science and Technology, Nanjing 210044, China; 2 School of Atmospheric Physics, Nanjing University of Information Science and Technology, Nanjing 210044, China
  • Received:2023-02-02 Revised:2023-03-08 Online:2025-03-28 Published:2025-03-24
  • Contact: Lingbing .Bu E-mail:lingbingbu@nuist.edu.cn

Abstract: High concentration of atmospheric particulate matter can cause the reduction of atmospheric visibility and has a negative impact on human health. Therefore, continuous monitoring, estimation and prediction of the temporal and spatial variation characteristics of particulate matter concentration are of great significance for environmental pollution assessment and treatment. In this work, by using the meteorological data of Rencheng District, Jining City, the air quality data from the Electrochemical Plant Station of Jining national environmental air quality monitoring point and the surface aerosol extinction coefficients inverted from lidar signal over Rencheng District from December 2021 to February 2022, a model of particle concentration estimation is established based on machine learning. First, the sliding average of 100 of the air quality index (AQI) in the previous 24-hour is used as the threshold value to divide the samples into two background conditions: clean and polluted. Then, the importance of input factors is ranked using random forest (RF) algorithm, and according to this importance rank, the factors are put into RF, mind evolution algorithm-back propagation neural network (MEA-BPNN), generalized regression neural network (GRNN) and wavelet neural network (WNN) models, respecitively. And then, by comparing the root mean square errors (ERMS) of different models, the optimal model and the optimal number of input factors for particulate matter concentration inversion under different atmospheric background are established. Finally, the particle estimation model developed is applied to the actual atmospheric observation data to evaluate the horizontal distribution of particle concentration under different atmospheric backgrounds based on the optimal model and input factors.

Key words: lidar, aerosol extinction coefficient, particle mass concentration, machine learning

CLC Number: