Power Generation Technology ›› 2025, Vol. 46 ›› Issue (1): 135-144.DOI: 10.12096/j.2096-4528.pgt.23005

• Power Generation and Environmental Protection • Previous Articles    

Research on Coal Calorific Value Prediction Based on K-Means Clustering and Ridge Regression Algorithm

Shichao QIAO, Heng CHEN, Bo LI, Peiyuan PAN, Gang XU   

  1. Beijing Key Laboratory of Emission Surveillance and Control for Thermal Power Generation (North China Electric Power University), Changping District, Beijing 102206, China
  • Received:2023-11-15 Revised:2024-03-09 Published:2025-02-28 Online:2025-02-27
  • Contact: Heng CHEN
  • Supported by:
    National Natural Science Foundation of China(52106008);National Natural Science Foundation of China Innovative Research Group Project(51821004)

Abstract:

Objectives The calorific value of coal is one of the important evaluation criteria for measuring coal quality, and is also the main basis for valuation of power coal. In order to realize high-precision and rapid prediction of coal calorific value while reducing prediction costs, a new prediction method is proposed. Methods The K-Means clustering algorithm is used to cluster similar coal types. The sample data comes from 4 269 entry testing information from a self-contained coal yard of a power plant in Shandong in the past 6 years. On the basis of clustering, Ridge regression models are established from industrial analysis data to received base calorific value, which is used as a prediction model for coal calorific value. Results The established K-Means clustering and Ridge regression hybrid model exhibits excellent prediction performance. Compared with the traditional multiple linear regression model, this hybrid model can reduce the mean absolute error by up to 30.525%, reduce the root mean square error by up to 60.054%, and increase the correlation coefficient by up to 2.320%. Conclusions The mixed model of K-Means clustering and Ridge regression reduces the prediction cost of coal calorific value, and also improves the accuracy and speed of prediction, providing a new idea for predicting coal calorific value.

Key words: coal-fired power plants, calorific value of coal, proximate analysis, prediction, K-Means clustering, Ridge regression

CLC Number: