Traditional weather stations monitor the weather above short grass, which is a standardized environment. Such an environment is far from representative of where most people live. Moreover, despite advances in urban climate modelling, even state-of-the-art weather forecasts and climate scenarios do not account for the hyperlocal influence of land cover on meteorological variables.To bridge this gap, we have constructed several machine learning models to translate 2-meter temperature measurements from standardized to different rural and urban environments. The input features of these models are the land cover fractions: impervious, green and water around a target station, and the interpolated open-field 2-meter temperature and wind values at the target location. The target feature for these models is the temperature data from the Flemish crowd-sourced VLINDER-network, which consists of calibrated stations positioned in unconventional locations. These models were trained on data from a limited set of VLINDER-stations and evaluated on unseen data of previously used and unused VLINDER-stations. We found that a random forest model yields the best results and had the highest interpretability of how the features interacted with the model. The results of the simple artificial neural networks are not robust, making these models less reliable.We explore the addition of more features related to the urban environment such as building height, sky view factor and variables related to radiation. Finally, we investigate how to prevent possible overfitting due to insufficient variation in the land cover in the training data by including other data sources.
Covaci, A, Vergauwen, T, Top, S, Caluwaerts, S & De Cruz, L 2023, 'Machine learning-based emulation of land cover effects at sub-hectometric scale using crowd-sourced weather observations', EGU General Assembly 2023, Vienna, Austria, 23/04/23 - 28/04/23. https://doi.org/10.5194/egusphere-egu23-17349
Covaci, A., Vergauwen, T., Top, S., Caluwaerts, S., & De Cruz, L. (2023). Machine learning-based emulation of land cover effects at sub-hectometric scale using crowd-sourced weather observations. Abstract from EGU General Assembly 2023, Vienna, Austria. https://doi.org/10.5194/egusphere-egu23-17349
@conference{c2b210c033de40c0bad575371665245d,
title = "Machine learning-based emulation of land cover effects at sub-hectometric scale using crowd-sourced weather observations",
abstract = "Traditional weather stations monitor the weather above short grass, which is a standardized environment. Such an environment is far from representative of where most people live. Moreover, despite advances in urban climate modelling, even state-of-the-art weather forecasts and climate scenarios do not account for the hyperlocal influence of land cover on meteorological variables.To bridge this gap, we have constructed several machine learning models to translate 2-meter temperature measurements from standardized to different rural and urban environments. The input features of these models are the land cover fractions: impervious, green and water around a target station, and the interpolated open-field 2-meter temperature and wind values at the target location. The target feature for these models is the temperature data from the Flemish crowd-sourced VLINDER-network, which consists of calibrated stations positioned in unconventional locations. These models were trained on data from a limited set of VLINDER-stations and evaluated on unseen data of previously used and unused VLINDER-stations. We found that a random forest model yields the best results and had the highest interpretability of how the features interacted with the model. The results of the simple artificial neural networks are not robust, making these models less reliable.We explore the addition of more features related to the urban environment such as building height, sky view factor and variables related to radiation. Finally, we investigate how to prevent possible overfitting due to insufficient variation in the land cover in the training data by including other data sources.",
keywords = "urban heat island, machine learning, citizen science, land cover",
author = "Andrei Covaci and Thomas Vergauwen and Sara Top and Steven Caluwaerts and {De Cruz}, Lesley",
year = "2023",
month = apr,
day = "30",
doi = "10.5194/egusphere-egu23-17349",
language = "English",
note = "EGU General Assembly 2023, EGU23 ; Conference date: 23-04-2023 Through 28-04-2023",
url = "https://www.egu23.eu/",
}