PSI - Issue 73

Lenganji Simwanda et al. / Procedia Structural Integrity 73 (2025) 138–145 Author name / Structural Integrity Procedia 00 (2025) 000–000

140

3

Table 1. Key features from the Oslo climate dataset (hourly resolution) used for model training. Feature Unit Source

minimum 25%

50% fractile

75% fractile

maximum

fractile

10 m Wind Speed

m/s

ERA5

0

1.6

2.4

3.4

10.3

10 m Wind Direction

degrees

ERA5

0

83

185

259

359

10min Wind Speed at 10m height 2 m Relative Humidity

m/s

ERA5

0

2.8

4.4

6.2

16.8

%

ERA5 ERA5 ERA5

0.14 -27.2

0.68

0.84

0.92 12.6 11.1

1

2 m Temperature

°C °C

0

5.7 4.8

32.1 18.1

Soil Temperature Level 3 (at 0.3 m depth) Soil Temperature Level 4

-3.2

0.6

°C

ERA5

0.1

1.7

5.3

9.9

13.9

Surface Pressure

Pa

ERA5 ERA5

92000

97400

98200

99000

102000

Surface Solar Radiation

W/m²

0

0

4

154

841

Surface Thermal Radiation

W/m²

ERA5

146

256

296

327

417

Total Cloud Cover

(%)

ERA5

0

0.32

0.91

1

1

Total Precipitation

mm

ERA5 ERA5

0 0

0 0

0 0

0

7.9

Direct Solar Radiation

W/m²

54

749

Snow Depth (m)

mm

ERA5

0

0

0

11.3

401.5

Solar Azimuth Angle

radians

Derived from ISO52010 Derived from ERA5

-3.14

-1.51

0

1.65

3.14

Wind Power Capacity Factor (100 m) Diffuse Horizontal Irradiance

0 -1(fraction)

0

0

0.04

0.15

1

W/m²

ERA5

0

0

4

0.8

392

Ground Snow Load

mm SWE (converted to kN/m²)

SIMELT model

0

0

0

2.3

9.5

Roof Snow Load (U = 1.0, no sliding)

mm SWE (converted to kN/m²)

SIMELT model

0

0

0

0.1

2.7

• ERA5 data from Copernicus Climate Change Service were used as input (Hersbach et al., 2020). SIMELT is a snow accumulation/melt model used to estimate the roof snow load.

3. Machine Learning Implementation 3.1. Models selected and feature engineering

We used four ensemble machine learning models—RF, GBM, XGBoost, and CatBoost—to predict roof snow load from meteorological features. All are tree-based and well-suited for structured regression tasks with nonlinear relationships. Recent studies (Madirisha, Simwanda, and Mtei, 2025; Simwanda et al., 2024) have shown these models excel other models in terms of performance. RF reduces variance via bagging, while GBM builds trees sequentially to minimize prediction error. XGBoost extends gradient boosting with optimized algorithms for accuracy and efficiency, and CatBoost handles ordinal features effectively through the use of symmetric trees, offering greater stability (Chen and Guestrin, 2016). Models were implemented in Python using scikit-learn and official APIs. All models were initially trained on the full set of features from Table 1 (excluding the target – roof snow load). Key predictors included current weather and ERA5 ground snow depth, which provided useful memory of past accumulation and melt. Instead of explicitly engineering features like recent snowfall or melt hours, we relied on the

Made with FlippingBook - Online Brochure Maker