PSI - Issue 75
Target parameter (specimen level)
Target parameter (specimen level)
0,35
(a)
(b)
100000000
Jan Schubnell et al. / Procedia Structural Integrity 75 (2025) 94–101 Schubnell / Structural Integrity Procedia (2025)
100
7 Random Forrest ANN
Random Forrest ANN
0,3
10000000
0,25
0,2
1000000
2 [-]
0,15
RSME [-]
0,1
100000
0,05
0
10000
DS1
DS2
DS3
DS4
DS5
DS6
DS1
DS2
DS3
DS4
DS5
DS6
Fig. 4. (a) RSME and 2 score of the results from the investigations on specimen level (see Table 3) Data spaces
Data spaces
(b)
(a)
Specimen level
SN-curve level
Feature
Score 0.417 0.157 0.104 0.069 0.039 0.027 0.025 0.024 0.023 0.018 0.016
Feature
Score 0.512 0.228 0.042 0.039 0.033 0.027 0.021 0.019 0.018 0.014 0.011
6. Conclusions In this work different approaches compared regarding the prediction of fatigue life (number of cycles) of steels based on an extensive database by (Fliegener et. al., 2024) by ML. The fatigue life is evaluated based on estimated SN-curves and (SN-curve level) or directly predicted (specimen level), see Figure 1. The data spaces for the training of the ML approach were used and different ML approaches were applied. Following conclusions are made: • Higher 2 scores are achieved for the prediction of the SN-curve parameter and compared to the prediction of the fatigue life by the applied ML approaches. •
Stress amplitude Surface hardness
Stress concentrationfactor
Surface hardness
Stress concentrationfactor
Cr
Ni
Loadtype
Load type
P
Temperature
C
C
Roughness Rz
Roughness Rz
Ni
Cr
Mn
P
Si
Stress ratio R
Stress ratio R
Fig. 5. Feature importance score
However, the direct prediction of based on the given input parameter, see Figure 1 (b), seem more meaningful than predicting SN-curve parameter and and then the fatigue life from based of these parameters because in that case a much higher prediction accuracy is reached. • The feature importance score of the random forest approach using all data shows that SCF is the most important factor regarding prediction accuracy on SN-curve level and on specimen level it is the stress amplitude. • In the given database are data sets (SN-curves) that have a large scatter (different values of for the specimen on the same stress amplitude). At the same time no parameter is documented how this can be considered for further analysis (for example individual roughness, initial crack size, SCF … of the single specimen). This limits the applicability of the data and results of a low accuracy in the training of ML algorithm. • Averaging the fatigue life of single specimen at the same stress amplitude can decrease the RSME by ML based fatigue life prediction. Increasing the consistency of the data by decreasing the variation of materials, specimens, … had only a minor effect in this work. In general, this work shows the limitation of ML-based fatigue life prediction if the training data space is large and not always consistent. A lack of standardization in fatigue testing and often incomplete documentation of fatigue test results and the material conditions limit the applicability of older data for current used ML approaches. Indeed, this crucial factor needs to be improvement until open-source databases can be used for fatigue life prediction by ML.
Made with FlippingBook flipbook maker