PSI - Issue 24
Giovanni Zonfrillo et al. / Procedia Structural Integrity 24 (2019) 470–482 G. Zonfrillo and M.S. Gulino / Structural Integrity Procedia 00 (2019) 000–000
479
10
Table 3. Number of materials on which the remarkable models of Table 3 are based and related F 1 scores for each behaviour category. Features Softening Hardening Mixed Stable Total
Number of materials
72
59
5
7
143
f 1 = ln( f ) f 2 = ln( α ) f 3 = ln( σ y ) f 1 = ln(K) f 2 = ln(n) f 3 = ln( σ u )
F 1 score
93.1% 93.1%
84.7% 84.7%
0.0% 14.3% 82.5% 0.0% 14.3% 82.5%
Correct predictions Number of materials Correct predictions Number of materials Correct predictions Number of materials Correct predictions Number of materials F 1 score F 1 score F 1 score
98
99
33
8
238
91.8% 91.8%
82.8% 82.8%
43.1% 0.0% 78.2% 42.4% 0.0% 78.2%
72
59
5
7
143
f 1 = f f 2 = K f 3 = n
94.4% 94.4%
85.5% 84.7%
0.0% 0.0% 82.5% 0.0% 0.0% 82.5%
98
99
33
8
238
f 1 = σ u f 2 = σ u / σ y
90.3% 89.8%
86.9% 86.9%
0.0% 0.0% 73.1% 0.0% 0.0% 73.1%
99
102
33
8
242
f 1 = ln(K) f 2 = ln(n)
F 1 score
88.8% 87.9%
80.8% 80.4%
36.9% 0.0% 74.8% 36.4% 0.0% 74.8%
Correct predictions
where TP represents, for a specific category of material, the number of true positive predictions, FN the number of false negative predictions and FP the number of false positive predictions; these individual concepts are generally summarised in the so-called “confusion matrix”. In the specific problem, S represents the ratio between the frequency of threshold exceeding which leasd to correct classification of the material and the total number of materials with such behaviour within the sample; R instead represents the ratio between the frequency of threshold exceeding which leads to correct classification of the material and the number of times such behaviour is predicted. Finally, the so-called F 1 score (representing the harmonic mean of S and R ) allows assessing the actual predictive ability for the specific combination of model and threshold considered. Table 3 shows, for the models highlighted in Table 2, the number of materials on which these models are based and the associated F 1 scores for each material category. First, models have been excluded for which the combination of features is not available for at least 100 materials. The highest goodness-of-fit results from an imposed value of T equal to 0.4, for all derived logit models; as shown in Table 3, considering logarithmic values of the features often leads to better predictive abilities. Starting from F 1 scores, an inappropriateness of all the derived models in predicting both stable and mixed behaviour is also highlighted. Based on the data reported in Table 3, the model for which f , K and n are the considered features can be pointed out as the best one in terms of softening behaviour prediction (94% correctly predicted). Although this model can be extremely useful if applied during the design phase, it is clear that considering di ff erent features as ln(K) , ln(n) and ln( σ u ) results in a model with comparable goodness-of-fit, but more robust as a consequence of the higher number of materials used for the calculation (143 against 238). The predictive ability on the entire sample for this latter model is lower, also due to a lower predictive ability in terms of hardening behaviour. To highlight the predictive abilities of both models, Fig. 7 shows the distribution of the sample materials based on the values of f , K , n (Fig.7a) and ln(K) , ln(n) , ln( σ u ) (Fig. 7b), as a function of the materials’ behaviour; green-coloured points represent correct predictions of material behaviour, red-coloured points represent incorrect predictions. Considering the whole set of materials, the aforementioned combination of ln(K) , ln(n) , ln( σ u ) turns out to be the absolute best, if features are excluded whose data are not available for almost all materials collected in the database. Other combinations of three features in which at least one of the two “plastic” parameters of the Ramberg-Osgood re lation ( K , n ) is present perform similarly, as well as the combination of K and n which correctly predicts the behaviour of 75% of the sample. Nevertheless, the values of K and n are often absent in databases collecting material properties, available from web sources or technical literature. Excluding these parameters, the best performances are exhibited by combinations between σ u and σ y and in particular by those based on σ u and σ u / σ y . A prediction based on this logit 4. Discussion
Made with FlippingBook - Online catalogs