The Classification and Regression Tree (CART) is a supervised learning approach useful used to segment the space of and the predictors/features space into smaller homogeneous regions that are, represented in a decision tree. It computes the selection of features automatically, in contrast to traditional statistic methods. In this review, we compared CART to traditional statistics in patients who did not attempt a diet to patients who followed a low-protein diet (LPD) or the Mediterranean diet in chronic kidney disease (CKD) patients, and we analyzed them using linear regression and CART methods. In our example, diet adherence proved to be the factor with the greatest impact on renal function decline, but CART failed to detect significant differences between LPD and Mediterranean diet. Similar results were found using traditional statics, but CART gave a model with the proportion of the explained outcome by the model (R2) higher by about 20%, thus a stronger model. In addition, CART allows quick and easy identification of the variables affecting the outcome with a simple visual representation through the decision tree, increasing the interpretability of the results. In summary, no difference in the impact of variables has been detected with the two methods, but CART gave us a more detailed model with a faster and easier interpretation.

Review and practical excursus on the comparison between traditional statics methods and Classification And Regression Tree (CART) in real-life data: Low protein diet compared to Mediterranean diet in patients with chronic kidney disease

Calabrese, Vincenzo
Primo
;
Metro, Daniela
Secondo
;
Alibrandi, Angela;Cernaro, Valeria;Maressa, Veronica;Longhitano, Elisa;Gembillo, Guido
Penultimo
;
Santoro, Domenico
Ultimo
2025-01-01

Abstract

The Classification and Regression Tree (CART) is a supervised learning approach useful used to segment the space of and the predictors/features space into smaller homogeneous regions that are, represented in a decision tree. It computes the selection of features automatically, in contrast to traditional statistic methods. In this review, we compared CART to traditional statistics in patients who did not attempt a diet to patients who followed a low-protein diet (LPD) or the Mediterranean diet in chronic kidney disease (CKD) patients, and we analyzed them using linear regression and CART methods. In our example, diet adherence proved to be the factor with the greatest impact on renal function decline, but CART failed to detect significant differences between LPD and Mediterranean diet. Similar results were found using traditional statics, but CART gave a model with the proportion of the explained outcome by the model (R2) higher by about 20%, thus a stronger model. In addition, CART allows quick and easy identification of the variables affecting the outcome with a simple visual representation through the decision tree, increasing the interpretability of the results. In summary, no difference in the impact of variables has been detected with the two methods, but CART gave us a more detailed model with a faster and easier interpretation.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11570/3336373
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact