Machine learning identifies exosome related gene signatures for early prediction of non-small cell lung cancer.
Non-small cell lung cancer (NSCLC) remains a major health challenge worldwide, mainly due to the lack of effective early diagnostic biomarkers. Exosome-related genes have recently emerged as potential diagnostic markers due to their roles in tumor progression and immune regulation. This study aimed to identify exosome-related gene signatures as early predictive biomarkers for NSCLC and evaluate their diagnostic and therapeutic significance. We integrated gene expression data from GEO and TCGA databases. Core ExoNSCLC-DEGs were identified using three machine learning methods to construct a NSCLC diagnostic model, and the model was validated using ROC curves, calibration curves, and DCA curves. In addition, immune infiltration analysis, drug enrichment, molecular docking analysis, and regulatory network analysis further explored the potential mechanism of action of ExoNSCLC-DEGs. qRT-PCR experiments verified the reliability of gene expression. We constructed a diagnostic model consisting of six core ExoNSCLC-DEGs (including GPM6A, HYAL1, S100A4, ROBO4, LRRK2, and HBA1). The diagnostic model showed excellent predictive performance in independent cohorts (AUC > 0.98). The calibration curve and DCA curve demonstrated the clinical applicability of the model. Immune infiltration analysis revealed the potential immune effects of some ExoNSCLC-DEGs genes, such as S100A4 and LRRK2, which may play a key role in tumor immune escape. Drug enrichment analysis predicted potential therapeutic compounds, especially sunitinib targeting LRRK2. The regulatory network further identified the key RNA-binding proteins and transcription factors that regulate these biomarkers. qRT-PCR experiments verified the reliability of the expression of ExoNSCLC-DEGs in bioinformatics analysis. The diagnostic model based on the six ExoNSCLC-DEGs has strong diagnostic performance and clinical applicability. In-depth research on ExoNSCLC-DEGs provides new insights into the pathogenesis of NSCLC and provides new directions for subsequent research.