Lung cancer encompasses multiple histological entities with substantial molecular heterogeneity that remain incompletely resolved at population scale. Here, we constructed a unified reference landscape of lung cancer by analyzing raw RNA sequencing data from 1,558 tumors spanning adenocarcinoma (n=753), squamous cell carcinoma (n=540), small cell lung cancer (n=150), and unclassified non-small cell lung cancer (n=80). Following batch correction, samples were embedded using PaCMAP to generate a continuous molecular atlas annotated with clinical and biological metadata. Rather than segregating strictly by histology, tumors organized along conserved transcriptional axes defined by tumor-intrinsic proliferative or metabolic programs and immune-infiltrated states. Consensus clustering resolved nine robust molecular clusters, including a female non-smoker-enriched adenocarcinoma subgroup, a neuroendocrine-like adenocarcinoma marked by ASCL1 activation, immune-associated regions, and bifurcation of both small cell and squamous carcinomas into biologically distinct states. Spatially-restricted expression of clinically actionable targets revealed state-specific vulnerabilities. Projection of patient tumors and patient-derived xenografts onto the atlas demonstrated preservation of transcriptional identity and enabled quantitative assessment of model fidelity. This unified framework redefines lung cancer as a structured continuum of transcriptional states with translational relevance.
来源出处
Beyond Histology: A Unified Transcriptomic Atlas Defines Lung Cancer Biologic…
https://www.biorxiv.org/content/10.64898/2026.03.16.712177v1?rss=1