Machine Learning Identifies Stemness Features Associated with Oncogenic Dedifferentiation
- PMID: 29625051
- PMCID: PMC5902191
- DOI: 10.1016/j.cell.2018.03.034
Machine Learning Identifies Stemness Features Associated with Oncogenic Dedifferentiation
Abstract
Cancer progression involves the gradual loss of a differentiated phenotype and acquisition of progenitor and stem-cell-like features. Here, we provide novel stemness indices for assessing the degree of oncogenic dedifferentiation. We used an innovative one-class logistic regression (OCLR) machine-learning algorithm to extract transcriptomic and epigenetic feature sets derived from non-transformed pluripotent stem cells and their differentiated progeny. Using OCLR, we were able to identify previously undiscovered biological mechanisms associated with the dedifferentiated oncogenic state. Analyses of the tumor microenvironment revealed unanticipated correlation of cancer stemness with immune checkpoint expression and infiltrating immune cells. We found that the dedifferentiated oncogenic phenotype was generally most prominent in metastatic tumors. Application of our stemness indices to single-cell data revealed patterns of intra-tumor molecular heterogeneity. Finally, the indices allowed for the identification of novel targets and possible targeted therapies aimed at tumor differentiation.
Keywords: The Cancer Genome Atlas; cancer stem cells; dedifferentiation; epigenomic; genomic; machine learning; pan-cancer; stemness.
Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Figures
References
-
- Agarwal Graepel S, Herbrich T, Har-Peled R, Roth S, Dan Generalization Bounds for the Area Under the ROC Curve. The Journal of Machine Learning Research 2005
-
- Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological) 1995;57:289–300.
Publication types
MeSH terms
Substances
Grants and funding
- U01 CA230690/CA/NCI NIH HHS/United States
- U24 CA143882/CA/NCI NIH HHS/United States
- R01 AA023146/AA/NIAAA NIH HHS/United States
- R01 EB020527/EB/NIBIB NIH HHS/United States
- U24 CA143866/CA/NCI NIH HHS/United States
- U54 HG003273/HG/NHGRI NIH HHS/United States
- U24 CA144025/CA/NCI NIH HHS/United States
- U24 CA143843/CA/NCI NIH HHS/United States
- U24 CA210974/CA/NCI NIH HHS/United States
- U24 CA143848/CA/NCI NIH HHS/United States
- R01 CA180778/CA/NCI NIH HHS/United States
- U24 CA210949/CA/NCI NIH HHS/United States
- R01 CA163722/CA/NCI NIH HHS/United States
- U24 CA143867/CA/NCI NIH HHS/United States
- R50 CA221675/CA/NCI NIH HHS/United States
- U24 CA210990/CA/NCI NIH HHS/United States
- P30 ES010126/ES/NIEHS NIH HHS/United States
- P30 CA016672/CA/NCI NIH HHS/United States
- U54 HG003067/HG/NHGRI NIH HHS/United States
- U54 HG006097/HG/NHGRI NIH HHS/United States
- U24 CA143835/CA/NCI NIH HHS/United States
- R01 GM109031/GM/NIGMS NIH HHS/United States
- P30 CA016086/CA/NCI NIH HHS/United States
- U24 CA210950/CA/NCI NIH HHS/United States
- U24 CA143845/CA/NCI NIH HHS/United States
- U24 CA143799/CA/NCI NIH HHS/United States
- U01 DE025188/DE/NIDCR NIH HHS/United States
- U24 CA143840/CA/NCI NIH HHS/United States
- U24 CA143858/CA/NCI NIH HHS/United States
- R01 CA236591/CA/NCI NIH HHS/United States
- U24 CA210957/CA/NCI NIH HHS/United States
- I01 BX003732/BX/BLRD VA/United States
- U54 HG003079/HG/NHGRI NIH HHS/United States
- U24 CA210988/CA/NCI NIH HHS/United States
- U24 CA143883/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
