Classifying and Understanding Cancer Through microRNA-Based Deep Learning

dc.contributor.authorKaczmarek, Emilyen
dc.contributor.departmentComputingen
dc.contributor.supervisorMousavi, Parvin
dc.contributor.supervisorRenwick, Neil
dc.creator.stunr10138915en
dc.date.accessioned2021-10-01T15:42:05Z
dc.date.available2021-10-01T15:42:05Z
dc.degree.grantorQueen's University at Kingstonen
dc.description.abstractAccurate cancer classification is essential for improved mechanistic understanding, prognostication, and treatment selection. Advances in next-generation sequencing have allowed large molecular datasets to be acquired rapidly. These datasets can be used to increase our understanding of disease through careful extraction and interpretation of molecular information. microRNAs (miRNAs) are small RNA molecules that negatively regulate gene expression, and their dyresgulation is a common disease mechanism in many cancers. While a number of mRNA and protein biomarkers have been suggested for certain cancers, there has been little translation of miRNA biomarkers to a clinical setting. Through a clearer understanding of miRNA dysregulation in cancer, improved mechanistic knowledge and better treatments can be sought. However, there is currently a paucity of computational methods that can handle large next-generation sequencing data for improved cancer classification and understanding. In this work, we present three distinct deep learning architectures designed to classify cancer and extract meaningful information from miRNA profiles: a Deep Cancer Classifier (DCC), Deep Neural Map (DNM), and Graph Transformer Network (GTN). First, we use the DCC to classify neoplastic from non-neoplastic tissue from the same origin. The DCC is compared to machine learning and feature selection algorithms to demonstrate the advantages of miRNA-based deep learning. We also identify miRNA cancer biomarkers with distinct differences in neoplastic and non-neoplastic tissue through our feature selection algorithm (e.g., miR-375 in breast cancer). Next, this work is extended to unsupervised classification with the DNM, using only similarities between miRNA profiles to stratify cancer by neoplasticity status and tissue-of-origin. Activation gradients are implemented to determine miRNA biomarkers and interpret sample misclassifications, finding clusters of cancer subtypes and potential higher-grade tumours. Finally, the GTN is employed to analyze the targeting relationship between miRNA and messenger RNAs (mRNAs). The attention of the network determines both miRNA and mRNA biomarkers, as well as important targeting pathways in cancer. We show the importance of analyzing molecular data through deep learning by achieving high classification accuracy, and identifying both well-known and novel biomarkers.en
dc.description.degreeM.Sc.en
dc.embargo.liftdate2026-09-30T03:43:14Z
dc.embargo.termsI am currently in the process of putting together and submitting manuscripts based on this work, and would like the thesis restricted until these works have been published.en
dc.identifier.urihttp://hdl.handle.net/1974/29464
dc.language.isoengen
dc.relation.ispartofseriesCanadian thesesen
dc.rightsCC0 1.0 Universal*
dc.rightsCC0 1.0 Universal
dc.rights.urihttp://creativecommons.org/publicdomain/zero/1.0/*
dc.rights.urihttp://creativecommons.org/publicdomain/zero/1.0/
dc.subjectDeep Learningen
dc.subjectmicroRNAen
dc.subjectCancer Classificationen
dc.titleClassifying and Understanding Cancer Through microRNA-Based Deep Learningen
dc.typethesisen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kaczmarek_Emily_J_202109_MSC.pdf
Size:
10.06 MB
Format:
Adobe Portable Document Format
Description:
Thesis document

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.25 KB
Format:
Item-specific license agreed upon to submission
Description: