Inferring Systemic Functional Language Models

dc.contributor.authorAlsadhan, Nasseren
dc.contributor.departmentComputingen
dc.contributor.supervisorSkillicorn, B. Daviden
dc.date2014-08-28 23:28:17.897
dc.date.accessioned2014-08-29T13:49:18Z
dc.date.available2014-08-29T13:49:18Z
dc.date.issued2014-08-29
dc.degree.grantorQueen's University at Kingstonen
dc.descriptionThesis (Master, Computing) -- Queen's University, 2014-08-28 23:28:17.897en
dc.description.abstractLanguage production in the brain is a complicated process that is not yet fully understood. The bag-of-words model, which considers the frequencies of each word in a document, is a useful approach in many text mining fields, but it does not provide any information about how language is produced. Systemic networks model language as a set of choices, where each choice operates in a particular context. Capturing patterns of choices used to create a particular document provides useful information about the authors and what they were feeling and thinking when they created the document. However, producing systemic networks manually is expensive. We define an automated way of producing systemic networks. Given a set of documents, we cluster words of interest into smaller groups, by using Non-Negative Matrix Factorization (NNMF). We create hierarchical clusters that we interpret as systemic networks. We validate the produced systemic networks in a number of ways; we use them in an authorship prediction problem and compare their results to that of the bag-of-words model, as well as how well they cluster the different choices made by the authors. We also generate random systemic networks and compare their performance with the produced systemic networks.en
dc.description.degreeM.Sc.en
dc.identifier.urihttp://hdl.handle.net/1974/12396
dc.language.isoengen
dc.relation.ispartofseriesCanadian thesesen
dc.subjectText Miningen
dc.titleInferring Systemic Functional Language Modelsen
dc.typethesisen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Alsadhan_Nasser_N_201408_MSC.pdf
Size:
2.2 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.64 KB
Format:
Item-specific license agreed upon to submission
Description: