Beyond “latent thematic structure”:
Expanding the interpretation of the topic model towards pragmatics
DOI:
https://doi.org/10.17356/ieejsp.v10i4.1323Keywords:
Natural language processing, topic model, model interpretation, pragmaticsAbstract
According to the textbook definition, a topic model aims to uncover the underlying topics of a corpus. Despite its widespread use across disciplines, the nature of these ‘topics’ has remained relatively underdefined. This research note attempts to fill this gap, drawing on empirical evidence to elucidate the practical application of the model. We argue that the frequency of terms within texts is influenced not only by their theme but also by factors such as genre and context, thus extending the notion of ‘latent topics’ beyond referential-semantic boundaries to include pragmatic considerations. Through case studies focusing on different genres, such as parliamentary speeches and online forums, we demonstrate the importance of pragmatics, which is often overlooked in well-known early applications that deal predominantly with formal written texts such as newspaper articles or academic papers.
Downloads
Published
How to Cite
Issue
Section
License
Copyright Notice
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant the journal right of first publication, with the work three months after publication simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal. This acknowledgement is not automatic, it should be asked from the editors and can usually be obtained one year after its first publication in the journal.