Extended Abstract

Topical advection as a baseline model for corpus-based lexical dynamics

Authors: , , ,

Abstract

An important question in the field of corpus-based evolutionary language dynamics research is concerned with distinguishing selection-driven linguistic change from neutral evolution, and from changes stemming from language-external factors (cultural drift). A commonly used proxy for the popularity or selective fitness of an element is its corpus frequency. However, a number of recent works have pointed out that raw frequencies can often be misleading. We propose a model for controlling for drift in contextual topics in corpora - the topical-cultural advection model - and demonstrate that this simple measure is capable of accounting for a considerable amount of variability in word frequency changes in a corpus spanning two centuries of language use.

Keywords: language dynamics, language evolution, topical advection, corpora

How to Cite: Karjus, A. , Blythe, R. A. , Kirby, S. & Smith, K. (2018) “Topical advection as a baseline model for corpus-based lexical dynamics”, Society for Computation in Linguistics. 1(1). doi: https://doi.org/10.7275/R5RR1WFX