Zipf’s law in Toki Pona
DOI:
https://doi.org/10.36505/ExLing-2020/11/0047/000462Keywords:
Zipf’s Law, Toki Pona, artificial languages, computational linguistics, statisticsAbstract
Zipf’s Law states that within a given text the frequency of any word is inversely proportional to its rank in the frequency table of the words used in that text. It is a statistical regularity of a power law that occurs ubiquitously in language – so far every language that has been tested was found to display the Zipfian distribution. Toki Pona is an experimental artificial language spoken by hundreds of users. It is extremely minimalistic – its vocabulary consists of mere 120 words. A comparative statistical analysis of two parallel texts in French and Toki Pona showed that even a language of such scarce vocabulary adheres to Zipf’s Law just like natural languages.
References
Jiang, B., Yin, J., Liu, Q. 2015. Zipf’s Law for All the Natural Cities around the World. International Journal of Geographical Information Science 29, 1–20.
Lang, S. 2014. Toki Pona: the language of good – the simple way of life. United States, Sonja Lang.
Reed, W.J., Hughes, B. D. 2002. From gene families and genera to incomes and internet file sizes: Why power laws are so common in nature. Physical Review E 66, 1–4.
Scholkmann, F. 2016. Power-Law Scaling of the Impact Crater Size-Frequency Distribution on Pluto. Progress in Physics 12(1), 26–29.
Smith, R. 2007. Investigation of Zipf-plot on the extinct Meroitic language. Glottometrics 15, 53–61.
Yu, S., Xu, C., Liu, H. 2018. Zipf’s law in 50 languages: its structural pattern, linguistic interpretation, and cognitive motivation. Semanticscholar.org.
Zipf, G.K. 1935. The Psycho-Biology of Language: An Introduction to Dynamic Philology. Cambridge, The MIT Press.
Zipf, G.K. 1949. Human Behavior and The Principle of Least Effort. Cambridge, Addison-Wesley Press.
Downloads
Published
Issue
Section
License
Articles are published under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.