June 16, 2019
On the Idiosyncrasies of the Mandarin Chinese Classifier System
North American Chapter of the Association for Computational Linguistics (NAACL)
While idiosyncrasies of the Chinese classifier system have been a richly studied topic among linguists (Adams and Conklin, 1973; Erbaugh, 1986; Lakoff, 1986), not much work has been done to quantify them with statistical methods. In this paper, we introduce an information-theoretic approach to measuring idiosyncrasy; we examine how much the uncertainty in Mandarin Chinese classifiers can be reduced by knowing semantic information about the nouns that the classifiers modify.
By: Shijia Liu, Hongyuan Mei, Adina Williams, Ryan Cotterell
Facebook AI Research
Natural Language Processing & Speech