* Note: Word frequencies are based on word counts over the full text of the Lancaster Corpus of Mandarin Chinese, segmented using the words in CC-CEDICT.
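
As a rough illustration of how dictionary-based counts like these can be produced, here is a minimal sketch. The note does not say which segmentation algorithm was used, so the greedy forward maximum matching below is an assumption, as are the function names `segment` and `word_frequencies`, the `max_len` parameter, and the toy vocabulary and corpus (stand-ins for CC-CEDICT and the Lancaster Corpus, not real data from either).

```python
from collections import Counter


def segment(text, vocabulary, max_len=4):
    """Split text by greedy forward maximum matching (an assumed method).

    At each position, take the longest substring (up to max_len characters)
    found in the vocabulary; fall back to a single character otherwise.
    """
    tokens = []
    i = 0
    while i < len(text):
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            # A single character is always accepted so the loop makes progress.
            if length == 1 or candidate in vocabulary:
                tokens.append(candidate)
                i += length
                break
    return tokens


def word_frequencies(texts, vocabulary):
    """Count segmented words across an iterable of texts."""
    counts = Counter()
    for text in texts:
        counts.update(segment(text, vocabulary))
    return counts


# Hypothetical toy data, standing in for the dictionary and the corpus.
vocabulary = {"中文", "词典", "中", "文"}
corpus = ["中文词典", "中文"]
frequencies = word_frequencies(corpus, vocabulary)
```

With this approach, the counts depend on both the vocabulary and the matching strategy, so frequencies computed with a different segmenter or word list may differ.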