Frequency of various conjugated forms: persons and numbers
According to Google Ngram,
https://books.google.com/ngrams/graph...
the frequencies of these 2-word sentences
I go,you go,he goes,she goes,we go,they go
at least since 2000 till 2019, are in this order (>> here means "much greater" and ~ "approximately equal"):
"you go" >> "I go" > "we go" >> "they go" > "it goes" > "he goes" > "she goes"
It's not surprising "you go" is high; it merges singular and plural second person forms. But "it goes", "he goes" and "she goes" are so low that even when they are combined the frequency is still lower than that of "we go". That surprises me.
Let's check "do" in place of "go":
https://books.google.com/ngrams/graph...
The result is about the same, except for the first-person singular "I do", which is very high presumably due to the wedding vow.
Let's check the various forms of French "aller" ('to go'), including "on va":
https://books.google.com/ngrams/graph...
The result is:
"je vais" >> "tu vas" ˜ "nous allons" ~ "il va" ~ "on va" ~ "vous allez" > "elle va" ~ "ils vont".
It's interesting to see "je vais" with a much higher frequency. And if we add up "il va" and "elle va", the combined frequency will be higher than "tu vas", but still much lower than that of "je vais".
Lastly, let's check Spanish "ir" ('to go'):
https://books.google.com/ngrams/graph...
Probably due to a problem in data availability, I have to choose the time range of 1990-2007 instead of 1800-2019 or 2000-2019. Since Spanish is a pro-drop language, I omitted the pronoun so the input is a little simpler. The result is:
"va" >> "van" >> "voy" ~ "vamos" > "vas" > "vais".
I don't know why the third persons are very much more frequent, especially the singular. In any case, this is quite different from English and French, where second-person (singular and plural combined) and first-person singular, respectively are by far the most frequent.
Knowing these frequencies is helpful for foreign language education, maybe not so much in English, but in most other languages. For example, we all know we should not strive for memorizing all tenses and conjugated forms with equal effort at the beginning of the study. Prioritizing based on frequencies has practical implications. While tenses, moods and voices are intuitively ranked in frequency, persons and numbers are less so. With the above Google ngram result of frequencies for different persons and numbers, textbook and dictionary writers may choose to give more example sentences in the more frequent persons and numbers to optimize learners' language study.
The above work is relevant to my books. I started to add example sentences to my Learning French Words book, only for the first few hundred common words, or the words that are understood better with examples. No doubt I'll avoid uncommon tenses in these sentences and only use present indicative and occasionally imperfect tenses. As to persons and numbers, however, so far I have opted to give examples using third-person singular more than any other, because that's been my impression of the most frequent one. Not so, says the Google Ngram, in spite of its known defects. Anyway, I may change to first-person singular more often. And for Spanish, the third-person singular is indeed the most frequent. I haven't had time to update that Spanish book with example sentences yet, but I will. That's one of my projects for 2021.
___________________________________
* This message was originally posted at
https://www.facebook.com/groups/gener...
https://forum.language-learners.org/v...
[2021-07-24 Update]
The example sentences I gave to the headword mentir ("to tell a lie") in my Learning French Words book are "je mens / tu mens / il ment (“I’m / you’re / he’s lying”); j’ai / il as / tu a menti (“I / he / you lied”)". Only two tenses are given because they're the most common, present and present perfect, and within each tense I chose the most common conjugations according to Googel Ngrams.
https://books.google.com/ngrams/graph...
the frequencies of these 2-word sentences
I go,you go,he goes,she goes,we go,they go
at least since 2000 till 2019, are in this order (>> here means "much greater" and ~ "approximately equal"):
"you go" >> "I go" > "we go" >> "they go" > "it goes" > "he goes" > "she goes"
It's not surprising "you go" is high; it merges singular and plural second person forms. But "it goes", "he goes" and "she goes" are so low that even when they are combined the frequency is still lower than that of "we go". That surprises me.
Let's check "do" in place of "go":
https://books.google.com/ngrams/graph...
The result is about the same, except for the first-person singular "I do", which is very high presumably due to the wedding vow.
Let's check the various forms of French "aller" ('to go'), including "on va":
https://books.google.com/ngrams/graph...
The result is:
"je vais" >> "tu vas" ˜ "nous allons" ~ "il va" ~ "on va" ~ "vous allez" > "elle va" ~ "ils vont".
It's interesting to see "je vais" with a much higher frequency. And if we add up "il va" and "elle va", the combined frequency will be higher than "tu vas", but still much lower than that of "je vais".
Lastly, let's check Spanish "ir" ('to go'):
https://books.google.com/ngrams/graph...
Probably due to a problem in data availability, I have to choose the time range of 1990-2007 instead of 1800-2019 or 2000-2019. Since Spanish is a pro-drop language, I omitted the pronoun so the input is a little simpler. The result is:
"va" >> "van" >> "voy" ~ "vamos" > "vas" > "vais".
I don't know why the third persons are very much more frequent, especially the singular. In any case, this is quite different from English and French, where second-person (singular and plural combined) and first-person singular, respectively are by far the most frequent.
Knowing these frequencies is helpful for foreign language education, maybe not so much in English, but in most other languages. For example, we all know we should not strive for memorizing all tenses and conjugated forms with equal effort at the beginning of the study. Prioritizing based on frequencies has practical implications. While tenses, moods and voices are intuitively ranked in frequency, persons and numbers are less so. With the above Google ngram result of frequencies for different persons and numbers, textbook and dictionary writers may choose to give more example sentences in the more frequent persons and numbers to optimize learners' language study.
The above work is relevant to my books. I started to add example sentences to my Learning French Words book, only for the first few hundred common words, or the words that are understood better with examples. No doubt I'll avoid uncommon tenses in these sentences and only use present indicative and occasionally imperfect tenses. As to persons and numbers, however, so far I have opted to give examples using third-person singular more than any other, because that's been my impression of the most frequent one. Not so, says the Google Ngram, in spite of its known defects. Anyway, I may change to first-person singular more often. And for Spanish, the third-person singular is indeed the most frequent. I haven't had time to update that Spanish book with example sentences yet, but I will. That's one of my projects for 2021.
___________________________________
* This message was originally posted at
https://www.facebook.com/groups/gener...
https://forum.language-learners.org/v...
[2021-07-24 Update]
The example sentences I gave to the headword mentir ("to tell a lie") in my Learning French Words book are "je mens / tu mens / il ment (“I’m / you’re / he’s lying”); j’ai / il as / tu a menti (“I / he / you lied”)". Only two tenses are given because they're the most common, present and present perfect, and within each tense I chose the most common conjugations according to Googel Ngrams.
Published on January 02, 2021 18:16
No comments have been added yet.
Learning Spanish, French, and Italian Words Through Etymology and Mnemonics
(1) Small corrections and updates to the published book, "Learning Spanish Words Through Etymology and Mnemonics"
(2) Miscellaneous notes about the unpublished books, Learning French / Italian Words Th (1) Small corrections and updates to the published book, "Learning Spanish Words Through Etymology and Mnemonics"
(2) Miscellaneous notes about the unpublished books, Learning French / Italian Words Through Etymology and Mnemonics
(3) Other language related notes or ideas ...more
(2) Miscellaneous notes about the unpublished books, Learning French / Italian Words Th (1) Small corrections and updates to the published book, "Learning Spanish Words Through Etymology and Mnemonics"
(2) Miscellaneous notes about the unpublished books, Learning French / Italian Words Through Etymology and Mnemonics
(3) Other language related notes or ideas ...more
- Yong Huang's profile
- 9 followers
