Esperanto

I was happy when the app Amikumu released statistics about its members. Amikumu is a social platform where users can create a profile and contact nearby users who speak the same language as they do. It is aimed at people learning uncommon languages. The app has become massively popular in the Esperanto community, with 7,672 of the ~10,000 users indicating skills in Esperanto (and I am of course one of them!). The released statistics therefore give us a unique insight into the Esperanto community. I will compare the languages using two measures:

1. The number of learners.
2. The average level of the learners.

In the app, users indicate their speaking ability in a language using one of four categories: Beginner, Intermediate, Advanced and Fluent. I define a ‘Learner’ as a user who chooses one of the first three categories. I calculate the average learner level from the proportions of learners in those three categories (see the Methods section for more).

Amikumu has spread through the Esperanto community, so it is not surprising that Esperanto is a popular language in the app. The average level of Esperanto learners is not high compared to that of English learners, which reflects how important English is, even for Esperanto speakers. I am also surprised by the number of learners of Scandinavian languages, because these languages do not have many native speakers. It could be due to Scandinavians themselves learning the languages of other Scandinavian countries. The second-most popular constructed language is Toki Pona and not Lojban or Klingon, as I had expected. However, apart from Esperanto, the levels of the learners of constructed languages are quite low. What do you see in the plots? Please share in the comments!

### Methods

For English, say, I calculated the number of learners as

$\displaystyle \textup{Learners}=\textup{Advanced}+\textup{Intermediate}+\textup{Beginner}=2566+1673+681=4920$

using statistics from Amikumu. The average language level is calculated as

$\displaystyle \textup{Level}=\frac{c_1\cdot\textup{Advanced}+c_2\cdot\textup{Intermediate}+c_3\cdot\textup{Beginner}}{\textup{Learners}}$

It is not obvious how to choose the constants $c_1,c_2,c_3$. I ended up using the second principal component of the matrix

The coefficients are therefore $(c_1,c_2,c_3)=(0.71802147, -0.01576322, -0.69584244)$. The first principal component is close to a perfect average of the three categories, so it mostly tracks the number of learners. Since the second principal component is orthogonal to the first, I chose it to subtly indicate that the two axes of the plot are almost independent.
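
To make the two formulas concrete, here is a minimal Python sketch that applies them to the English counts quoted above. The function and variable names are my own choices for illustration and are not part of the Amikumu data or of the original calculation.

```python
# Coefficients (c_1, c_2, c_3) for Advanced, Intermediate and Beginner,
# copied from the second principal component given in the text.
C_ADVANCED = 0.71802147
C_INTERMEDIATE = -0.01576322
C_BEGINNER = -0.69584244


def count_learners(advanced, intermediate, beginner):
    """Number of learners: everyone who did not choose 'Fluent'."""
    return advanced + intermediate + beginner


def average_level(advanced, intermediate, beginner):
    """Weighted average of the three learner categories."""
    learners = count_learners(advanced, intermediate, beginner)
    if learners == 0:
        raise ValueError("a language needs at least one learner")
    weighted = (
        C_ADVANCED * advanced
        + C_INTERMEDIATE * intermediate
        + C_BEGINNER * beginner
    )
    return weighted / learners


# English counts from the Methods section.
english = {"advanced": 2566, "intermediate": 1673, "beginner": 681}

english_learners = count_learners(**english)
english_level = average_level(**english)

print(f"Learners: {english_learners}")
print(f"Level:    {english_level:.4f}")
```

Because $c_1$ is positive, $c_3$ is negative and $c_2$ is close to zero, the level is roughly the share of advanced learners minus the share of beginners, so it lies between about $-0.70$ and $0.72$.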