The metadata sorting is finally
done! You have no idea how happy I am to have that part off my plate.
So what's next? Well, for the next week or so you won't see anything here, as I encourage beta taggers to finish up outstanding tabs (and deal with a few tasks of my own that I've been putting off). There are also several main categories that haven't had a thorough testing by beta taggers (Recreation & Sports, Hobbies & Crafts, Health & Wellness, and Schools & Education), so if you've been waiting to help, here's your invitation to pick one of those and try tagging a tab of it.
Once the main categories have been tested, then I'll post a message that you can link to, so you can invite people you know who might be interested in doing this. The more volunteers who help, the sooner this will be done and can be uploaded to the Internet Archive.
The three key things we need in volunteers (besides the time to complete the task) are:
- carefulness / attention to detail
- a willingness to ask questions when in any doubt
and
- the ability to use Google Sheets / Docs at a basic level.
Specialty knowledge in an area (including being able to read another language) is a bonus but not necessary. Discord is the easiest way to connect with us but Dreamwidth PMs, Google Chat, or even IRC can work instead if need be.
Actual stats:Now up to 2.89% tagged.
Available tabs (sorted by descending numbers by language):
English: 3264
Unknown: 544
Spanish: 389
Portuguese: 336
French: 146
Indonesian/Malay: 131
Italian: 89
German: 77
Turkish: 64
Chinese: 59
Arabic: 54
Romanian: 36
Spam: 32
Persian: 16
Dutch: 12
Filipino: 12
Swedish: 8
Hungarian: 7
Polish: 7
Vietnamese: 6
Bosnian: 3
Finnish: 3
Catalan: 2
Danish: 2
Esperanto: 2
Lithuanian: 2
Norwegian: 2
Russian: 2
Besides the list of available tabs above, we have one tab each (often
well under 100 groups - many don't even have 10!) of the following languages:
African: Afrikaans, Chichewa, Hausa, Kinyarwanda, Malagasy, Somali, Swahili, Yoruba
Asian: Acehnese, Armenian, Azerbaijani, Batak Toba, Bengali, Georgian, Gujarati, Hebrew, Hindi/Urdu, Javanese, Kannada, Kapampangan, Kazakh, Korean, Kurdish, Malayalam, Marathi, Mongolian, Sundanese, Tamil, Telugu, Tetum, Thai, Turkmen, Uzbek, Uyghur
European: Albanian, Basque, Breton, Croatian, Czech, Estonian, Galician, Greek, Icelandic, Ido, Interlingua, Latin, Latvian, Maltese, Occitan, Slovak, Slovenian, Welsh
There are also special compilation tabs with one or two groups each of a variety of lesser-used languages, one for each of African, Asian, European, North American, Oceania, and South America. These tabs are very small, all being 20 groups or fewer.
Of special notice is a tab of around 200 groups from
this family of languages (Zo, Tedim, Hakha, Mara, etc.). If you know of
anyone who can read any of these, please put them in contact with us! Google Translate can only recognize and understand Mizo and the rest must be manually identified (a very difficult task for someone who doesn't speak or read any of them). Translating them is next to impossible. (Some don't even have dictionaries available online.)