We had some threads about languages recently. To be honest, it’s one of my favorite topics and I thought it was cool to see so many people talking about it.

So, I want to point out Lemmygrad communities I see about language for anyone interested:

Linguistics:

https://lemmygrad.ml/c/leftlang - somewhat active

https://lemmygrad.ml/c/linguistics - not active

Specific language learning:

https://lemmygrad.ml/c/learnchinese

September’s Korean study thread on c/korea

Translation:

https://lemmygrad.ml/c/translation


I’m not sure to what degree other people here are interested in seeing more language study topics be discussed around here but personally I would be very interested in more activity around this topic, such as helping each other study, producing translations together (I know that’s a big project), studying topics like language revitalization, language acquisition, etc.

Is anyone else here enthusiastic about this kind of topic?

  • @redtea@lemmygrad.ml
    link
    fedilink
    22 years ago

    No need to do this just for me, as I’ll try using calibre first.

    Just a thought, though…

    There might be quite a bit of interest in an app that could (maybe this already exists!):

    1. list all words in a text;
    2. arrange those words by frequency (increasing or decreasing);
    3. automate a translation using e.g. deepl or another translator; and
    4. export those words into Anki cards (with an option to ignore words taken from other lists of the most frequent 100, 200, 500, 1000, 3000, 5000 words, etc*).
    • IME trying to rote learn the most frequent 1-200 words is a bit pointless as these words are so frequent they seem to have the most meanings and that meaning is heavily reliant on context. Others may be happy to ignore other sets of the most frequent words if they’re already confident with them. Excluding the most frequent words in any particular text could also work, but the most frequent e.g. 1000 words in that single text might actually span the 2-5000 most frequent words of the written language in general. So auto-excluding the n# most frequent words of one book might delete quite a few unknown words.

    It looks like I’ll be able to do 1 and 2 with calibre and it looks like 3 might be done easily by saving to .xml and opting the file in Google’s spreadsheet software. And I never really got on with Anki, but I know it’s popular.

    Thanks again for the help.

    • @holdengreen@lemmygrad.ml
      link
      fedilink
      22 years ago

      Yes I should be able to do all of that when I’m not out busy. I write on my Manjaro laptop, but the next step is putting the project in a Docker/OCI container which shouldn’t be that hard to run on Windows or whatever.

      Mobile app is actually a bit diffult because I’d need to somehow package the entire Python env with interpreter and libraries. Trying to package ML models would be difficult too (requiring multiple GB’s of storage) unless cloud hosting was an option which it probably isn’t.

      But that’s what I want to do because it is optimal for on-the-go and for people actually being likely/able to use it. Or the lessons and materials can be statically generated from the container which is the plan initially but limits rich interactivity which may be a good goal.