I found this here and have verified the accuracy by copy-pasting into google translate myself.

My question is, is this discrepancy due directly to an intentional decision to translate differently, or is it because google translate has been trained on news articles that have been manually translated for English-speaking audiences?

(To be clear, both paragraphs should involve one person kicking another in the nuts, unless I’m missing something.)

  • nightshade [they/them]@hexbear.net
    link
    fedilink
    English
    arrow-up
    10
    ·
    10 months ago

    Machine translators make heavy use of machine learning/LLMs on the back end. This is necessary to an extent since the same phrase can have different meanings depending on context, but it also means that the usual biases from machine learning can crop up easily. The most famous example is that if you translated something like “I waved to the doctor” and “I waved to the nurse” to Spanish, it used to give the masculine form/pronouns for the first sentence and the feminine form/pronouns for the second sentence, even though there is no indication of gender in the English version. So there’s a good chance that the context of who is kicking who can cause Google Translate to interpret the same phrase differently due to this bias.

    • HakFoo@lemmy.sdf.org
      link
      fedilink
      English
      arrow-up
      2
      ·
      10 months ago

      It might also be trying to translate on a phrase or sentence level and applying statistics. The two sentences might occur in more contexts (in their statistical model) where they one gets translated literally and the other idiomatically.