I found this here and have verified the accuracy by copy-pasting into google translate myself.

My question is, is this discrepancy due directly to an intentional decision to translate differently, or is it because google translate has been trained on news articles that have been manually translated for English-speaking audiences?

(To be clear, both paragraphs should involve one person kicking another in the nuts, unless I’m missing something.)

  • Rojo27 [he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    17
    ·
    10 months ago

    Yeah that’s weird. There’s no change in the wording, aside from who is kicking who, so it’s not like it should be switching up the translation.

    • nightshade [they/them]@hexbear.net
      link
      fedilink
      English
      arrow-up
      10
      ·
      10 months ago

      Machine translators make heavy use of machine learning/LLMs on the back end. This is necessary to an extent since the same phrase can have different meanings depending on context, but it also means that the usual biases from machine learning can crop up easily. The most famous example is that if you translated something like “I waved to the doctor” and “I waved to the nurse” to Spanish, it used to give the masculine form/pronouns for the first sentence and the feminine form/pronouns for the second sentence, even though there is no indication of gender in the English version. So there’s a good chance that the context of who is kicking who can cause Google Translate to interpret the same phrase differently due to this bias.

      • HakFoo@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        2
        ·
        10 months ago

        It might also be trying to translate on a phrase or sentence level and applying statistics. The two sentences might occur in more contexts (in their statistical model) where they one gets translated literally and the other idiomatically.

  • HumanBehaviorByBjork [any, undecided]@hexbear.net
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    10 months ago

    99% of the time when one of these black box algorithms produces a weird, inconsistent result, it’s because it’s badly designed, not because of any single conscious decision someone made to obscure a particular idea. Google Translate was an early application of LLMs, and to bust one’s balls is a well known expression that you could translate as “give one a hard time.”

  • RedDawn [he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    10 months ago

    Also it should be “the curtain rises”. “Sube” is the simple present third person conjugation of subir, as well as the imperative second person command, so it could mean either, but the translator chose the wrong one based on context.