• Tachanka [comrade/them]@hexbear.net
    link
    fedilink
    English
    arrow-up
    20
    ·
    edit-2
    1 year ago

    If we get to the point where this kind of stuff can process accurately, in real time, on a mobile device, we will have destroyed the need for real time speech translators (sorry, translators), and, more importantly, most of the language barriers between the international working class, no? marx-goth Cautiously optimistic.

    • Kaplya@hexbear.net
      link
      fedilink
      English
      arrow-up
      19
      ·
      edit-2
      1 year ago

      I already wrote this below, but Yandex Browser already does this! It only translates to Russian, and with live streams (on Youtube for example) you get a ~15 seconds delay.

      It’s basically as a real-time transcription -> translation -> voice generation pipeline so the accuracy is as good as the transcript it manages to extract from.

    • Timberknave [none/use name]@hexbear.net
      link
      fedilink
      English
      arrow-up
      17
      ·
      1 year ago

      I am not that scared for my job in the next 10 years. As long as people don’t trust Bazinga Translate from the Torment Nexus Company to translate the doctor who goes through the Operation details with them, I am safe. Remember many countries want a stamp from a sworn translator for legal documents. Not to mention how shit even AI translation is still for Arabic (if it can recognize the letters to begin with, lmao). I can pre-translate a text and then go through it again, sometimes it even saves me time.

      • SerLava [he/him]@hexbear.net
        link
        fedilink
        English
        arrow-up
        12
        ·
        1 year ago

        Yes I think this technology will be great for casually listening to shit, but won’t get in the way of serious translation that matters.

    • space_comrade [he/him]@hexbear.net
      link
      fedilink
      English
      arrow-up
      5
      ·
      1 year ago

      Eh I dunno speech to text algorithms are still kinda garbage. They work well on simple sentences but once you throw in colloquialisms and abbreviations it just craps out.