1. Don’t have ChatGPT
2. OCR needed
3. Preferably Android

Thanks.

  • JoBo@feddit.uk
    9 months ago

    It will be a great deal quicker just to read the damn thing.

  • starman@programming.dev
    9 months ago

    1. Download any OCR app from F-Droid or your preferred store.
    2. Copy the recognized text.
    3. Run llama-gpt¹ if you want something self-hosted, or use any LLM² on Hugging Face Chat if you want a ready-made solution.
    4. Paste the text and write something like “summary:” below it.

    ¹Theoretically possible on mobile, but for better performance, run it on a PC.

    ²The default one should do the job.

    Disclaimer: I think this should work, but I haven’t done anything like it before.

    • Ziggurat@sh.itjust.works
      9 months ago

      I have actually tried it, but with doc files on a PC, running Python.

      My main issue is that the models that do it well need a commercial licence. I have the pay grade to experiment by myself on work time, but not the one to spend the company’s money on it. And IT just signed a contract to get GPT-4 as part of Bing Chat Pro.

  • nottheengineer@feddit.de
    9 months ago

    Android won’t be easy, but you can slap together a Python script that runs Tesseract or EasyOCR on the image and feeds the resulting text to a pretrained model like T5. Those are well-known and well-documented, so ChatGPT can probably write the script for you without too many hiccups.
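
A rough sketch of that kind of script, in case it helps. It is untested against real scans and makes several assumptions: `pytesseract`, `Pillow`, `transformers` and `torch` are installed, the Tesseract binary is on your PATH, and `t5-small` is only a placeholder model choice. The function names and the 300-word chunk size are made up for illustration; the chunking is there because T5 only accepts a limited input length.

```python
"""OCR an image with Tesseract, then summarize the text with a small T5 model."""
import sys


def chunk_text(text, max_words=300):
    """Split text into pieces of at most max_words words (hypothetical helper)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def ocr_image(path):
    """Return the text Tesseract recognizes in the image at path."""
    # Imported lazily so chunk_text stays usable without the OCR stack installed.
    from PIL import Image
    import pytesseract

    return pytesseract.image_to_string(Image.open(path))


def summarize(text, model_name="t5-small"):
    """Summarize text chunk by chunk and join the partial summaries."""
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    parts = []
    for chunk in chunk_text(text):
        # T5 expects a task prefix; anything past 512 tokens is truncated.
        inputs = tokenizer("summarize: " + chunk, return_tensors="pt",
                           truncation=True, max_length=512)
        output_ids = model.generate(**inputs, max_new_tokens=80)
        parts.append(tokenizer.decode(output_ids[0], skip_special_tokens=True))
    return " ".join(parts)


def main():
    if len(sys.argv) != 2:
        print("usage: python summarize_scan.py IMAGE", file=sys.stderr)
        return
    print(summarize(ocr_image(sys.argv[1])))


if __name__ == "__main__":
    main()
```

Save it under any name (`summarize_scan.py` above is arbitrary) and pass it an image path. A model as small as `t5-small` will give rough summaries at best, so treat the output as a starting point and check it against the source.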

  • Tarte@kbin.social
    9 months ago

    What’s the value of AI-generated summaries if they are not factually reliable? The new AI-generated previews in Google search results contain so many obvious factual errors (e.g. made-up names, wrong places, false dates) that I really doubt current-generation AI is ready to be a reliable help in this use case. And I believe Google, as a large company, has more resources than most of us do.

    I, too, like the idea of not having to do all this work manually. But we’re not there yet.