fossilesque@mander.xyzM to Science Memes@mander.xyzEnglish · 1 day agoPublishers Always Innovatingmander.xyzimagemessage-square33fedilinkarrow-up1597arrow-down13
arrow-up1594arrow-down1imagePublishers Always Innovatingmander.xyzfossilesque@mander.xyzM to Science Memes@mander.xyzEnglish · 1 day agomessage-square33fedilink
minus-squarekeepthepace@slrpnk.netlinkfedilinkEnglisharrow-up2·5 hours agoYes, PDFs are much more permissive and may not have any semantic information at all. Hell, some old publications are just scanned images! PDF -> semantic seems to be a hard problem that basically requires OCR, like these people are doing
minus-squareJackbyDev@programming.devlinkfedilinkEnglisharrow-up1·1 hour agoOh nice, thanks for sharing that project. I haven’t heard of it before!
Yes, PDFs are much more permissive and may not have any semantic information at all. Hell, some old publications are just scanned images!
PDF -> semantic seems to be a hard problem that basically requires OCR, like these people are doing
Oh nice, thanks for sharing that project. I haven’t heard of it before!