[Digitalarchivists] Update on book scanning

newmy51 at gmail.com newmy51 at gmail.com
Wed May 31 14:39:55 UTC 2017


Super cool!  Would love to see some photos or screenshots.  Any of this
excellent progress added to the wiki?

Best from Syracuse,

-Danny

On May 31, 2017 7:08 AM, "kprichard" <kprichard at gmail.com> wrote:

> Tonight I finished rebuilding the dorkroom mac mini by reinstalling macOS
> Sierra. Previously I replaced the crashed HD with a donated SSD. Specs are:
> 8GB RAM, Core2Duo 2.4 GHz, 128GB SSD.  It boots quickly and is faster
> overall.  I renamed it to 'BookScannerMacMini'.
>
> Since my last emails I have continued looking for image-to-pdf softwares,
> and recently found another one which looks promising: PDFScanner (macOS)
>
> I put it through the same test as ABBYY FineReader Pro, writing up a
> report and producing a PDF (linked on the wiki)-
>
> https://noisebridge.net/wiki/30_May_2017:_Test_a_copy_of_PDFScanner
>
> Results are acceptable. Not nearly so accurate as ABBYY FineReader, but
> substantially better than Tesseract from cli.  Sorry there are no exact
> quantitative results, just my sense from having looked at this problem for
> more than five minutes.
>
> Cost is $16, which I've spent.  Appears to be faster than FineReader.
>
> Next steps:
> - Hooking the mini up to the twin Canons and getting scan.py working again
> - Add a post-process pipeline with as filesystem watcher, and a script to
> pump the image files thru imagemagick or GIMP: autocrop, align, deskew,
> autolevels, contrast
> - Run some books through and get PDFs
>
> PDFScanner is as close to user-friendly as anything I've seen, certainly
> more so than ABBYY FineReader.  A set of files can be drag-dropped onto it,
> and it automatically starts OCRing them.  If they're all oriented and
> cropped ahead of time, then the only remaining step is to press Cmd-S to
> export as PDF.
>
> We are getting close to having a fully functional book scanner.
>
>
> _______________________________________________
> Digitalarchivists mailing list
> Digitalarchivists at lists.noisebridge.net
> http://www.noisebridge.net/mailman/listinfo/digitalarchivists
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.noisebridge.net/pipermail/digitalarchivists/attachments/20170531/25b795b4/attachment-0003.html>


More information about the Digitalarchivists mailing list