Skip to content
  • Robert Schubert's avatar
    add many more scripts, document recipes: · 1b6702d4
    Robert Schubert authored
    - improve tesserocr-batch: add multiprocessing
      and probability / confmat output
    - new script dta-txt2gt for DTA text-only GT
      (including line wrapping with hyphenation)
    - new script ocrd-gt2pkl to reduce METS/PAGE
      workspaces into plaintext pickle dumps
      (including alignment)
    - new script prob2pkl to combine OCR results
      for plaintext string and probabilities into
      pickle dump format
    - new script confmat2pkl to combine OCR results
      for plaintet string with alternatives and
      probabilities into pickle dump format
    - add proper module installation
    1b6702d4