Skip to content
Snippets Groups Projects

extract landscape tables from pdf

  • Clone with SSH
  • Clone with HTTPS
  • Embed
  • Share
    The snippet can be accessed without any authentication.
    Authored by Dirk Sarpe
    Edited
    pdf_landscapetable2text.sh 227 B
    #!/bin/bash
    pdftk <input.pdf> cat pagestartno-pageendnoeast anotherpagestartno-anotherpageendnoeast output <output_r90.pdf>
    # the step above you have to perform for each pdf
    for file in *r90.pdf; do
    pdftotext -layout $file
    done
    0% Loading or .
    You are about to add 0 people to the discussion. Proceed with caution.
    Please register or to comment