[dba-Tech] FW: PDF Extractor

Gustav Brock Gustav at cactus.dk
Thu Oct 11 12:08:12 CDT 2007


Hi Rocky

If you google for:
  pdf to txt
you'll get a bunch of alternatives. There is also this code for .NET:

http://www.codeproject.com/cs/samples/pdf2text.asp 

Using PDFBox to parse PDFs is fairly easy:

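// Assumed namespaces (PDFBox 0.7.x via IKVM): org.pdfbox.pdmodel, org.pdfbox.util
private static string parseUsingPDFBox(string filename)
{
    PDDocument doc = PDDocument.load(filename);
    try
    {
        // Extract all text from the document as a single string.
        return new PDFTextStripper().getText(doc);
    }
    finally { doc.close(); }  // release the file even if extraction fails
}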

/gustav

>>> Jim.Hale at fleetpride.com 11-10-2007 18:51 >>>

Monarch Pro is good for heavy lifting (i.e. large PDFs) and has a VBA
class so it can be run from inside Access.
Jim Hale 

-----Original Message-----
From: dba-tech-bounces at databaseadvisors.com 
[mailto:dba-tech-bounces at databaseadvisors.com] On Behalf Of Rocky Smolin
at Beach Access Software
Sent: Thursday, October 11, 2007 11:34 AM
To: List
Subject: [dba-Tech] FW: PDF Extractor

 
Dear List:
 
A client received a pdf that looks like it came from an Access report.  He
needs to extract the data.  He's evaluating a program called Able2Extract
PDF Converter which seems to be working.  The data in the PDF shows up in an
Excel spreadsheet.  It's $100.
 
Does anyone know any better ways to do this?
 
MTIA,
 
Rocky




