  1. #1
    Bronze Lounger
    Join Date
    Jan 2001
    Location
    Melbourne, Australia
    Posts
    1,294
    Thanks
    0
    Thanked 0 Times in 0 Posts

    convert PDF files into Excel (excel 2002 SP3 (XP) )

    Can anyone recommend good software for converting PDF files into Excel format?

    Does it depend on how the PDF file was generated, i.e. which software created it?
    I ask because I have an evaluation copy of Solid Converter PDF, and it more or less dumps all the data into one cell.

    TIA

    diana

  2. #2
    5 Star Lounger
    Join Date
    Aug 2001
    Location
    Surrey, United Kingdom
    Posts
    1,001
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: convert PDF files into Excel (excel 2002 SP3 (XP) )

    Hi Diana

    A quick Google search (http://www.google.co.uk/search?hl=en&q...earch&meta=) turned up these results, but there is not one that I can readily recommend. I do seem to remember using Able2Extract some time ago, which was OK.

    You may be better off posting in Software Finds and Wants, though, because those guys are like bloodhounds and will sniff out any free software!!

    Cheers

    Steve

    Asking the questions everybody wants the answers to but feels too stupid to ask themselves :-)

  3. #3
    Platinum Lounger
    Join Date
    Feb 2002
    Location
    A Magic Forest in Deepest, Darkest Kent
    Posts
    5,681
    Thanks
    0
    Thanked 1 Time in 1 Post

    Re: convert PDF files into Excel (excel 2002 SP3 (

    Hi Diana

    I am no expert in PDF conversion but speak from experience of working with PDF forms in Adobe Acrobat.

    If, say, I have a Word document that I wish to convert to PDF format, the conversion process looks at each line of text, converts it and places it in a separate text box. When I edited the converted file in Adobe Acrobat, the page was a stack of text boxes, one line below another. It has no concept of paragraphs or Word formatting, and I would say the same is true of conversion back to Excel.

    If I were to copy the content of a PDF document and paste it into Word, it would not carry over the formatting of the displayed text and I would spend hours re-formatting. That is a complete waste of time, as it would have been quicker to retype the thing.

    Personally, I think you are on a hiding to nothing here, as all data will be treated as text and you would still have to spend time converting the data to numerical values where needed. That is my tuppence worth, but as Steve mentions there may be some other converters out there, though you may have to pay :-(
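
    To illustrate the clean-up step, here is a minimal Python sketch of turning text cells back into numbers. It is only an illustration: the helper name `to_number` and the formats it accepts (comma thousands separators, a leading currency symbol, accounting-style brackets for negatives) are assumptions, not something any converter is known to produce.

```python
import re

def to_number(cell: str):
    """Return an int or float if the text looks numeric, else the original text."""
    text = cell.strip()
    # Assumed accounting style: (12.50) means -12.50
    negative = text.startswith("(") and text.endswith(")")
    if negative:
        text = text[1:-1]
    # Drop thousands separators and a leading currency symbol (assumed formats)
    cleaned = text.replace(",", "").lstrip("$£€")
    if not re.fullmatch(r"[+-]?\d+(\.\d+)?", cleaned):
        return cell  # leave genuine text untouched
    value = float(cleaned) if "." in cleaned else int(cleaned)
    return -value if negative else value

# Hypothetical row as it might arrive after conversion: every cell is a string
row = ["Widgets", "1,294", "(12.50)", "n/a"]
converted = [to_number(c) for c in row]
```

    Anything that does not look like a number is passed through unchanged, so labels and codes are not damaged.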
    Jerry

  4. #4
    Bronze Lounger
    Join Date
    Jan 2001
    Location
    Melbourne, Australia
    Posts
    1,294
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: convert PDF files into Excel (excel 2002 SP3 (

    Thanks Steve and Jerry

    I didn't realise there was a Software Finds forum, so I've posted there :-P

    Jerry, I agree with you about spending hours reformatting files. Sometimes the result of the conversion is just a mess, and you wonder if it's quicker just to retype.

    thanks again

    diana

  5. #5
    Plutonium Lounger
    Join Date
    Mar 2002
    Posts
    84,353
    Thanks
    0
    Thanked 29 Times in 29 Posts

    Re: convert PDF files into Excel (excel 2002 SP3 (

    Some PDF creators, such as Adobe's PDFMaker add-in for Word and Excel, can (optionally) write information about the structure of the original file into the PDF file, so-called tags. These make it easier to convert the PDF file back into Word or Excel. Without tags, the conversion software has to guess.
    See Understanding Tagged (Structured) PDF for background info.

  6. #6
    Bronze Lounger
    Join Date
    Jan 2001
    Location
    Melbourne, Australia
    Posts
    1,294
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Re: convert PDF files into Excel (excel 2002 SP3 (

    Thanks Hans

    Yes, that's what I suspected: it depends on how the PDF file was created.

    If a crude or old method was used, then it's a lot more difficult to convert the PDF back into the native application's format.

    I don't mind if the application is freeware or if I have to pay; I would just like the app to "work", with seamless, easy conversion.

    diana

  7. #7
    Plutonium Lounger
    Join Date
    Mar 2002
    Posts
    84,353
    Thanks
    0
    Thanked 29 Times in 29 Posts

    Re: convert PDF files into Excel (excel 2002 SP3 (

    Even with a tagged PDF file, you can't expect software to convert it back into a fully functional spreadsheet - all formulas will be lost, for example.

  8. #8
    Platinum Lounger
    Join Date
    Feb 2002
    Location
    A Magic Forest in Deepest, Darkest Kent
    Posts
    5,681
    Thanks
    0
    Thanked 1 Time in 1 Post

    Re: convert PDF files into Excel (excel 2002 SP3 (

    Hi Diana

    I have just been playing with this and have had a little bit of success. It is far from perfect and you may have to play with the formatting, but it looks hopeful. I use open source PDF conversion software called PDF Creator. It is quite a good little tool, and I use it for basic conversion of Excel marking grids. You don't need it for this little experiment; I am just pointing out that I am not using tagging or anything special.

    Secondly, you will need Adobe Reader (I am using 7 at the moment).

    1) Open up your PDF file in Adobe Reader
    2) Click File | Save as Text...
    3) Drop down the "save type:" list and select All Files ("*")
    4) Type in a name, but instead of txt save as csv, e.g. Jerry.csv

    Open up Excel

    1) Create new file
    2) Data | Import External Data | Import Data...
    3) Select your new csv file
    4) In step 2 of 3, select Space as the delimiter
    5) In step 3 of 3 you can convert each column to be general, date or text
    6) Select Finish
    7) Select cell A1
    8) Select OK

    Depending on how things go, you should have a workable, or at least a reasonably formatted, file. As Hans states, everything will come across as strings, so you will have to recreate functions etc.
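
    For anyone who would rather script it, the same idea (split each line of the saved text on spaces) can be sketched in Python. This is a rough sketch, not a tested converter: the `sample` text is made up, and `text_to_rows` is a hypothetical helper name. It treats a run of spaces as a single delimiter.

```python
import csv
import io

def text_to_rows(text: str):
    """Split each non-empty line on runs of whitespace, i.e. a Space
    delimiter with consecutive delimiters treated as one."""
    return [line.split() for line in text.splitlines() if line.strip()]

# Hypothetical sample of what "Save as Text" might produce for a small table
sample = "Item   Qty  Price\nBolt   10   0.25\n\nNut    12   0.10\n"

rows = text_to_rows(sample)

# Write the rows out as real comma-separated text that Excel can open directly
buffer = io.StringIO()
csv.writer(buffer, lineterminator="\n").writerows(rows)
csv_text = buffer.getvalue()
```

    It has the same weakness as the Space delimiter in the import wizard: a value containing spaces, such as a two-word description, is split across several columns.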

    HTH
    Jerry

  9. #9
    Uranium Lounger
    Join Date
    Dec 2000
    Location
    Salt Lake City, Utah, USA
    Posts
    9,508
    Thanks
    0
    Thanked 6 Times in 6 Posts

    Re: convert PDF files into Excel (excel 2002 SP3 (

    pdftohtml -may- preserve table formats in HTML, which you can then paste into Excel. It requires a little setup work. It will not work on protected/encrypted PDF files, nor on PDF files whose content is an embedded graphic.

    Otherwise you'll need a commercial solution, not all of which will convert protected/encrypted PDFs. For that reason my company uses Able2Extract, which also claims to extract a scanned document into Excel format, but I don't use it myself; I just send my stuff off to someone else. In "Software Finds and Wants" there are older threads on this subject which you should search for and review.
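
    If the HTML does come out with real table markup (not guaranteed, hence the "may"), the rows can also be pulled out with a script instead of pasting. Here is a minimal sketch using only Python's standard `html.parser` module. The `TableRows` class and the sample `page` string are made up for illustration and are not actual pdftohtml output.

```python
from html.parser import HTMLParser

class TableRows(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None  # discard any cell left unclosed

# Made-up HTML standing in for a converted page
page = "<table><tr><th>Item</th><th>Qty</th></tr><tr><td>Bolt</td><td> 10 </td></tr></table>"

parser = TableRows()
parser.feed(page)
parser.close()
table_rows = parser.rows
```

    Each inner list is one spreadsheet row, so the result can be written out with the `csv` module and opened in Excel. Every cell is still text at this point.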
    -John ... I float in liquid gardens
    UTC -7±DS
