A Microsoft Office (Excel, Word) forum. OfficeFrustration


How to import PDF into Excel?



 
 
  #1  
June 12th, 2009, 12:22 AM, posted to microsoft.public.excel.worksheet.functions
joeu2004

I want to import some data from PDF files into Excel. Is there a
straightforward way to do this?

For example, see http://muddybuddy.com/pdf/sanjose/results-09.pdf.

What I have done in the past is: open the PDF file, save as text, and write
a VBA macro to read the text file and parse the data line-by-line, putting
it into a worksheet in the form that I require.
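
For readers who want to try the save-as-text route outside VBA, here is a minimal Python sketch of the same idea: read the saved text line by line, pull out the fields, and write a CSV file that Excel can open. The line layout is an assumption (a "Bib:" label followed by a number and some free text), the names `BIB_RE`, `parse_lines` and `write_csv` are made up for illustration, and the sample lines are invented rather than taken from the PDF.

```python
import csv
import io
import re

# Hypothetical layout: a line holds "Bib: <number>" followed by free text.
# The text actually saved from the PDF may look different.
BIB_RE = re.compile(r"Bib:\s*(\d+)\s*(.*)")


def parse_lines(lines):
    """Return one (bib, rest) row for each line that contains a "Bib:" field."""
    rows = []
    for line in lines:
        match = BIB_RE.search(line)
        if match:
            rows.append((int(match.group(1)), match.group(2).strip()))
    return rows


def write_csv(rows, out):
    """Write the rows as CSV with a header row; Excel opens CSV directly."""
    writer = csv.writer(out)
    writer.writerow(["Bib", "Rest"])
    writer.writerows(rows)


# Invented sample lines, only to exercise the functions.
sample = ["Some header line", "Bib: 349 made-up text", "Bib: 299 more made-up text"]
rows = parse_lines(sample)
buffer = io.StringIO()
write_csv(rows, buffer)
print(buffer.getvalue())
```

With a real file, `parse_lines` would take the open text file instead of `sample`, and `write_csv` would write to a file opened with `newline=""`.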

The issue is: the data in this particular file does not follow a consistent
pattern when it is saved to text. For example, compare the data for "Bib:"
numbers 349, 299, 479 and 1084.

(Aside: Can anyone explain why? The data appears consistently in the PDF
file.)

The issue is not insurmountable. I can recognize and deal with the
different patterns in my parser.

The problem is: I don't know (yet) how many different patterns are
possible. I have found 4 so far. But I would have to look carefully at all
1032 entries to determine if there are other forms.

(Actually, I would simply parse what I know and see what is missing, then
add a parser for the missing pattern. But that's tedious.)
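
The "parse what I know and see what is missing" step can at least be made less tedious by having the parser report every line it cannot match. Below is a rough Python sketch of that idea. The two patterns are placeholders, since the four real forms are not shown in this post, and `PATTERNS` and `classify` are names invented for the example.

```python
import re
from collections import Counter

# Placeholder patterns; replace them with the forms actually seen in the text.
PATTERNS = {
    "bib_first": re.compile(r"^Bib:\s*(?P<bib>\d+)\s+(?P<rest>.+)$"),
    "bib_last": re.compile(r"^(?P<rest>.+?)\s+Bib:\s*(?P<bib>\d+)$"),
}


def classify(lines):
    """Try each known pattern in turn; collect the lines that match none."""
    parsed, unmatched = [], []
    counts = Counter()
    for number, line in enumerate(lines, start=1):
        text = line.strip()
        if not text:
            continue  # skip blank lines
        for name, pattern in PATTERNS.items():
            match = pattern.match(text)
            if match:
                parsed.append((int(match.group("bib")), match.group("rest")))
                counts[name] += 1
                break
        else:
            # No pattern matched: keep the line number for later inspection.
            unmatched.append((number, text))
    return parsed, unmatched, counts


# Invented sample lines; the last one deliberately fits neither pattern.
sample = ["Bib: 349 made-up text", "made-up text Bib: 299", "", "Bib 479 missing colon"]
parsed, unmatched, counts = classify(sample)
for number, text in unmatched:
    print(f"line {number} not recognized: {text}")
```

Only the unmatched lines need to be read by eye, and comparing `len(parsed)` with the 1032 entries gives a quick check on whether any form is still missing.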

  #2  
June 12th, 2009, 12:29 AM, posted to microsoft.public.excel.worksheet.functions
joeu2004

Oops, posted to an unintended m.p.excel NG. Wasn't paying attention when I
posted (sigh). Oh well, I know the right people will see this anyway.




 




All times are GMT +1.

