To convert PDF to Excel, you need EXTREME knowledge of PDF (and Excel for that matter). To have this knowledge, means you can earn money with it.
Marketleader seems to be Nuance (for decades). If they make money on it, it probably means you will need about 2000 man hours of VB6 coding to finish a successful PDF to Excel conversion. I don't think that's really worth it, and therefore I suggest you just buy the product (it's 100 bucks). There are many free online and offline solutions, most are ad supported or adware, so try at your own risk. Thank you Kimputer for your prompt response. I will pursue the product you suggest if I cannot get this to work (below).
I started playing with this after submitting my question: Public acroavdoc As Acrobat.CAcroAVDoc Public acroPDDoc As Acrobat.CAcroPDDoc Set acroavdoc = CreateObject('AcroExch.AVD oc') acroavdoc.Open('c: test.pd f', 'Accessing PDF's') Set acroPDDoc = acroavdoc.GetPDDoc acropddoc.Save(1,'c: testc onvert.pdf ') The parms for acropddoc.Save are (nType as integer, sFullPath as string) The above code snippet will save to pdf so I'm thinking/hoping that if I can find the correct nType enumeration for an Excel file type, then it will create an Excel file. And then, finding all the enumerations, I will also be able to save to MSWord, HTML, XML.etc. Thank you again for your response, fbroccoli. but it appears it requires user intervention It does if you use its GUI, but not if you use its command line interface (CLI). Here's an EE article that shows how to use the CLI of Power PDF Advanced: Following up on kimputer's excellent comments, PDF-to-Excel is tough stuff!
![Create pdf file from vb6 code visual basic download Create pdf file from vb6 code visual basic download](/uploads/1/2/5/6/125625262/370472763.png)
Creating pdf file from visual basic. Replies (6) Email updates; Last post was 10 May 2006 at 15:06 vb6 United States. Reply; 12 years ago. This is useful code, but i want to create more complex pdf file, not just simple text (insert pictures e.g.) Report abuse. Reply; 12 years ago. Luis Dario Larsen. Environment: Visual Basic 6. When it comes to generating reports in software systems, most users want these reports to be printed as an Adobe Portable Document Format (PDF) file, which is is normally a read-only format so that users can't change the contents of it.
I've had excellent results with this free online tool: It does a good (but not perfect) job of maintaining the formatting, which is always the trick with any PDF-to-Excel (or PDF-to-Word) conversion. Another nice feature of this tool is that it performs OCR if the file is an image-only PDF, thereby automatically creating the text. Of course, you want local (VB6) code, so an online tool won't do it for you, but there is a local install of it, available for purchase and download (not free, but it has a 7-day free trial): Another local install (not free, but reasonably priced at $27) is Boxoft PDF to Excel: Yet another local install worth trying is A-PDF to Excel (also not free, but reasonably priced at $39, and there's a free trial): Both Boxoft PDF to Excel and A-PDF to Excel require that the PDF have text (not just an image), e.g., if your docs are scanned images, the PDFs need to have been processed with OCR software. You may do that with any OCR tool that you have. Then the A-PDF and Boxoft products will be able to process the text generated by OCR and attempt to create a properly formatted Excel spreadsheet. If you don't have an OCR tool, Boxoft has a free one: And A-PDF has a reasonably priced one ($27): And here's a 5-minute EE video Micro Tutorial that shows another free product to perform OCR on PDFs: Here's another approach instead of getting two products (one for OCR and one for conversion to Excel). Take a look at Nuance's.
Nuance, of course, is the same company that makes Power PDF. The standard edition of PaperPort (i.e., non-Professional) can do what you want. Although the list price is $99, the street price is usually less than half that. Right now, the latest version (14) is just $25 at Amazon: It has been built-in OCR and a built-in feature to convert PDF docs to Excel. It can even scan directly to an Excel spreadsheet, as explained in these 5-minute EE video Micro Tutorials: The Amazon purchase is for PaperPort 14.0, but this EE article explains how to upgrade for free to the latest point release, 14.5: And this article shows how to apply the latest patch update (Patch 1): Of course, if your PDF already has text (i.e., is not just a raster image/bitmap/graphic), then ignore all of my recommendations with respect to OCR. As a disclaimer, I want to emphasize that I have no affiliation with any of the companies mentioned in this post and no financial interest in them whatsoever.
I am simply a happy user/customer. Regards, Joe Update: When I submitted this post, I see that kimputer commented on AutoHotkey. If you'd like to learn more about that, this EE article should help. If they contain no text, then they're surely image-only PDFs, containing only raster graphics/images, so the method you use will have to perform OCR in order for conversion to Excel to be meaningful. If you have many files with varying layouts, I don't think you'll get anything to work well — it's going to be error-prone.
But if all the files have the same exact layout (only the data values are different), then you may be able to get an automated method to be reasonably effective in creating an Excel spreadsheet. But it's still not going to be 100% accurate, and for two reasons — OCR errors and PDF-to-Excel conversion errors. So, first question: do all the files have the same exact layout or are the layouts different?
Can you post a sample file (being careful not to include private/sensitive info)? Regards, Joe. Yes, they're probably being scanned. I really don't know what is going to work best for your particular docs. I don't even know if anything will work well.
You should experiment with the software I mentioned earlier. Here's a good first test. OCR the docs with the free software described in my 5-minute EE video Micro Tutorial: You won't be able to call this with your VB6 code, but we'll worry about that later. For now, we just need the docs to have text in them via OCR. Then use the product for the conversion (free trial available), which does have a command line interface that you could call from VB6 code.
![Vb6 Vb6](/uploads/1/2/5/6/125625262/629420543.jpg)
I emphasize that I have no idea how well it will do on your docs. Regards, Joe. Worst case, I have the users open the pdf, click Save as, and export to text, or Excel or Word, and then run my program. No need for that! If the PDF has text, then you can easily grab it in your VB6 code by calling one of the (free!) command line Xpdf utilities. Here's a 5-minute EE video Micro Tutorial that shows how to download and install them: And here's another one that shows specifically how to extract the text: Before calling this from your VB6 code, experiment with the five different settings for the output mode: -layout -lineprinter -raw -table Regards, Joe.