:: PDF Data Extraction from "Specific fixed locations" to Excel ::
Purpose: To extract data from "Specific fixed locations" of a PDF file and export it to an Excel file.
Useful for extracting data easily from a large PDF file.
Motivation to create:
We have Electrical and Instrumentation drawings in a PDF file. The file has more than 200 pages.
I wrote this program to get all the data from "Specific fixed locations" of the PDF file into Excel.
It can be used on a PDF file of any size, and as a general-purpose program to extract data to an Excel file.
Language: Python
Inputs:
- PDF File
- Excel File (which has the rectangular coordinate details)
Outputs: Excel file with extracted data
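The layout of the coordinates Excel file is not spelled out here, so the following is only a minimal sketch of how such a file could be read. It assumes a header row followed by one row per location with the columns label, x0, y0, x1, y1. The names Region, parse_regions and load_regions are hypothetical, and the openpyxl library is an assumption, not necessarily what the exe files use.

```python
from typing import Iterable, List, NamedTuple, Sequence


class Region(NamedTuple):
    """One named rectangle to read from every page (assumed layout)."""
    label: str
    x0: float
    y0: float
    x1: float
    y1: float


def parse_regions(rows: Iterable[Sequence]) -> List[Region]:
    """Turn worksheet rows (label, x0, y0, x1, y1) into Region objects.

    Blank rows are skipped, and the corners are reordered so that
    x0 <= x1 and y0 <= y1 regardless of how the box was drawn.
    """
    regions = []
    for row in rows:
        if not row or row[0] is None:
            continue  # skip empty rows
        label = str(row[0]).strip()
        xa, ya, xb, yb = (float(v) for v in row[1:5])
        regions.append(
            Region(label, min(xa, xb), min(ya, yb), max(xa, xb), max(ya, yb))
        )
    return regions


def load_regions(xlsx_path: str) -> List[Region]:
    """Read the coordinates workbook; assumes row 1 is a header row."""
    from openpyxl import load_workbook  # third-party: pip install openpyxl

    wb = load_workbook(xlsx_path, read_only=True, data_only=True)
    try:
        # values_only=True yields plain tuples of cell values
        return parse_regions(wb.active.iter_rows(min_row=2, values_only=True))
    finally:
        wb.close()
```

For example, load_regions("coordinates.xlsx") (a hypothetical file name) would return one Region per filled row. Keeping the parsing separate from the file reading makes it easy to check the coordinates without opening Excel.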
How to Use: Run the given programs (exe files)
Steps:
1. To identify the coordinates (x0,y0,x1,y1) of a specific location, draw a "Text Box" using the "Comment" option of Adobe Reader and save the PDF file.
2. Run the exe file "Rect-coordinates-TextBox-Annotation.exe".
Give the PDF file name and page number as inputs. After processing, an Excel file pops up automatically with the result, i.e. the rectangular coordinates of the "Text Box" on the given page.
3. Now we have the coordinates of the required locations.
These coordinates are the inputs for the next program, which extracts the data from the PDF file to an Excel file.
Most drawing PDF files have a common footer with details such as
- Sheet Number
- Sheet Description
- Box number
- etc., depending on your PDF drawing file
Save the required coordinates in an Excel file; this Excel file becomes the input for the next program.
4. Now run the "Rect-Coordinates-Data-extraction-w.r.t-excel-coordinates.exe" file.
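The source of the two exe files is not included here, so the following is only a rough sketch of what steps 2 and 4 could look like in Python. It assumes the PyMuPDF library (imported as fitz) for reading the PDF and openpyxl for writing Excel, and it assumes that Adobe Reader stores a "Text Box" comment as a FreeText annotation. The function names and the regions dictionary are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Rect = Tuple[float, float, float, float]  # (x0, y0, x1, y1)


def clean_text(raw: str) -> str:
    """Collapse line breaks and repeated spaces in an extracted snippet."""
    return " ".join(raw.split())


def textbox_rects(pdf_path: str, page_number: int) -> List[Rect]:
    """Step 2: rectangles of the text-box annotations on a 1-based page."""
    import fitz  # PyMuPDF (third-party): pip install pymupdf

    with fitz.open(pdf_path) as doc:
        page = doc[page_number - 1]  # PyMuPDF page indexes start at 0
        return [
            (a.rect.x0, a.rect.y0, a.rect.x1, a.rect.y1)
            for a in page.annots()
            if a.type[1] == "FreeText"  # assumed type of "Text Box" comments
        ]


def extract_rows(pdf_path: str, regions: Dict[str, Rect]) -> List[List[str]]:
    """Step 4: one output row per page, one column per labelled region."""
    import fitz  # PyMuPDF (third-party)

    rows = [["Page", *regions]]  # header row: "Page" plus the region labels
    with fitz.open(pdf_path) as doc:
        for number, page in enumerate(doc, start=1):
            row = [str(number)]
            for rect in regions.values():
                # clip restricts text extraction to the given rectangle
                text = page.get_text("text", clip=fitz.Rect(rect))
                row.append(clean_text(text))
            rows.append(row)
    return rows


def save_rows(rows: Sequence[Sequence[str]], xlsx_path: str) -> None:
    """Write the extracted rows to a new Excel workbook."""
    from openpyxl import Workbook  # third-party: pip install openpyxl

    wb = Workbook()
    ws = wb.active
    for row in rows:
        ws.append(list(row))
    wb.save(xlsx_path)
```

A typical call would be save_rows(extract_rows("drawings.pdf", {"Sheet Number": (500, 760, 560, 780)}), "output.xlsx"), where the file names and coordinates are made-up values. PyMuPDF measures coordinates in points from the top-left corner of the page, so rectangles read with textbox_rects should be reused with the same library and not mixed with coordinates from another tool.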
References: Written with help from ChatGPT.
Disclaimer: This is just a hobby project, tried and tested at my level. I can't guarantee stability or the absence of other issues. Use it at your own risk, and always make a backup copy of the original files.
Attachments:
The Python code has been converted to an executable distribution.
Keep your PDF in the same folder as the program, or give the complete path when using it.
Links to the executable files:
1. Rect-coordinates-TextBox-Annotation
2. Rect-Coordinates-Data-extraction-w.r.t-excel-coordinates