In this article:

Extract Data From a PDF Invoice to Excel (Easiest Way in 2024)

In this article, we will show you how to extract data from an invoice to Excel. Simply follow the steps below!

How to Extract Data from a PDF Invoice to Excel

To extract data from a PDF invoice to Excel, we will use Lido, a spreadsheet app designed to simplify and automate tasks. You can create an account for free using this link.

Method 1: Using the PDF Importer Tool

Here's how to convert your PDF invoice to Excel using Lido's PDF importer tool:

Step 1: Create a New Spreadsheet

Log into your Lido account and go to the Files page. Click on "New file" to create a new spreadsheet where you will organize and analyze the data from your PDF invoice.

extract data from pdf invoice to excel

Step 2: Use the PDF Importer Tool

In your new spreadsheet, click on the "File" menu at the top. Select the "Import from PDF" option from the dropdown menu. This tool will help convert PDF data into a structured spreadsheet format.

extract data from invoice to excel

Step 3: Upload Your PDF Invoice

Click on "Click to Upload" in the file importer tool and select the PDF invoice from your computer, or you can drag and drop the file.

Step 4: Choose and Extract Data

After uploading the PDF, you will see an interface where you can choose the specific area of the PDF you want to extract. Adjust the selection box to include all parts of the table by dragging the blue corners, then press "Extract data" to begin the extraction process.

Step 5: Review and Insert Extracted Data

Ensure your data is complete and correctly extracted in the new window. For text areas, each line will be placed in a separate cell. For tables, it will be extracted as tabular data. If both are present, only tabular data will be extracted.

Once done, click "Insert at active cell" to close the dialog box. If you need to extract more tables, click "Back" to repeat the process.

Step 6: Export Your Data to Excel

Your data should now be in your spreadsheet. After verifying all necessary data has been correctly extracted, click on the three-dot menu next to the worksheet's name and select "Export Worksheet to Excel."

Method 2: Using the IMPORTPDF Formula

Here, we'll use Lido's IMPORTPDF formula to extract all content from the PDF invoice. Note that IMPORTPDF does not work with scanned PDFs. If you have a scanned PDF invoice, use the EXTRACTTABLESFROMPDF formula instead.

Step 1: Upload Your PDF to Google Drive

Log into your Google Drive and upload the PDF invoice by clicking on "New" then "File upload." This will allow Lido to access your file online.

Step 2: Create a New Lido Spreadsheet

Log into your Lido account and go to the Files page. Click on "New file" to create a new spreadsheet for organizing the data extracted from your PDF invoice.

Step 3: Add a New Worksheet

In your Lido spreadsheet, click the plus (+) icon to create a new worksheet.

Step 4: Enter the IMPORTPDF Formula

In cell A1, type "=IMPORTPDF(" without the quotation marks.

Step 5: Connect Your Google Account

Click "Add Credential" and follow the instructions to link the Google account where you uploaded the PDF. This will allow Lido to access your file. Make sure to complete all steps to set up your account properly.

Step 6: Press Comma and Select a PDF File

After connecting your Google account, press the comma key to move to the next part of the formula. Click "Select a file" to open your Google Drive files.

Step 7: Link the PDF in Google Drive

Navigate through your Google Drive and click on the PDF invoice you uploaded. This will link the selected PDF directly to the formula in your spreadsheet.

Step 8: Complete the IMPORTPDF Formula

Finish the formula by typing ",Sheet1!B2)" to designate where the extracted data should be placed, starting at cell B2 in Sheet1. Press ENTER to finalize and apply the formula.

Step 9: Run the IMPORTPDF Formula

Right-click on cell A1 and select "Run action" from the context menu. This will execute the IMPORTPDF formula and start the data extraction from your PDF.

Step 10: Check the Extracted Data

Switch to Sheet1 and review the extracted data. Ensure that all data is accurately displayed and correctly represented in the cells.

Step 11: Export Data to Excel

After verifying that the data extraction is correct, click the three-dot menu next to the Sheet1 tab and select "Export Worksheet to Excel." This will convert your data into an Excel file.

Method 3: Using the EXTRACTTABLESFROMPDF Formula

For this method, we'll use Lido’s EXTRACTTABLESFROMPDF formula to extract all recognized tables from a PDF invoice. This formula is particularly effective with scanned documents.

Step 1: Upload Your PDF Invoice to Google Drive

Sign into your Google Drive account and upload the PDF file you wish to extract data from. Ensure the PDF is accessible online for use with Lido's tools.

Step 2: Set Up a New Spreadsheet in Lido

Navigate to Lido's Files page and create a new spreadsheet by clicking on the "New file" button. This document will be used to organize and manage the data extracted from your PDF file.

Step 3: Create a New Worksheet

Click the plus (+) icon to add a new worksheet next to your default sheet.

Step 4: Enter the EXTRACTTABLESFROMPDF Formula

In your new worksheet, go to cell A1 and input the formula "=EXTRACTTABLESFROMPDF(".

Step 5: Link Your Google Account

Click the "Add Credential" button to link your Google Drive to Lido. Follow the on-screen instructions to connect your account and allow Lido to access your PDF file.

Step 6: Proceed with File Selection

Press the comma key to advance to the next part of the formula. Click "Select a file" to open the file selector.

Step 7: Select the PDF Invoice

Find and select the PDF invoice you uploaded to Google Drive. This will link the PDF to the formula for data extraction.

Step 8: Finalize the Formula

Complete the formula by typing ",Sheet1!B2)" to specify where the extracted data should be placed, starting at cell B2 in Sheet1. Press ENTER to complete the formula.

Step 9: Execute the Formula

Right-click on cell A1 and choose "Run action" from the context menu. This will run the formula and extract the table data from your PDF into the spreadsheet.

Step 10: Review the Extracted Data

Switch to Sheet1 to inspect the extracted data. Ensure the tables have been accurately extracted and correctly reflect the contents of your invoice.

Step 11: Export the Data to Excel

Once you have verified the extracted data, click the three-dot menu next to the Sheet1 tab and select "Export Worksheet to Excel."

We hope that you now have a better understanding of how to extract data from a PDF invoice to Excel.