In this article, we will show you how to extract data from an invoice to Excel. Simply follow the steps below!
To extract data from a PDF invoice to Excel, we will use Lido, a spreadsheet app designed to simplify and automate tasks. You can create an account for free using this link.
Here's how to convert your PDF invoice to Excel using Lido's PDF importer tool:
Log into your Lido account and go to the Files page. Click on "New file" to create a new spreadsheet where you will organize and analyze the data from your PDF invoice.
In your new spreadsheet, click on the "File" menu at the top. Select the "Import from PDF" option from the dropdown menu. This tool will help convert PDF data into a structured spreadsheet format.
Click on "Click to Upload" in the file importer tool and select the PDF invoice from your computer, or you can drag and drop the file.
After uploading the PDF, you will see an interface where you can choose the specific area of the PDF you want to extract. Adjust the selection box to include all parts of the table by dragging the blue corners, then press "Extract data" to begin the extraction process.
Ensure your data is complete and correctly extracted in the new window. For text areas, each line will be placed in a separate cell. For tables, it will be extracted as tabular data. If both are present, only tabular data will be extracted.
Once done, click "Insert at active cell" to close the dialog box. If you need to extract more tables, click "Back" to repeat the process.
Your data should now be in your spreadsheet. After verifying all necessary data has been correctly extracted, click on the three-dot menu next to the worksheet's name and select "Export Worksheet to Excel."
Here, we'll use Lido's IMPORTPDF formula to extract all content from the PDF invoice. Note that IMPORTPDF does not work with scanned PDFs. If you have a scanned PDF invoice, use the EXTRACTTABLESFROMPDF formula instead.
Log into your Google Drive and upload the PDF invoice by clicking on "New" then "File upload." This will allow Lido to access your file online.
Log into your Lido account and go to the Files page. Click on "New file" to create a new spreadsheet for organizing the data extracted from your PDF invoice.
In your Lido spreadsheet, click the plus (+) icon to create a new worksheet.
In cell A1, type "=IMPORTPDF(" without the quotation marks.
Click "Add Credential" and follow the instructions to link the Google account where you uploaded the PDF. This will allow Lido to access your file. Make sure to complete all steps to set up your account properly.
After connecting your Google account, press the comma key to move to the next part of the formula. Click "Select a file" to open your Google Drive files.
Navigate through your Google Drive and click on the PDF invoice you uploaded. This will link the selected PDF directly to the formula in your spreadsheet.
Finish the formula by typing ",Sheet1!B2)" to designate where the extracted data should be placed, starting at cell B2 in Sheet1. Press ENTER to finalize and apply the formula.
Right-click on cell A1 and select "Run action" from the context menu. This will execute the IMPORTPDF formula and start the data extraction from your PDF.
Switch to Sheet1 and review the extracted data. Ensure that all data is accurately displayed and correctly represented in the cells.
After verifying that the data extraction is correct, click the three-dot menu next to the Sheet1 tab and select "Export Worksheet to Excel." This will convert your data into an Excel file.
For this method, we'll use Lido’s EXTRACTTABLESFROMPDF formula to extract all recognized tables from a PDF invoice. This formula is particularly effective with scanned documents.
Sign into your Google Drive account and upload the PDF file you wish to extract data from. Ensure the PDF is accessible online for use with Lido's tools.
Navigate to Lido's Files page and create a new spreadsheet by clicking on the "New file" button. This document will be used to organize and manage the data extracted from your PDF file.
Click the plus (+) icon to add a new worksheet next to your default sheet.
In your new worksheet, go to cell A1 and input the formula "=EXTRACTTABLESFROMPDF(".
Click the "Add Credential" button to link your Google Drive to Lido. Follow the on-screen instructions to connect your account and allow Lido to access your PDF file.
Press the comma key to advance to the next part of the formula. Click "Select a file" to open the file selector.
Find and select the PDF invoice you uploaded to Google Drive. This will link the PDF to the formula for data extraction.
Complete the formula by typing ",Sheet1!B2)" to specify where the extracted data should be placed, starting at cell B2 in Sheet1. Press ENTER to complete the formula.
Right-click on cell A1 and choose "Run action" from the context menu. This will run the formula and extract the table data from your PDF into the spreadsheet.
Switch to Sheet1 to inspect the extracted data. Ensure the tables have been accurately extracted and correctly reflect the contents of your invoice.
Once you have verified the extracted data, click the three-dot menu next to the Sheet1 tab and select "Export Worksheet to Excel."
We hope that you now have a better understanding of how to extract data from a PDF invoice to Excel.