In this article, we will show you how to extract data from scanned receipts using the Lido app. Simply follow the steps below!
To automate data extraction from scanned receipts, we'll utilize Lido, a tool designed to streamline and automate repetitive tasks in spreadsheets. You can sign up for a free account using this link: https://www.lido.app/go/signup.
Here's how to extract data from scanned receipts using Lido's built-in PDF extraction tool:
Log into your Lido account and go to the Files page. Click "New file" to start a new spreadsheet to organize and analyze the data extracted from your scanned receipts.
In your new spreadsheet, access the "File" menu at the top. Choose the "Import from PDF" option from the dropdown menu to help convert the PDF data of your scanned receipts into a structured spreadsheet format.
Click on "Click to Upload" in the file importer tool and either select the scanned receipt PDF from your computer or drag and drop the file into the uploader.
Once the PDF is uploaded, an interface will appear allowing you to select the specific area of the scanned receipt you wish to extract data from.
Adjust the selection box to cover all necessary parts of the receipt by dragging the corners, then click "Extract data" to begin the extraction process.
Check that the data in the new window is complete and correctly extracted.
If the selected area contains text, each line will be placed into individual cells. Tabular data will be formatted as tables. If both types of data are present, the plain text will be ignored.
After verifying the data, click "Insert at active cell" to add it to your spreadsheet. If more data needs to be extracted, click "Back" to adjust and repeat the extraction.
Once inserted, the extracted data should automatically show up in your spreadsheet.
For this method, we'll use Lido's custom formula, EXTRACTTABLESFROMPDF, to automate the extraction of tabular data from scanned receipts.
Sign into Google Drive and upload the scanned receipt in PDF format by clicking on "New" then selecting "File upload" at the top left. Ensure the receipt is accessible online for Lido's use.
Go to the Files page on your Lido account and click "New file" to start a new spreadsheet for managing data from your scanned receipt.
Next to your default sheet, click the plus (+) icon at the top left to add a new worksheet.
In the new worksheet, go to cell A1 and type "=EXTRACTTABLESFROMPDF(".
Click on "Add Credential" and then select "Connect to Google Drive" to link your Google Drive with Lido. Follow the on-screen instructions to complete the connection.
With your account linked, press the comma key to move to the next part of the formula. Then click "Select a file" to open the file selector.
Navigate through your Google Drive and choose the scanned receipt PDF you uploaded earlier. This links the PDF directly to the formula.
Finish typing the formula by adding ",Sheet1!B2)" to designate that the extracted data should populate starting at cell B2 in Sheet1. Press ENTER to apply the formula.
Right-click on the menu in cell A1 and select "Run action" from the context menu. This initiates the formula, extracting table data from your scanned receipt PDF into the spreadsheet.
Switch to "Sheet1" to inspect the extracted data. Verify that the tables have been accurately extracted and correctly represent the content of the scanned receipt. Note that only tabular data will be extracted.
We hope that you now know how to extract data from scanned receipts.