In this article:

OCR Receipt Data Extraction (Easiest Way in 2024)

In this article, we will show you how to extract data from scanned receipts using the Lido app. Simply follow the steps below!

How to Extract Data from Scanned Receipts

To automate data extraction from scanned receipts, we'll utilize Lido, a tool designed to streamline and automate repetitive tasks in spreadsheets. You can sign up for a free account using this link: https://www.lido.app/go/signup.

Method 1: Using the PDF Extraction Tool

Here's how to extract data from scanned receipts using Lido's built-in PDF extraction tool:

Step 1: Create a New Spreadsheet

Log into your Lido account and go to the Files page. Click "New file" to start a new spreadsheet to organize and analyze the data extracted from your scanned receipts.

ocr receipt data extraction

Step 2: Use the PDF Importer Tool

In your new spreadsheet, access the "File" menu at the top. Choose the "Import from PDF" option from the dropdown menu to help convert the PDF data of your scanned receipts into a structured spreadsheet format.

extract data from scanned receipts

Step 3: Upload Your Scanned Receipt PDF

Click on "Click to Upload" in the file importer tool and either select the scanned receipt PDF from your computer or drag and drop the file into the uploader.

extract data from receipt

Step 4: Choose and Extract Data

Once the PDF is uploaded, an interface will appear allowing you to select the specific area of the scanned receipt you wish to extract data from.

Adjust the selection box to cover all necessary parts of the receipt by dragging the corners, then click "Extract data" to begin the extraction process.

scan receipt and extract data

Step 5: Review and Insert Extracted Data

Check that the data in the new window is complete and correctly extracted.

If the selected area contains text, each line will be placed into individual cells. Tabular data will be formatted as tables. If both types of data are present, the plain text will be ignored.

After verifying the data, click "Insert at active cell" to add it to your spreadsheet. If more data needs to be extracted, click "Back" to adjust and repeat the extraction.

Once inserted, the extracted data should automatically show up in your spreadsheet.

Method 2: Using EXTRACTTABLESFROMPDF

For this method, we'll use Lido's custom formula, EXTRACTTABLESFROMPDF, to automate the extraction of tabular data from scanned receipts.

Step 1: Upload Scanned Receipt PDF to Google Drive

Sign into Google Drive and upload the scanned receipt in PDF format by clicking on "New" then selecting "File upload" at the top left. Ensure the receipt is accessible online for Lido's use.

Step 2: Create a New Spreadsheet in Lido

Go to the Files page on your Lido account and click "New file" to start a new spreadsheet for managing data from your scanned receipt.

Step 3: Add a New Worksheet

Next to your default sheet, click the plus (+) icon at the top left to add a new worksheet.

Step 4: Input the EXTRACTTABLESFROMPDF Formula

In the new worksheet, go to cell A1 and type "=EXTRACTTABLESFROMPDF(".

Step 5: Link Your Google Drive to Lido

Click on "Add Credential" and then select "Connect to Google Drive" to link your Google Drive with Lido. Follow the on-screen instructions to complete the connection.

Step 6: Open the File Dialog

With your account linked, press the comma key to move to the next part of the formula. Then click "Select a file" to open the file selector.

Step 7: Select the Uploaded PDF Receipt

Navigate through your Google Drive and choose the scanned receipt PDF you uploaded earlier. This links the PDF directly to the formula.

Step 8: Complete the Formula

Finish typing the formula by adding ",Sheet1!B2)" to designate that the extracted data should populate starting at cell B2 in Sheet1. Press ENTER to apply the formula.

Step 9: Run the Formula for Extraction

Right-click on the menu in cell A1 and select "Run action" from the context menu. This initiates the formula, extracting table data from your scanned receipt PDF into the spreadsheet.

Step 10: Review the Extracted Data

Switch to "Sheet1" to inspect the extracted data. Verify that the tables have been accurately extracted and correctly represent the content of the scanned receipt. Note that only tabular data will be extracted.

We hope that you now know how to extract data from scanned receipts.