In this article, we will show you how to extract data from PDF to Excel using the Lido app. It's quick and easy, simply follow the steps below!
To extract data from a PDF, we'll be utilizing Lido, a spreadsheet designed to automate and streamline repetitive tasks. You can set up your account for free using the following link: https://www.lido.app/go/signup.
Here's how to extract data from a PDF file to Excel using the PDF Importer tool in Lido.
Log into your Lido account and navigate to the Files page. Create a new spreadsheet. This new document will be where you organize and analyze the data extracted from your PDF.
In your new spreadsheet, head to the File menu located at the top of the interface. Select the "Import from PDF" from the dropdown options. This tool is specially designed to facilitate the conversion of data from PDFs into a structured spreadsheet format.
Click on the upload section of the PDF Importer tool to browse and upload the PDF file from which you want to extract data. Ensure that this is the correct file that contains the data you need.
After uploading the PDF, you will see an interface where you can select the specific area of the PDF you wish to extract. Adjust the selection box to precisely cover just the data you need, ensuring you don't include unwanted sections, and then press "Extract data" to start the extraction process.
The extracted data is now in the active cell of the spreadsheet. The PDF importer is designed to convert data into spreadsheet format.
Note that if the selected area contains only text, then each line of text is placed into a separate cell. If the selected area contains tabular data, then the tabular data will be extracted. If the selected area contains both tabular data and plain text, the plain text will be ignored.
If you need to extract more data from the PDF, click "Back" now. To finish, just close the modal by clicking the "X" button in the top right corner.
After ensuring all necessary data has been correctly extracted and is now in your spreadsheet, find the three-dot menu next to the worksheet's name. Click on it and choose "Export Worksheet to Excel". This option converts your spreadsheet into an Excel file, which you can then save to your computer for further use or analysis.
In this method, we will use Lido’s custom formula IMPORTPDF which extracts everything from the supplied pdf at once. IMPORTPDF does not work on scanned PDF documents; If your PDF document is scanned, consider method 3 below which uses EXTRACTTABLESFROMPDF formula instead.
Begin by logging into your Google Drive and uploading the PDF file from which you want to extract data. Ensure the PDF is accessible online to utilize it with Lido’s tools.
Navigate to Lido's files page and create a new spreadsheet. This spreadsheet will be the place where the extracted PDF data is organized and analyzed.
Add a new worksheet to your spreadsheet. This is done by clicking the plus button at the top left of the interface.
Type the beginning of the IMPORTPDF formula into cell A1. This sets up the cell to receive the full formula that will instruct Lido on how to process your PDF file.
Click the "Add Credential" button to begin linking your Google Drive account to Lido, allowing the application to access the PDF file you've uploaded. Follow all on-screen instructions to ensure proper setup.
After adding your credentials, continue the formula by typing a comma, which separates the arguments, then click "Select a file" to open a dialog box where you can choose the uploaded PDF from your Google Drive.
Browse through your Google Drive in the file selector and click on the PDF you previously uploaded. This action links the selected PDF directly to the IMPORTPDF formula.
Finalize the formula by indicating where in the spreadsheet you want the extracted data to be placed. In this case, typing ", Sheet1!B2)" specifies that data should be inserted starting at cell B2 of Sheet1. Press ENTER to complete and set the formula.
After setting the formula, right-click on cell A1 where it is entered and select “Run action” from the context menu. This will execute the IMPORTPDF formula, initiating the data extraction process from the PDF.
Once the formula has run, navigate to Sheet1 to review the extracted data. Ensure that all data appears as expected and is accurately represented within the cells.
After confirming the data extraction is correct, click the three-dot menu next to the Sheet1 tab and select "Export Worksheet to Excel." This action converts your data into an Excel file, allowing you to save, share, or further analyze it outside of Lido.
In this method, we will use Lido’s custom formula EXTRACTTABLESFROMPDF which extracts everything from the PDF it recognized as a table. This formula works on scanned documents.
Start by logging into your Google Drive and uploading the specific PDF file from which you want to extract table data. Ensure that the PDF is accessible in your Drive for the extraction process.
Navigate to the Files section in Lido and create a new spreadsheet. This spreadsheet will be used to manage and analyze the table data you extract from the PDF.
To organize your data extraction efficiently, add a new worksheet to your spreadsheet by clicking the plus icon located in the top left corner. This new sheet will house the extracted tables.
In the new worksheet, go to cell A1 and begin typing the EXTRACTTABLESFROMPDF formula. This formula will initiate the process to extract only tabular data from your PDF.
Click "Add Credential" to link your Google Drive to Lido. This step is essential as it allows Lido to access the PDF file directly from your Drive. Follow all prompts to ensure a secure connection.
After adding your account credentials, press the comma key to move to the next part of the formula. Then click "Select a file" to open a file picker where you can choose the uploaded PDF from your Google Drive.
Find and select the PDF you previously uploaded. This action links the PDF with the formula, setting it up for data extraction.
Complete the formula by specifying the output location for the extracted data within your spreadsheet. Typing ', Sheet1!B2)' tells the formula to place the extracted tables starting at cell B2 on Sheet1. Press ENTER to finalize the formula.
With the formula set, right-click on cell A1 and select "Run action" from the context menu. This will execute the formula, extracting table data from the PDF into your spreadsheet.
Go to Sheet1 to check the extracted data. Ensure that the tables have been properly extracted and accurately reflect the content of the PDF.
Note that only tabular data will be extracted. Consider using methods 1 and 2 if you need to extract non-tabular data.
After confirming that the table data is correctly extracted, click the three-dot menu beside "Sheet1" and choose "Export Worksheet to Excel". This converts your data into an Excel format.
We hope you now have a better understanding of how to extract data from PDF to Excel.