In this article, we will show you how to extract tables from PDF using an online tool called Lido. Simply follow the process below.
The tool we will be using is called Lido. It is a spreadsheet for automating and simplifying repetitive tasks. You can make an account by following this link: https://www.lido.app/go/signup.
Here's how you can extract a table from a PDF file using the PDF importer tool in Lido.
Log into Lido and open a new, blank spreadsheet. This will be your workspace where the extracted table data from the PDF will be stored and managed.
Navigate to the File menu at the top of your Lido interface and select "Import from PDF". This tool is specifically designed to extract data, such as tables, from PDF documents and place it into your spreadsheet.
Click the "Click to upload" button within the PDF Importer tool to browse and select the PDF file from which you want to extract tables. Ensure you select the correct file containing the relevant data.
After uploading your PDF, a preview will appear allowing you to select the area containing the table you wish to extract. Adjust the selection box to precisely encompass the entire table, then click "Extract data" to initiate the extraction process.
Check the extracted table to ensure it appears correctly and all relevant data has been captured.
If the extraction is accurate, click on "Insert at active cell" to place the table into the spreadsheet at the selected cell.
If additional tables need to be extracted, click "Back" to return to the selection phase; otherwise, close the tool by clicking the "X" in the top right corner of the window.
Confirm that the table is now correctly positioned and formatted within your document. This step completes the extraction process, and the data is now ready for any further analysis or manipulation.
In this method, we will use Lido’s custom formula EXTRACTTABLESFROMPDF which extracts everything from the PDF it recognized as a table.
Access your Google Drive account and upload the PDF file that contains the tables you want to extract. Ensure the PDF is accessible online, as Lido will need to retrieve it directly from Google Drive.
Create a new spreadsheet in Lido by opening a fresh, blank document in the Files page. This spreadsheet will serve as your workspace for the table data you are about to extract from the PDF.
Add a new worksheet to your spreadsheet by clicking the plus button. This new sheet will be where you input the formula and view the extracted data.
Go to cell A1 in the new worksheet and begin typing the EXTRACTTABLESFROMPDF formula. This formula is designed to identify and extract table data from the specified PDF.
Initiate the credential linking process by clicking "Add Credential". This step is crucial as it allows Lido to access your Google Drive and the PDF file you uploaded, ensuring the formula can retrieve the necessary data.
After adding your credentials, press the comma key to proceed to the next part of the formula where you will need to select the specific PDF file. Click "Select a file" to bring up your Google Drive files in a dialog box.
Navigate through your files to locate and select the PDF from which you want to extract tables. This step links the chosen PDF directly to the formula in your spreadsheet.
The final argument of the EXTRACTTABLESFROMPDF formula indicates where the extracted data should be placed. In this instance, we are specifying that the data should go into worksheet Sheet1, beginning at cell B2.
With the formula set, right-click on cell A1 where the formula is entered and select ‘Run action’ from the context menu. This command will execute the formula, triggering the extraction of table data from the PDF.
Finally, navigate to "Sheet1" to verify that the tables have been extracted correctly and are displayed as expected in the spreadsheet. Check for accuracy and completeness of the data.
We hope you now have a better understanding of how to extract tables from a PDF file.