In this article, we will show you how to convert a PDF to a CSV file using Lido’s tools and formulas. Simply follow the process below.
We will be using Lido, a spreadsheet created to automate and simplify redundant tasks. You can create your account for free by following this link: https://www.lido.app/go/signup.
The PDF Importer tool is designed to convert specific data selected from PDF documents into a spreadsheet format. If instead you would like to convert the entire PDF document, use Lido's custom formulas as described in methods 2 and 3 below.
Begin by logging into Lido and creating a new spreadsheet. This fresh spreadsheet will serve as the base where the data extracted from your PDF will be stored and managed.
Navigate to the File menu on the top toolbar of Lido and select "Import from PDF" from the dropdown options to open the PDF Importer tool. This tool is equipped to handle the conversion of data from PDF files into a spreadsheet format.
Click on the upload button within the PDF Importer tool interface to select and upload the PDF file from which you want to extract and convert data. Make sure that this is the correct PDF file containing the data you need.
After uploading the PDF, you'll be able to use a selection tool within the PDF Importer to highlight the specific portion of the PDF that contains the data you want to convert. Adjust the selection to fit just the data you need and then click "Extract data" to initiate the conversion process.
The transformed data has been placed into the spreadsheet's active cell. The PDF importer's function is to change data into a format suitable for spreadsheets. When the chosen section is purely text, each text line is allocated to its own cell. In cases where the chosen section has table-like data, this data is then extracted.
Should the chosen section include both table-like data and simple text, the simple text will be disregarded. If you need to transform additional data from the PDF, you should click "Back" at this point. To complete the process, simply exit the modal by clicking the "X" button located in the top right corner.
After ensuring all data has been converted and inserted into the spreadsheet, click the three-dot menu next to the worksheet name. From there, select "Export Worksheet to CSV". This will export the data into a CSV file.
In this approach, we'll employ Lido's unique formula, IMPORTPDF, which transforms all content from the provided PDF document simultaneously. However, IMPORTPDF is not compatible with scanned PDFs. If you're dealing with a scanned PDF, you might want to look into the third method mentioned below, which utilizes the EXTRACTTABLESFROMPDF formula instead.
Begin by logging into your Google Drive and uploading the PDF document you want to convert. Ensure the file is accessible in your Drive as it's necessary for the subsequent steps in Lido.
Open Lido and create a new spreadsheet by navigating to the Files section. This new spreadsheet will be used to handle the data that will be extracted from your PDF.
Add a new worksheet to your spreadsheet by clicking the plus icon, found at the top left of the Lido interface. This worksheet will specifically be used to receive and display the extracted PDF data.
Navigate to cell A1 in your new worksheet and start typing the IMPORTPDF formula.
Click the "Add Credential" button to begin establishing a connection between Lido and your Google Drive, which allows Lido to access the PDF file you've uploaded. Follow the prompts carefully to ensure the account is correctly linked.
After linking your Google account, press the comma key to separate the first part of the formula from the next, then click "Select a file" to bring up a dialog box where you can select the uploaded PDF file from your Google Drive.
Navigate through your Google Drive in the file picker to locate and select the PDF you want to convert.
This specifies that the data extracted by the EXTRACTTABLESFROMPDF formula should be directed to the Sheet1 worksheet, beginning at cell B2.
After the formula is set up, right-click on cell A1 where the formula is entered and choose “Run action” from the context menu. This executes the formula, starting the process of extracting data from the PDF.
Switch over to Sheet1 to inspect the extracted data. Verify that all the information has been extracted accurately and is properly formatted within the spreadsheet cells.
Once you are satisfied that the data is correctly extracted and displayed, click the three-dot menu next to the Sheet1 tab and select "Export Worksheet to CSV".
In this approach, we'll employ Lido's unique formula, EXTRACTTABLESFROMPDF, which is designed to extract anything it identifies as a table from the PDF. This formula is effective on scanned documents.
Log into your Google Drive and upload the PDF file from which you intend to extract table data. Make sure the file is uploaded successfully as you'll need to access it directly from Lido.
Navigate to the Files area in Lido and create a new spreadsheet. This document will serve as your primary workspace where the extracted table data will be placed.
Add a new worksheet to your spreadsheet by clicking the plus sign. This new sheet is where you will input the formula and subsequently display the extracted data.
In the newly created worksheet, go to cell A1 and start typing the EXTRACTTABLESFROMPDF formula. This formula is specifically designed to identify and extract tables from the PDF.
Select "Add Credential" to begin establishing a connection between your Google Drive and Lido. This connection is necessary for Lido to access and process the PDF file you've uploaded.
After successfully linking your account, press the comma key to add a separation in the formula and then choose "Select a file." This opens a file picker where you can select the previously uploaded PDF.
Find and select the PDF document you uploaded earlier.
This specifies that the data extracted by the EXTRACTTABLESFROMPDF formula should be directed to the Sheet1 worksheet, beginning at cell B2.
With the formula ready, right-click on cell A1 where you entered the formula and select "Run action." This command triggers the formula to execute.
Switch to Sheet1 to inspect the extracted data. Ensure that the data, particularly table content, has been extracted correctly and is well-formatted within the worksheet.
Be aware that only data in tables will be extracted. If you need to extract data that isn't in table form, you should consider using methods 1 and 2.
After verifying the extraction accuracy, click the three-dot menu beside Sheet1's tab and select "Export Worksheet to CSV." This allows you to convert and download the data in CSV format, suitable for further use or analysis outside of Lido.
We hope you now have a better understanding of how to convert a PDF to a CSV file using Lido.