PDFill | Overview | First | Previous | Next | Last
PDF Document Management 18: FREE PDF Extract Text Online Tools
PDF Document Management 18: FREE PDF Extract Text Desktop Tools
This function provides method for extracting text from the selected pages, and presents the results in a variety of formats.
Batch (DOS) Command Support: You can start a batch job in Windows by issuing the execution command directly from the MS-DOS command prompt window without opening the PDFill GUI.
Here are the steps on how to use PDF Extract Text:
1. Choose Document Menu > Select a File for More Operations > FREE Extract Text (Select a File)
or click FREE Extract Text Button
in the Document Toolbar.
2. Select a PDF to be extracted into text.
3. Here is the dialog of PDF Extract Text's Properties:
- All Pages or Select Pages
- Output each PDF Page Separately
- Unicode Text Format
- Open TXT File Automatically after Saving
- Open Folder Automatically after Saving
Text Option:
1: Human readable format
2: Human Readable Format with Unformatted Lines
3: CSV String with X, Y, Color, Size, Font and Text
4: CSV String: Font, Color, Size, X1, Y1, X2, Y2, X3, Y3, X4, Y4, Text
5: Similar to option 4, but individual words are returned
6: Similar to option 4 but character widths are output after each block of text
7: Similar to option 5 but character widths are output after each line of text
4. Click the button Extract and save into a new Text file or files.
		Batch (DOS) Command Support: 
            
You can start a batch job in Windows by issuing the 
        execution command directly from the MS-DOS command prompt window without 
        opening the PDFill GUI. 
(It is only available for the registered user of PDFill PDF Editor)
"C:\Program Files\PlotSoft\PDFill\PDFill.exe" ExtractText Input.pdf Output.txt TextOption1to7 UnicodeTextFortmat OutputPageSeparately -SelectPages "1-2" -HeaderText "Test<<Page 1 of n>>"
- TextOption1to7 (1-7): 1: Human readable format; 2: Human Readable Format with Unformatted Lines; 3: CSV String with X, Y, Color, Size, Font and Text; 4: CSV String: Font, Color, Size, X1, Y1, X2, Y2, X3, Y3, X4, Y4, Text; 5: Similar to option 4, but individual words are returned; 6: Similar to option 4 but character widths are output after each block of text; 7: Similar to option 5 but character widths are output after each line of text
- UnicodeTextFortmat (0/1): ANSI Text Format (0); Unicode Text Format(1)
- OutputPageSeparately (0/1): Output all PDF Pages into one file(0); Separately Output each PDF Page Separately(1)
- -SelectPages: Selection Page Number String (Optional), "1,2-3,4-last". Default is "" for all pages.
- -HeaderText: To add automatic page numbering, select Format (Optional): <<1>>; <<1 of n>>; <<1/n>>; <<Page 1>>; <<Page 1 of n>>; <<Bates#6#Offset#100>>
- Example:
"C:\Program Files\PlotSoft\PDFill\PDFill.exe" ExtractText "D:\BatchTest\Input.pdf" "D:\BatchTest\output.txt" 4 1 1 -SelectPages "1-2" -HeaderText "Test <<Page 1 of n>> "
PDFill Copyright 2002-2022 by PlotSoft L.L.C.. All rights reserved.