- I want to Extract particular Text from PDF File and Store it in a String Variable. And later will use it in Forms. For eg: i want extract the name in the PDF only is there any method to Get Text from PDF.
- May 11, 2015 Re: Search a text inside a PDF file - C# or Javascript language. Test Screen Name May 11, 2015 4:19 AM ( in response to Saikat Malakar ) 1.
- Search PDF files for account numbers and generate spreadsheet-ready report showing document names and page numbers where text is found. Search PDF files for a set of pre-defined keywords. Add keyword(s) to the document metadata.
This functionality can be used for:
- PDF document searching and reporting
- Copying PDF files into multiple folders based on presence of specific text or pattern
- Applying custom document processing based on search results
I am working for text search and extraction from pdf using third party dll itextsharp. I am getting the text on searching but not only that text, the whole text of that page. I thought to use phrases or chunks so that I can get pre-and post of that text only along with it instead of whole page text. Finding text in a PDF file. You can search a PDF file for a word or any set of characters.
- Search PDF files for account numbers and generate spreadsheet-ready report showing document names and page numbers where text is found.
- Search PDF files for a set of pre-defined keywords. Add keyword(s) to the document metadata.
- Search PDF files for a set of pre-defined keywords and insert a different cover page depending on search results.
- Search PDF files for a set of keywords such as 'Invoice', 'Purchase Order', 'Monthly Statement' and etc., copy files into corresponding folder if a match is found. Use 'Copy' functionality to organize files into folder.
AutoBatch Text Search Report
Processing date and time: Sunday, August 30, 2015 07:08:20
Software build date: Aug 30 2015, Adobe Acrobat version: 15.1536
Search text: Account d+, Number of matches: 6
File: C:DataAAccounts.pdf, Page: 1, Text found: Account 3791030334
File: C:DataAAccounts.pdf, Page: 2, Text found: Account 992344112
File: C:DataAAccounts.pdf, Page: 3, Text found: Account 9821112008
File: C:DataAAccounts.pdf, Page: 4, Text found: Account 943734112333
File: C:DataAAccounts.pdf, Page: 5, Text found: Account 48384112001
File: C:DataAAccounts.pdf, Page: 6, Text found: Account 8892237701
Example of CSV file (it can be opened directly by most spreadsheet applications including Microsoft Excel): Processing date and time: Sunday, August 30, 2015 07:08:20
Software build date: Aug 30 2015, Adobe Acrobat version: 15.1536
Search text: Account d+, Number of matches: 6
File: C:DataAAccounts.pdf, Page: 1, Text found: Account 3791030334
File: C:DataAAccounts.pdf, Page: 2, Text found: Account 992344112
File: C:DataAAccounts.pdf, Page: 3, Text found: Account 9821112008
File: C:DataAAccounts.pdf, Page: 4, Text found: Account 943734112333
File: C:DataAAccounts.pdf, Page: 5, Text found: Account 48384112001
File: C:DataAAccounts.pdf, Page: 6, Text found: Account 8892237701
File,Page,Text
C:DataAAccounts.pdf,1,Account 3791030334
C:DataAAccounts.pdf,2,Account 992344112
C:DataAAccounts.pdf,3,Account 9821112008
C:DataAAccounts.pdf,4,Account 943734112333
C:DataAAccounts.pdf,5,Account 48384112001
C:DataAAccounts.pdf,6,Account 8892237701
C:DataAAccounts.pdf,1,Account 3791030334
C:DataAAccounts.pdf,2,Account 992344112
C:DataAAccounts.pdf,3,Account 9821112008
C:DataAAccounts.pdf,4,Account 943734112333
C:DataAAccounts.pdf,5,Account 48384112001
C:DataAAccounts.pdf,6,Account 8892237701