OCR PDF Files with Asprise Java PDF Reader (with Text Extract)/Writer Library and Asprise OCR Engine
Sample code:
import com.asprise.util.pdf.PDFReader;
import com.asprise.util.ocr.OCR;
PDFReader reader = new PDFReader(new File("my.pdf"));
reader.open(); // open the file.
int pages = reader.getNumberOfPages();
for(int i=0; i < pages; i++) {
BufferedImage img = reader.getPageAsImage(i);
// recognizes both characters and barcodes
String text = new OCR().recognizeAll(image);
System.out.println("Page " + i + ": " + text);
}
reader.close(); // finally, close the file.
For more details on Asprise PDF library, please read Developer's Guide or view the Javadoc.
For more deitals on Asprise OCR engine, please visit this page.
< < Go back to product page
|