Python3 package for Chinese/English OCR, with paddleocr-v4 o
Extract text from images using OCR
Extract text from images using OCR
Extract text from images
Give it a pdf and it'll extract the text
Convert images to text using OCR
Extract and search text from images
Extract text from images using OCR
Upload an image to extract text
Extract Japanese text from images
Demo of GOT-OCR 2.0's Transformers implementation
Extract text from images
Extract text from documents
Imgocr is a Python3 package designed for Chinese/English OCR (Optical Character Recognition). It utilizes the powerful paddleocr-v4 engine to extract text from images with high accuracy. This tool is ideal for developers and users looking to automate text extraction from scanned documents, images, or other visual sources.
• Multi-language Support: Process both Chinese and English texts with ease.
• High Accuracy: Leveraging paddleocr-v4 ensures precise text recognition.
• PDF Compatibility: Extract text from PDF files in addition to standard images.
• Paragraph Detection: Automatically detect and retain the structure of paragraphs.
• Integration Ready: Easily integrate into your existing workflows or applications.
pip install imgocr
in your terminal to install.from imgocr import Imgocr
in your Python script.Imgocr(r"image_path.jpg")
.ocr()
method to process the image and retrieve text.get_text()
method.What is the accuracy of Imgocr?
Imgocr uses paddleocr-v4, which offers high accuracy for both Chinese and English text recognition, making it suitable for professional use.
Can Imgocr process PDF files?
Yes, Imgocr supports PDF files in addition to standard image formats like JPG, PNG, and BMP.
How do I install Imgocr?
Install Imgocr using pip: pip install imgocr
. Ensure you have Python 3 installed on your system.