Extract text from document images
A demo app which retrives information from multiple PDF docu
Extract text from multilingual invoices
Search for similar text in documents
Extract text from images using OCR
AI powered Document Processing app
Convert images with text to searchable documents
Parse documents to extract structured information
Find relevant text chunks from documents based on queries
Extract information from documents by asking questions
A token classification model identifies and labels specific
GOT - OCR (from : UCAS, Beijing)
中文Late Chunking Gradio服务
Donut is an AI-powered tool designed to extract text from scanned documents. It enables users to convert document images into editable text, making it ideal for tasks like document scanning, data entry, and research. With Donut, you can seamlessly transition from physical or digital document images to usable text formats.
• Multi-language support: Extract text from documents in various languages.
• High accuracy: Advanced OCR technology ensures precise text extraction.
• Multiple file formats: Supports popular image formats like JPG, PNG, and PDF.
• Easy-to-use interface: User-friendly design for quick and efficient text extraction.
• Preserves layout: Maintains the original formatting and structure of the document.
What languages does Donut support?
Donut supports a wide range of languages, including English, Spanish, French, German, Chinese, and many more.
How accurate is the text extraction?
Donut uses advanced AI models to ensure high accuracy, but results may vary depending on image quality and complexity.
Can Donut handle handwritten text?
Donut is primarily designed for printed text. Handwritten text extraction may be less accurate and is not fully supported.