Extract information from Indonesian receipts
Submit your Hugging Face username to check certification progress
Chat with PDFs using OpenAI GPT
Extract bibliographic data from PDFs
Extract structured data from documents using images
Display a welcome message on a web page
Find answers in documents
Edit a README.md file for an organization card
All paper summaries read by Merve
Create a custom PDF CV from Markdown and image
Generate a detailed report on your dataset
Upload documents and ask questions
Display blog posts with summaries
Donut Base Finetuned Cord V2 is a specialized AI model designed for document analysis, particularly focused on extracting information from Indonesian receipts. It is optimized to accurately identify and retrieve key details such as dates, amounts, items, and other relevant data from receipt documents. This model is a fine-tuned version of the Donut Base model, tailored for specific use cases in Indonesian languages and contexts.
• High Accuracy for Indonesian Receipts: Specifically trained to handle receipts in Indonesian language and format. • Comprehensive Data Extraction: Capable of identifying and extracting dates, totals, items, and other relevant fields from receipts. • Efficient Processing: Optimized for quick and accurate document analysis. • General Document Understanding: While specialized for receipts, it can handle other document types to some extent. • Integration-Friendly: Designed to be easily integrated into workflows or applications requiring receipt data extraction.
CORD
library or similar tools.Example code snippet (pseudo-code):
from cord import DonutBaseFinetunedCordV2
model = DonutBaseFinetunedCordV2()
receipt_image = "path/to/indonesian_receipt.jpg"
result = model.process(receipt_image)
extracted_data = result.extract()
1. What types of receipts does Donut Base Finetuned Cord V2 support?
Donut Base Finetuned Cord V2 is primarily designed for Indonesian receipts, including retail, food, and service receipts. It may work with other types of receipts to some extent, but accuracy is highest with Indonesian formats.
2. Can Donut Base Finetuned Cord V2 handle non-Indonesian receipts?
While the model is optimized for Indonesian receipts, it may still process receipts in other languages or formats, but accuracy will vary. For non-Indonesian receipts, consider using a more general-purpose document analysis model.
3. What formats does Donut Base Finetuned Cord V2 support?
The model supports images of receipts (e.g., PNG, JPG) and potentially PDF formats. Text-based input may also be supported, depending on the implementation.
4. Is Donut Base Finetuned Cord V2 useful for documents other than receipts?
While primarily designed for receipts, the model can be applied to other structured documents with similar layouts. However, its performance may not be as robust as models specifically trained for those documents.