OCR Tool for the 1853 Archive Site
Upload images for accurate English / Latin OCR
Multimodal retrieval using llamaindex/vdr-2b-multi-v1
Query PDF documents using natural language
Traditional OCR 1.0 on PDF/image files returning text/PDF
Extract text from images
Find similar sentences in your text using search queries
RAG with multiple types of loaders like text, pdf and web
Extract named entities from medical text
A token classification model identifies and labels specific
OCR that extract text from image of hindi and english
Spirit.AI
Search documents and retrieve relevant chunks
1853ArchiveOCR is an OCR (Optical Character Recognition) tool designed to extract text from scanned documents and images. It is specifically developed for use with the 1853 Archive Site, making it easier to access and work with historical or archived content.
What formats does 1853ArchiveOCR support?
1853ArchiveOCR supports JPEG, PNG, BMP, and TIFF formats for image uploads.
Can 1853ArchiveOCR handle old or blurry text?
Yes, 1853ArchiveOCR is designed to handle old or blurry text with high accuracy, making it suitable for historical documents.
Where is the extracted text stored?
The extracted text is not stored on the server. It is available for immediate use and can be copied or downloaded by the user.