Florence 2
Analyze images to generate captions, detect objects, or perform OCR
Image Face Upscale Restoration-GFPGAN
Enhance and upscale images with face restoration
Florence2 + SAM2
Segment objects in images and videos using text prompts
Marigold Depth Estimation
Generate depth maps from images
DeepDanbooru
Tag images with labels
FitDiT
Try on clothes virtually with a high-fidelity model
OmniParser demo
Convert images of screens to structured elements
ShowUI
Generate clickable coordinates on a screenshot
Better Florence 2
Interact with Florence-2 to analyze images and generate descriptions
MatchAnything
Find similar images from a collection
Marigold-LCM Depth Estimation (Deprecated)
Generate 3D depth maps from images and videos
Dpt Depth Estimation
Generate depth maps from images
OCR Image To Text
Extract text from images using OCR
Image Matching Webui
Find similar images by uploading a photo
Sapiens Segmentation
Segment body parts in images
Llava Llama-3 8B
Meta Llama 3 8B with LLaVA multimodal capabilities
Chicago Gallery
Art Institute of Chicago Gallery
Clip Demo
Find images matching a text query
StableNormal
Estimate surface normals from images and videos
Background Removal Arena
Vote on background-removed images to rank models