Convert text into speech in multiple languages
3D novel view synthesis from any number of images!
Swap a single face
Generate music tracks with instrument settings
Identify objects in images using text queries
Follow visual instructions in Chinese
Select coordinates on an image based on instructions
Generate speech from text with customizable options
Generate 3D models from text descriptions
FLUX.1-Schnell on serverless inference, no GPU required
Interact with video using OpenAI's Vision API
Generate audio by cloning a voice
Extract text from images
Analyze documents to extract and structure text
Convert anime images to sketches
Summarize and classify long texts
Separate vocals from background in audio
Extract text from images
https://huggingface.co/spaces/VIDraft/mouse-webgen
Launch a web interface for text generation