Create an animated video from audio and a reference image
Convert spoken words to text
Improve images with text instructions
Fast image relighting using Latent Bridge Matching
Hunyuan-Large樑εδ½ιͺ
Generates a sound effect that matches video shot
Extend images to new sizes using prompts
More advanced and challenging multi-task evaluation
Interact with Florence-2 to analyze images and generate descriptions
Enhance image details and resolution
Generate subtitled videos from YouTube links
Fine-tuning large language model with Gradio UI
Analyze documents to extract text and visualize segmentation
3D generation from sketchs with TRELLIS & sdxl
Voice Clone Multilingual TTS
Compare model answers to questions
Efficient T2V generation
Generate audio from text with different voices
Analyze document layout from images
Generate chat responses with Qwen AI