AIDock
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance

The VQGAN-CLIP framework enables open-domain image generation and editing, leveraging pre-trained VQGAN for visual synthesis and CLIP for semantic guidance. This approach facilitates high-quality visual outputs and complex natural language-guided manipulations without requiring additional task-specific training, making advanced capabilities broadly accessible.

View blog
Resources
There are no more papers matching your filters at the moment.