Show-o
Show-o is a transformative multi-modal transformer model for image captioning, visual question answering, and text-to-image generation, enhancing AI research and development.
Show-o model
multi-mode formation