Zhixiang Future has released HiDream-I1, a 17B parameter open source image generation model, with excellent image quality and first-class prompt word compliance capabilities.
MagicColor launched by the Hong Kong University of Science and Technology uses diffusion models and self-supervised training to achieve efficient multi-instance line-brush coloring, which is suitable for animation production, digital art, game development
Chinese company DeepSeek launches the high-performance open source AI model R1, which enables powerful inference capabilities at a cost far lower than OpenAI.
Sync Labs launches Lipsync-2, the world's first mouth-synced AI model that does not require training, suitable for live-action, animation and AI content generation.
Amazon AI video model Nova Reel has been upgraded to 1.1, supporting up to two minutes of multi-lens video generation, and a new "manual lens control" mode is added.
Google AI mode has added image questioning function, and users can search for related information by photos to enjoy a smarter multimodal search experience.
Google Gemini AI will soon support video format file analysis such as MP4, AVI, 3GP, etc., bringing users more powerful and smarter multimedia processing capabilities.