Vidu 1.5 Launch Marks New Emergence in Multimodal AI, to Introduce Groundbreaking Consistency Controls that Reshape the Future of AI Video Production

November 14, 2024 02:45 AM AEDT | By Cision
Follow us on Google News: https://kalkinemedia.com/resources/assets/public/images/google-news.webp

SINGAPORE, Nov. 13, 2024 /PRNewswire/ -- Shengshu Technology officially releases Vidu 1.5, marking the global debut of long-context capabilities similar to those in Large Language Models within its visual model, showcasing a new emergent capability. This is a major upgrade to its groundbreaking multimodal generative models. Vidu 1.5 introduces the world's first Multiple-Entity Consistency capability, which seamlessly integrates people, objects, and environments to create stunning video effects—something that couldn't be achieved, until now.

With Vidu's Multiple-Entity Consistency feature, images with no relation to each other, be it characters, objects and even environments, can be integrated into a single video featuring all three characteristics. Moreover, the resulting generated video from Vidu 1.5 is capable of ensuring visual consistency even with complex inputs that require the processing of multiple subjects or environments – and this has not been possible until today. For example, if Vidu 1.5's generated video features multiple entities, their unique attributes will remain distinct throughout the footage, instead of character traits that tend to blend or become inconsistent midway.

Multiple-Entity Consistency works by enabling users to upload multiple images—for instance, a photo of a person, an image of a rose detailed outfit, and also a shot of a moped. Vidu 1.5 will combine these images into a single continuous, fluid video featuring the person dressed in a rose-accented shirt, riding on a moped. Or by uploading two images of the same individuals alongside a photo of the pyramids, and using a prompt like "a man looking up at the pyramids," Vidu 1.5 would seamlessly combine these elements, delivering a cohesive video that aligns perfectly with your expectation.

Taking it a step further, Vidu 1.5 also introduces Multiple Angle Consistency, a feature that allows you to either generate videos using any inputted images as references or by uploading three photos of a single subject. The model ensures visual continuity and fills in missing details, providing a seamless 360-degree view of the subject. The result is a cohesive video that accurately presents the subject from any angle, enhancing realism and dynamic storytelling. And this also applies to facial movements, accounting for a more natural-feeling continuity between subtle facial expressions.

Beyond presenting stronger character emotions, Vidu 1.5 also introduces Advanced Control. This enables greater precision over camera movements, angles, and cinematic techniques, of course without detracting from the visual consistency that would otherwise yield unwanted transitions or skipping between frames. In fact, advanced cinematography styles including adjustable dynamic ranges offer a higher degree of control over speed and scale resulting in complex shots like zooming, panning and tilting or a combination of these.

With Vidu 1.5, we've significantly enhanced detail generation and clarity at up to 1080p, bringing to life visuals like never before. For instance, you can now clearly see the intricate patterns on a cake or the vivid texture of a sizzling steak amid flames and oozing red juices. This level of detail transforms the storytelling experience, making every frame richer and more immersive.

Vidu 1.5 expands its appeal to animators and anime creators with Expanded Animation Styles, including Japanese fantasy and hyper-realistic styles. This upgrade caters to both casual creators of short-form content and professional filmmakers, delivering polished, production-ready videos with superior clarity and precision compared to other generative AI platforms.

The breakthrough in Vidu 1.5's controls lie in major advancements Shengshu Technology has made in semantic understanding. The update offers more nuanced language comprehension, interpreting text prompts with impressive precision and enabling video outputs that better reflect complex storytelling and scene requirements. Last but not least, Vidu 1.5 generates a 4-second clip in just 25 seconds, showcasing Vidu's ongoing advancements with improving the speed and accuracy of generating video.

"The future of content creation is here, and it is powered by the limitless possibilities of AI," said Jiayu Tang, CEO and co-founder of Shengshu Technology. "Together, we can ignite a global wave of inspiration, reshaping industries, and democratizing creativity. At the core of this transformation lies the ability for anyone to engage in high-quality content production, unlocking new opportunities and breaking down traditional limitations."

To learn more about Vidu 1.5, please visit https://www.vidu.studio/

About Shengshu Technology

Founded in March 2023, Shengshu Technology is a leading innovator in deep generative models, specializing in advanced diffusion probabilistic techniques. In 2022, Shengshu introduced the world's first new technical framework, U-ViT, which explores the fusion of Diffusion Models and Transformer architectures for a wide assortment of multimodal generation tasks. Shengshu Technology made the leap from research to commercialization with the launch of its flagship product, Vidu, in July 2024. This AI video generator enables creators to bring their visions to life, whether for art design, game development, film post-production, or social content creation. Our mission is to develop the world's most advanced multimodal generative models, seamlessly integrating text, images, videos, and 3D content. We are at the forefront of applying generative AI to art, design, gaming, filmmaking, and social media, with the goal of enhancing human creativity and productivity.


Disclaimer

The content, including but not limited to any articles, news, quotes, information, data, text, reports, ratings, opinions, images, photos, graphics, graphs, charts, animations and video (Content) is a service of Kalkine Media Pty Ltd (“Kalkine Media, we or us”), ACN 629 651 672 and is available for personal and non-commercial use only. The principal purpose of the Content is to educate and inform. The Content does not contain or imply any recommendation or opinion intended to influence your financial decisions and must not be relied upon by you as such. Some of the Content on this website may be sponsored/non-sponsored, as applicable, but is NOT a solicitation or recommendation to buy, sell or hold the stocks of the company(s) or engage in any investment activity under discussion. Kalkine Media is neither licensed nor qualified to provide investment advice through this platform. Users should make their own enquiries about any investments and Kalkine Media strongly suggests the users to seek advice from a financial adviser, stockbroker or other professional (including taxation and legal advice), as necessary.
The content published on Kalkine Media also includes feeds sourced from third-party providers. Kalkine does not assert any ownership rights over the content provided by these third-party sources. The inclusion of such feeds on the Website is for informational purposes only. Kalkine does not guarantee the accuracy, completeness, or reliability of the content obtained from third-party feeds. Furthermore, Kalkine Media shall not be held liable for any errors, omissions, or inaccuracies in the content obtained from third-party feeds, nor for any damages or losses arising from the use of such content.
Kalkine Media hereby disclaims any and all the liabilities to any user for any direct, indirect, implied, punitive, special, incidental or other consequential damages arising from any use of the Content on this website, which is provided without warranties. The views expressed in the Content by the guests, if any, are their own and do not necessarily represent the views or opinions of Kalkine Media. Some of the images/music that may be used on this website are copyrighted to their respective owner(s). Kalkine Media does not claim ownership of any of the pictures displayed/music used on this website unless stated otherwise. The images/music that may be used on this website are taken from various sources on the internet, including paid subscriptions or are believed to be in public domain. We have made reasonable efforts to accredit the source wherever it was indicated as or found to be necessary.

This disclaimer is subject to change without notice. Users are advised to review this disclaimer periodically for any updates or modifications.

Two ASX Listed Stocks Giving Bullish Indications

Recent Articles

Investing Tips

Previous Next
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.