Kuaishou Technology Unveils Kling AI, Empowering Users To Create Videos Via Text-to-Video Generation
In Brief
Kuaishou Technology unveiled Kling AI, a tool for generating realistic videos from text prompts, positioning it as a rival to OpenAI’s Sora.
Chinese technology company Kuaishou Technology, known for its content community and social platform, unveiled Kling AI, a new tool designed to generate highly realistic videos from text prompts, representing a potential competitor to OpenAI’s Sora model.
Sora is an AI model designed to generate lifelike and creative scenes based on textual input. Central to Sora’s video generation capabilities is a deep learning framework. The model learns associations between words and phrases with various elements such as objects, actions, environments, and stylistic nuances.
While OpenAI’s Sora model is capable of generating videos up to one minute in length, the Kling AI can produce realistic videos reaching up to two minutes with 1080p resolution at 30 frames per second. It supports multiple aspect ratios and achieves a high level of detail through advanced 3D face and body reconstruction techniques, ensuring that its AI-generated videos are virtually indistinguishable from real footage.
Kling AI utilizes a 3D Variational Autoencoder (VAE) to reconstruct faces and bodies, enabling the generation of detailed expressions and limb movements from a single full-body image. This technology is enhanced by a 3D spatiotemporal joint attention mechanism, enhancing the model’s ability to handle complex scenes and movements. This ensures that the generated content maintains consistency with the laws of physics.
The application adopts an open-access approach, although with regional limitations in place. The model is currently functioning in its trial phase.
Kuaishou Technology Unveils KwaiYii LLM And Kolors Text-to-Image Model
Kuaishou Technology is among several Chinese technology companies competing to introduce alternatives to Sora. The text-to-video capability represents its most recent AI innovation, following the introduction of its KwaiYii large language model (LLM) and the text-to-image model Kolors, which debuted in May.
Additionally, the “AI Dancer” feature, accessible within the Kuaishou Technology application and the video production application Kwaiying, enables users to upload a static photo and generate videos featuring the character dancing to specified rhythms and movements.
Disclaimer
In line with the Trust Project guidelines, please note that the information provided on this page is not intended to be and should not be interpreted as legal, tax, investment, financial, or any other form of advice. It is important to only invest what you can afford to lose and to seek independent financial advice if you have any doubts. For further information, we suggest referring to the terms and conditions as well as the help and support pages provided by the issuer or advertiser. MetaversePost is committed to accurate, unbiased reporting, but market conditions are subject to change without notice.
About The Author
Alisa, a dedicated journalist at the MPost, specializes in cryptocurrency, zero-knowledge proofs, investments, and the expansive realm of Web3. With a keen eye for emerging trends and technologies, she delivers comprehensive coverage to inform and engage readers in the ever-evolving landscape of digital finance.
More articlesAlisa, a dedicated journalist at the MPost, specializes in cryptocurrency, zero-knowledge proofs, investments, and the expansive realm of Web3. With a keen eye for emerging trends and technologies, she delivers comprehensive coverage to inform and engage readers in the ever-evolving landscape of digital finance.