– insightFace precompiled for python 3.x (pick and choose a version that matches your python installation version, you can check that by double clicking on python_embeded\python.exe and look at the top): https://github.com/Gourieff/Assets/tree/main/Insightface To install it use: .\python_embeded\python.exe -m pip install “G:\SOFTWARE\2024\CUSTOM_NODES\insightFace\insightface-0.7.3-cp312-cp312-win_amd64.whl” onnxruntime
Errors: ERROR: pip’s dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. matrix-client 0.4.0 requires urllib3~=1.21, but you have urllib3 2.3.0 which is incompatible. mediapipe 0.10.20 requires numpy<2, but you have numpy 2.2.1 which is incompatible. mediapipe 0.10.20 requires protobuf<5,>=4.25.3, but you have protobuf 5.29.2 which is incompatible. torch 2.5.1+cu124 requires sympy==1.13.1; python_version >= “3.9”, but you have sympy 1.13.3 which is incompatible.
“There are many good reasons to be concerned about the rise of generative AI(…). Unfortunately, there are also many good reasons to be concerned about copyright’s growing prevalence in the policy discourse around AI’s regulation. Insisting that copyright protects an exclusive right to use materials for text and data mining practices (whether for informational analysis or machine learning to train generative AI models) is likely to do more harm than good. As many others have explained, imposing copyright constraints will certainly limit competition in the AI industry, creating cost-prohibitive barriers to quality data and ensuring that only the most powerful players have the means to build the best AI tools (provoking all of the usual monopoly concerns that accompany this kind of market reality but arguably on a greater scale than ever before). It will not, however, prevent the continued development and widespread use of generative AI.”
…
“(…) As Michal Shur-Ofry has explained, the technical traits of generative AI already mean that its outputs will tend towards the dominant, likely reflecting ‘a relatively narrow, mainstream view, prioritizing the popular and conventional over diverse contents and narratives.’ Perhaps, then, if the political goal is to push for equality, participation, and representation in the AI age, critics’ demands should focus not on exclusivity but inclusivity. If we want to encourage the development of ethical and responsible AI, maybe we should be asking what kind of material and training data must be included in the inputs and outputs of AI to advance that goal. Certainly, relying on copyright and the market to dictate what is in and what is out is unlikely to advance a public interest or equality-oriented agenda.”
…
“If copyright is not the solution, however, it might reasonably be asked: what is? The first step to answering that question—to producing a purposively sound prescription and evidence-based prognosis, is to correctly diagnose the problem. If, as I have argued, the problem is not that AI models are being trained on copyright works without their owners’ consent, then requiring copyright owners’ consent and/or compensation for the use of their work in AI-training datasets is not the appropriate solution. (…)If the only real copyright problem is that the outputs of generative AI may be substantially similar to specific human-authored and copyright-protected works, then copyright law as we know it already provides the solution.”
Deepfake technology is a type of artificial intelligence used to create convincing fake images, videos and audio recordings. The term describes both the technology and the resulting bogus content and is a portmanteau of deep learning and fake.
Deepfakes often transform existing source content where one person is swapped for another. They also create entirely original content where someone is represented doing or saying something they didn’t do or say.
Deepfakes aren’t edited or photoshopped videos or images. In fact, they’re created using specialized algorithms that blend existing and new footage. For example, subtle facial features of people in images are analyzed through machine learning (ML) to manipulate them within the context of other videos.
Deepfakes uses two algorithms — a generator and a discriminator — to create and refine fake content. The generator builds a training data set based on the desired output, creating the initial fake digital content, while the discriminator analyzes how realistic or fake the initial version of the content is. This process is repeated, enabling the generator to improve at creating realistic content and the discriminator to become more skilled at spotting flaws for the generator to correct.
The combination of the generator and discriminator algorithms creates a generative adversarial network. A GANuses deep learning to recognize patterns in real images and then uses those patterns to create the fakes. When creating a deepfake photograph, a GAN system views photographs of the target from an array of angles to capture all the details and perspectives. When creating a deepfake video, the GAN views the video from various angles and analyzes behavior, movement and speech patterns. This information is then run through the discriminator multiple times to fine-tune the realism of the final image or video.
You’re being tricked into believing that AI can produce Hollywood-level videos…
We’re far from it.
Yes, we’ve made huge progress.
A video sample like this, created using Kling 1.6, is light-years ahead of what was possible a year ago. But there’s still a significant limitation: visual continuity beyond 5 seconds.
Right now, no AI model can maintain consistency beyond a few seconds. That’s why most AI-generated concepts you’re seeing on your feed rely on 2–5 second cuts – it’s all the tech can handle before things start to fall apart.
This isn’t necessarily a problem for creating movie trailers or spec ads. Trailers, for instance, are designed for quick, attention-grabbing rapid cuts, and AI excels at this style of visual storytelling.
But, making a popular, full-length movie with nothing but 5-second shots? That’s absurd.
There are very few exceptions to this rule in modern cinema (e.g., the Bourne franchise).
To bridge the gap between trailers and full-length cinema, AI creative needs to reach 2 key milestones: – 5-12 sec average: ASL for slower, non-action scenes in contemporary films – think conversations, emotional moments, or establishing shots – 30+ sec sequences: Longer, uninterrupted takes are essential for genres that require immersion – drama, romance, thrillers, or any scene that builds tension or atmosphere
Mastering longer cuts is crucial.
30-second continuous shots require a higher level of craftsmanship and visual consistency – you need that 20-30 seconds of breathing room to piece together a variety of scenes and create a compelling movie.
So, where does AI creative stand now?
AI is already transforming industries like auto, fashion, and CPG. These brands can use AI today because short, 2–5 second cuts work perfectly in their visual language. Consumers are accustomed to it, and it simply works. This psychological dynamic is unlikely to change anytime soon.
But for AI to produce true cinema (not just flashy concepts) it needs to extend its visual consistency. And every GenAI company is racing to get there.
The timeline?
Next year, expect breakthroughs in AI-generated content holding consistency for 10+ seconds. By then, full-length commercials, shows, and movies (in that order) will start to feel more crafted, immersive, and intentional, not just stitched together.
If you’re following AI’s impact on creativity, this is the development to watch. The companies that solve continuity will redefine what’s possible in film.
– Collect: Data from sensors, logs, and user input. – Move/Store: Build infrastructure, pipelines, and reliable data flow. – Explore/Transform: Clean, prep, and detect anomalies to make the data usable. – Aggregate/Label: Add analytics, metrics, and labels to create training data. – Learn/Optimize: Experiment, test, and train AI models.
– Instrumentation and logging: Sensors, logs, and external data capture the raw inputs. – Data flow and storage: Pipelines and infrastructure ensure smooth movement and reliable storage. – Exploration and transformation: Data is cleaned, prepped, and anomalies are detected. – Aggregation and labeling: Analytics, metrics, and labels create structured, usable datasets. – Experimenting/AI/ML: Models are trained and optimized using the prepared data. – AI insights and actions: Advanced AI generates predictions, insights, and decisions at the top.
𝗪𝗵𝗼 𝗺𝗮𝗸𝗲𝘀 𝗶𝘁 𝗵𝗮𝗽𝗽𝗲𝗻 𝗮𝗻𝗱 𝗸𝗲𝘆 𝗿𝗼𝗹𝗲𝘀:
– Data Infrastructure Engineers: Build the foundation — collect, move, and store data. – Data Engineers: Prep and transform the data into usable formats. – Data Analysts & Scientists: Aggregate, label, and generate insights. – Machine Learning Engineers: Optimize and deploy AI models.
FLUX (or FLUX. 1) is a suite of text-to-image models from Black Forest Labs, a new company set up by some of the AI researchers behind innovations and models like VQGAN, Stable Diffusion, Latent Diffusion, and Adversarial Diffusion Distillation
DISCLAIMER– Links and images on this website may be protected by the respective owners’ copyright. All data submitted by users through this site shall be treated as freely available to share.