A.I. – Page 4 – pIXELsHAM

Google Stitch – Transform ideas into UI designs for mobile and web applications

pIXELsHAM.com

Jun 5, 2025

https://stitch.withgoogle.com/

Stitch is available free of charge, with certain usage limits. Each user receives a monthly allowance of 350 generations in Flash mode and 50 generations in Experimental mode. These limits are subject to change.

A.I., production

weavy.ai – Turn your creative vision into scalable workflows. Access all AI models and professional editing tools in one node-based platform

pIXELsHAM.com

Jun 5, 2025

https://www.weavy.ai/

A.I., ves

Runway Partners with AMC Networks Across Marketing and TV Development

pIXELsHAM.com

Jun 4, 2025

https://runwayml.com/news/runway-amc-partnership

Runway and AMC Networks, the international entertainment company known for popular and award-winning titles including MAD MEN, BREAKING BAD, BETTER CALL SAUL, THE WALKING DEAD and ANNE RICE’S INTERVIEW WITH THE VAMPIRE, are partnering to incorporate Runway’s AI models and tools in AMC Networks’ marketing and TV development processes.

A.I.

LumaLabs.ai – Introducing Modify Video

pIXELsHAM.com

Jun 4, 2025

https://lumalabs.ai/blog/news/introducing-modify-video

Reimagine any video. Shoot it in post with director-grade control over style, character, and setting. Restyle expressive actions and performances, swap entire worlds, or redesign the frame to your vision.
Shoot once. Shape infinitely.

A.I., production

How to Build & Sell AI Agents – Ultimate Beginner’s Guide

pIXELsHAM.com

Jun 2, 2025

A.I., Featured, production

N8N.io – From Zero to Your First AI Agent in 25 Minutes

pIXELsHAM.com

Jun 2, 2025

https://n8n.io

https://github.com/n8n-io/self-hosted-ai-starter-kit

A.I.

Transformer Explainer – Interactive Learning of Text-Generative Models

pIXELsHAM.com

Jun 2, 2025

https://github.com/poloclub/transformer-explainer

Transformer Explainer is an interactive visualization tool designed to help anyone learn how Transformer-based models like GPT work. It runs a live GPT-2 model right in your browser, allowing you to experiment with your own text and observe in real time how internal components and operations of the Transformer work together to predict the next tokens. Try Transformer Explainer at http://poloclub.github.io/transformer-explainer
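
To make the "predict the next tokens" step concrete, here is a minimal sketch of how a model's raw output scores (logits) could be turned into probabilities with a temperature-scaled softmax, followed by a greedy pick of the most likely token. The toy vocabulary, the logit values, and the function names are assumptions made up for illustration. They are not taken from GPT-2 or from the Transformer Explainer code.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Convert raw scores into probabilities; a lower temperature sharpens the distribution."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_next_token(vocab, logits, temperature=1.0):
    """Return the most probable token and its probability."""
    probs = softmax_with_temperature(logits, temperature)
    best = max(range(len(vocab)), key=lambda i: probs[i])
    return vocab[best], probs[best]


# Hypothetical toy vocabulary and scores, for illustration only.
vocab = ["cat", "dog", "mat", "sky"]
logits = [1.0, 0.5, 3.0, -1.0]
token, prob = greedy_next_token(vocab, logits)
print(token, round(prob, 3))
```

Real models sample from this distribution rather than always taking the top token. The greedy choice is used here only to keep the sketch deterministic.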

A.I., VR

Henry Daubrez – How to generate VR/360 videos directly with Google VEO

pIXELsHAM.com

May 30, 2025

https://www.linkedin.com/posts/upskydown_vr-googleveo-veo3-activity-7334269406396461059-d8Da

If you prompt VEO for a 360° video (literally write “360°” in the prompt), it can generate a monoscopic 360 video. The next step is to inject the right metadata into the file so that it plays as an actual 360 video.
Once the file is saved with the right metadata, players recognize it as a 360/VR video, so you can play it in VLC and drag with your mouse to look around.

Spatial Media Metadata Injector – for 360 videos
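
The injection step can also be scripted. The sketch below only builds the command line, assuming a local checkout of the open-source Spatial Media Metadata Injector and assuming its command-line form is `spatialmedia -i --stereo=<mode> <input> <output>`. That form is recalled from the tool's README and should be verified before use. The file names and the helper name are hypothetical.

```python
import shlex
import sys

# Assumed stereo modes of the injector CLI; "none" corresponds to monoscopic video.
STEREO_MODES = {"none", "top-bottom", "left-right"}


def build_inject_command(src, dst, stereo="none"):
    """Build (but do not run) the assumed metadata-injection command."""
    if stereo not in STEREO_MODES:
        raise ValueError(f"unknown stereo mode: {stereo}")
    return [sys.executable, "spatialmedia", "-i", f"--stereo={stereo}", src, dst]


# Hypothetical file names, for illustration only.
cmd = build_inject_command("veo_clip.mp4", "veo_clip_360.mp4")
print(shlex.join(cmd))
```

To execute it, the list could be passed to `subprocess.run(cmd, check=True)` from inside the tool's checkout directory.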

A.I., Featured, production

Black Forest Labs released FLUX.1 Kontext

pIXELsHAM.com

May 29, 2025

https://replicate.com/blog/flux-kontext

https://replicate.com/black-forest-labs/flux-kontext-pro

There are three models: two are available now, and a third, open-weight version is coming soon:

FLUX.1 Kontext [pro]: State-of-the-art performance for image editing. High-quality outputs, great prompt following, and consistent results.
FLUX.1 Kontext [max]: A premium model that brings maximum performance, improved prompt adherence, and high-quality typography generation without compromise on speed.
FLUX.1 Kontext [dev] (coming soon): An open-weight, guidance-distilled version of Kontext.

We’re so excited about what Kontext can do that we’ve created a collection of models on Replicate to give you ideas:

Multi-image kontext: Combine two images into one.
Portrait series: Generate a series of portraits from a single image.
Change haircut: Change a person’s hairstyle and color.
Iconic locations: Put yourself in front of famous landmarks.
Professional headshot: Generate a professional headshot from any image.

A.I.

AI Models – A walkthrough by Andreas Horn

pIXELsHAM.com

May 28, 2025

The 8 most important model types and what they’re actually built to do:

1. LLM – Large Language Model
→ Your ChatGPT-style model.
Handles text, predicts the next token, and powers 90% of GenAI hype.
🛠 Use case: content, code, convos.

2. LCM – Latent Consistency Model
→ Lightweight, diffusion-style models.
Fast, quantized, and efficient — perfect for real-time or edge deployment.
🛠 Use case: image generation, optimized inference.

3. LAM – Language Action Model
→ Where LLM meets planning.
Adds memory, task breakdown, and intent recognition.
🛠 Use case: AI agents, tool use, step-by-step execution.

4. MoE – Mixture of Experts
→ One model, many minds.
Routes input to the right “expert” model slice — dynamic, scalable, efficient.
🛠 Use case: high-performance model serving at low compute cost.

5. VLM – Vision Language Model
→ Multimodal beast.
Combines image + text understanding via shared embeddings.
🛠 Use case: Gemini, GPT-4o, search, robotics, assistive tech.

6. SLM – Small Language Model
→ Tiny but mighty.
Designed for edge use, fast inference, low latency, efficient memory.
🛠 Use case: on-device AI, chatbots, privacy-first GenAI.

7. MLM – Masked Language Model
→ The OG foundation model.
Predicts masked tokens using bidirectional context.
🛠 Use case: search, classification, embeddings, pretraining.

8. SAM – Segment Anything Model
→ Vision model for pixel-level understanding.
Highlights, segments, and understands *everything* in an image.
🛠 Use case: medical imaging, AR, robotics, visual agents.
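
As an illustration of the Mixture of Experts idea in item 4 (routing an input to the right "expert" slice), here is a minimal sketch of top-k gating. The gate scores, the toy expert functions, and the function names are all hypothetical. In a real model the scores come from a learned router and the experts are neural network layers.

```python
import math


def softmax(scores):
    """Turn raw gate scores into probabilities."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores, k=2):
    """Pick the k highest-scoring experts and renormalise their weights to sum to 1."""
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, gate_scores, k=2):
    """Run only the selected experts and blend their outputs by gate weight."""
    return sum(w * experts[i](x) for i, w in route_top_k(gate_scores, k))


# Hypothetical toy experts and gate scores, for illustration only.
experts = [lambda x: x + 1, lambda x: x * 2, lambda x: x ** 2, lambda x: -x]
gate_scores = [0.1, 2.0, 1.5, -1.0]
routing = route_top_k(gate_scores, k=2)
output = moe_forward(3.0, experts, gate_scores, k=2)
print(routing, output)
```

Only two of the four experts run for this input, which is where the compute saving comes from.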

A.I., photogrammetry

Spaitial.ai – Spatial Foundation Models

pIXELsHAM.com

May 28, 2025

https://www.spaitial.ai/

A.I.

Introducing ComfyUI Native API Nodes

pIXELsHAM.com

May 22, 2025

https://blog.comfy.org/p/comfyui-native-api-nodes

Models Supported

Black Forest Labs Flux 1.1 [pro] Ultra, Flux.1 [pro]
Kling 2.0, 1.6, 1.5 & Various Effects
Luma Photon, Ray2, Ray1.6
MiniMax Text-to-Video, Image-to-Video
PixVerse V4 & Effects
Recraft V3, V2 & Various Tools
Stability AI Stable Image Ultra, Stable Diffusion 3.5 Large
Google Veo 2
Ideogram V3, V2, V1
OpenAI GPT-4o image
Pika 2.2

A.I., production

ComfyUI-CoCoTools_IO – A set of nodes focused on advanced image I/O operations, particularly for EXR file handling

pIXELsHAM.com

May 21, 2025

https://github.com/Conor-Collins/ComfyUI-CoCoTools_IO

Features

Advanced EXR image input with multilayer support
EXR layer extraction and manipulation
High-quality image saving with format-specific options
Standard image format loading with bit depth awareness

Current Nodes

Image I/O

Image Loader: Load standard image formats (PNG, JPG, WebP, etc.) with proper bit depth handling
Load EXR: Comprehensive EXR file loading with support for multiple layers, channels, and cryptomatte data
Load EXR Layer by Name: Extract specific layers from EXR files (similar to Nuke’s Shuffle node)
Cryptomatte Layer: Specialized handling for cryptomatte layers in EXR files
Image Saver: Save images in various formats with format-specific options (bit depth, compression, etc.)
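
Multilayer EXR files conventionally encode layers in the channel names, with a period separating the layer from the component (for example `diffuse.R`). The sketch below groups channel names by layer under that convention. It does not reflect how ComfyUI-CoCoTools_IO is implemented, and the channel list and the `rgba` default layer name are assumptions chosen for illustration.

```python
from collections import defaultdict


def group_channels_by_layer(channel_names, default_layer="rgba"):
    """Group EXR-style channel names ("layer.component") by their layer prefix."""
    layers = defaultdict(list)
    for name in channel_names:
        # Split on the last period so nested names like "light.key.R" keep "light.key".
        layer, sep, component = name.rpartition(".")
        layers[layer if sep else default_layer].append(component)
    return dict(layers)


# Hypothetical channel list, as a multilayer render might expose it.
channels = ["R", "G", "B", "A", "diffuse.R", "diffuse.G", "diffuse.B", "depth.Z"]
layers = group_channels_by_layer(channels)
print(layers)
```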

Image Processing

Colorspace: Convert between sRGB and Linear colorspaces
Z Normalize: Normalize depth maps and other single-channel data
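
For reference, the two processing operations can be sketched on plain floats. The transfer functions below are the standard piecewise sRGB curves. The min-max depth normalization is one common choice and is only an assumption about what a "Z Normalize" step might do, not a description of the node's actual behavior.

```python
def srgb_to_linear(c):
    """Decode one sRGB-encoded value in [0, 1] to linear light."""
    return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4


def linear_to_srgb(c):
    """Encode one linear-light value in [0, 1] with the sRGB curve."""
    return 12.92 * c if c <= 0.0031308 else 1.055 * c ** (1 / 2.4) - 0.055


def normalize_depth(values):
    """Rescale single-channel data to [0, 1]; a flat input maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


mid_grey_linear = srgb_to_linear(0.5)
round_trip = linear_to_srgb(mid_grey_linear)
depth = normalize_depth([2.0, 4.0, 6.0])
print(mid_grey_linear, round_trip, depth)
```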

A.I.

Google AI – Meet Flow: AI-powered filmmaking with Veo 3

pIXELsHAM.com

May 21, 2025

https://blog.google/technology/ai/google-flow-veo-ai-filmmaking-tool/

A.I., blender

NVIDIA – 3D Guided Generative AI restyling in Blender

pIXELsHAM.com

May 21, 2025

https://build.nvidia.com/nvidia/genai-3d-guided

https://github.com/NVIDIA-AI-Blueprints/3d-guided-genai-rtx

A.I., production

Tim Riopelle – Restyling basic models to control genAI output

pIXELsHAM.com

May 21, 2025

https://www.linkedin.com/feed/update/urn:li:activity:7310425559509921792/

A.I.

OpenAI GPT-4.1 Prompting Guide

pIXELsHAM.com

May 20, 2025

OpenAI GPT-4.1 Prompting Guide (download)

A.I., animation

Cartwheel – Jeffrey Katzenberg-Backed AI Animation Software Raises $10 Million

pIXELsHAM.com

May 20, 2025

https://www.cartoonbrew.com/tech/jeffrey-katzenberg-backed-ai-animation-software-cartwheel-raises-10-million-247264.html

https://getcartwheel.com/home

A.I., production

Version Zero AI has a splines output solution for AI/ML rotoscoping

pIXELsHAM.com

May 20, 2025

https://beforesandafters.com/2025/05/20/version-zero-ai-has-a-splines-output-solution-for-ai-ml-rotoscoping/

https://www.vzerostudios.com/

A.I., commercials

Claudio Tosti – La vita pittoresca dell’abate Uggeri

pIXELsHAM.com

May 18, 2025

https://vivariumnovum.it/saggistica/varia/la-vita-pittoresca-dellabate-uggeri

Book author: Claudio Tosti
Title: La vita pittoresca dell’abate Uggeri – Vol. I – La Giornata Tuscolana

ISBN: 978-8895611990

Video made with Pixverse.ai and DaVinci Resolve

A.I., production

DiffusionToolkit – An image metadata-indexer and viewer for AI-generated images

pIXELsHAM.com

May 13, 2025

https://github.com/RupertAvery/DiffusionToolkit

It aims to help you organize, search and sort your ever-growing collection.

https://github.com/RupertAvery/DiffusionToolkit/blob/master/Diffusion.Toolkit/Tips.md
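
Metadata indexing of this kind typically starts by reading text chunks embedded in the image file. The sketch below parses `tEXt` chunks from PNG bytes using only the standard library. It is not DiffusionToolkit's code. The `parameters` keyword mirrors a convention used by some Stable Diffusion front ends and is treated as an assumption here, and the sample bytes are a hand-built stub rather than a viewable image.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data):
    """Return {keyword: text} for every tEXt chunk in the given PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        length, chunk_type = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            # tEXt payload: keyword, a null separator, then Latin-1 text.
            keyword, _, text = body.partition(b"\x00")
            texts[keyword.decode("latin-1")] = text.decode("latin-1")
        elif chunk_type == b"IEND":
            break
        pos += 12 + length  # 4 length + 4 type + data + 4 CRC
    return texts


def make_chunk(chunk_type, body):
    """Assemble one PNG chunk (helper used only to build the sample below)."""
    crc = zlib.crc32(chunk_type + body) & 0xFFFFFFFF
    return struct.pack(">I", len(body)) + chunk_type + body + struct.pack(">I", crc)


# Hand-built stub: just enough chunk structure to exercise the parser.
sample = (
    PNG_SIGNATURE
    + make_chunk(b"tEXt", b"parameters\x00a red fox, Steps: 20")
    + make_chunk(b"IEND", b"")
)
metadata = read_png_text_chunks(sample)
print(metadata)
```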

A.I.

Mape – ComfyUI Helpers

pIXELsHAM.com

May 13, 2025

https://comfyui.ma.pe/

Multi-monitor image preview
Variable Assignment/Wireless Nodes
Prompt Tweaking
Command Palette (Shift+P)
Pinned favourite nodes
Fuzzy search
Auto organize nodes
Error management
Node navigation
Node time tracking
Hidden node connections

A.I., modeling

Bevelify.com – High-Quality Text-to-3D and Image-to-3D Models in Minutes

pIXELsHAM.com

May 13, 2025

https://bevelify.com/

3Dprinting, A.I., blender, Featured, modeling, photogrammetry, production, software

Convert 2D Images or Text to 3D Models

pIXELsHAM.com

May 13, 2025

HyperHuman Rodin – free and pay per use
LumaLabs Genie – free, web based
Vectary – free and monthly fees, web based
Selva3D – pay per use
3D-Tool – one off payment
Insight3d – free and open-source
ItsLitho – free, web based
Blender – free
SculptGL – free, web based
Embossify – pay per use, web based
Smoothie-3D – free, web based
ZW3D – one off payment
ImageToSTL – free, web based
Alpha3D – First 50 AI-generated 3D assets free
Reliefmod – free, web based
3D Builder – free
Cube by CSM.ai – web based
Kaedim3D – monthly and pay per use fees
3DForPrint – free, web based
ZoeDepth – free, web based
ZoeDepth Colab notebook – easy to use interface for the depth estimation model “ZoeDepth”
TilingZoeDepth – higher resolution, free, web based
DreamGaussian – free, web based
Photoshop Neural Filters – monthly fees
DepthR – pay per use
Materialize – open source
VistaSculpt – monthly fees
STL2PNG Converter – free
Hi3D – free
Meshy.ai – Text to 3D – free and pay per use
Shapen – Free daily generation
CGDream – 3000 free monthly credits or monthly fees
Tripo3D (aka TripoAI) – 600 free monthly credits or monthly fees
DeepAI: 3D Models – free
BambuLab MakerLabs Image-to-3D – subscription based
Hunyuan 3D – free
3dAi Studio Prims – credits based
Bevelify – free and subscriptions

https://www.news.viverse.com/post/pixel-to-polygon-converting-2d-images-to-3d-models-top-tools-revealed

19 Tools to Instantly Convert 2D Images to 3D Ones | 2025 Edition

https://www.linkedin.com/posts/andrew-price-%F0%9F%93%8Dgdc-17678911_testing-whether-3d-artists-should-be-worried-activity-7307446107289047040-fedm

A.I., jokes, trailers

Darri Thorsteinsson – America’s Funniest AI Home Videos

pIXELsHAM.com

May 12, 2025

A.I., production

What is an AI Agent + OpenAI Practical Guide to Building AI Agents

pIXELsHAM.com

May 10, 2025

If you’re serious about AI Agents, this is the guide you’ve been waiting for. It’s packed with everything you need to build powerful AI agents. It takes a hands-on approach that saves you time and helps you avoid the common mistakes most developers make.

Andreas Horn on AI Agents vs Agentic AI

1. AI Agents: Tools with Autonomy, Within Limits
➜ AI agents are modular, goal-directed systems that operate within clearly defined boundaries. They’re built to:
* Use tools (APIs, browsers, databases)
* Execute specific, task-oriented workflows
* React to prompts or real-time inputs
* Plan short sequences and return actionable outputs

They’re excellent for targeted automation, such as customer support bots, internal knowledge search, email triage, meeting scheduling, and code suggestions.

But even the most advanced are limited by scope. They don’t initiate. They don’t collaborate. They execute what we ask!

2. Agentic AI: A System of Systems
➜ Agentic AI is an architectural leap. It’s not just one smarter agent — it’s multiple specialized agents working together toward shared goals. These systems exhibit:
* Multi-agent collaboration
* Goal decomposition and role assignment
* Inter-agent communication via memory or messaging
* Persistent context across time and tasks
* Recursive planning and error recovery
* Distributed orchestration and adaptive feedback

Agentic AI systems don’t just follow instructions. They coordinate. They adapt. They manage complexity.

Examples include research teams powered by agents, smart home ecosystems optimizing energy and security, and swarms of robots in logistics or agriculture managing real-time uncertainty.

The Core Difference?
AI Agents = autonomous tools for single-task execution
Agentic AI = orchestrated ecosystems for workflow-level intelligence
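
The distinction can be sketched in a few lines. Below, an `Agent` only executes the single tool call it is handed, while an `Orchestrator` assigns steps to several agents and keeps shared memory across them. All class names, tools, and the fixed plan are hypothetical. A real agentic system would derive the plan from the goal (for example with an LLM) and handle errors and replanning.

```python
class Agent:
    """A single tool-using agent: it executes the task it is given, nothing more."""

    def __init__(self, name, tools):
        self.name = name
        self.tools = tools  # mapping of tool name -> callable

    def run(self, tool_name, payload):
        if tool_name not in self.tools:
            raise KeyError(f"{self.name} has no tool named {tool_name}")
        return self.tools[tool_name](payload)


class Orchestrator:
    """Agentic layer: assigns steps to agents and keeps context across steps."""

    def __init__(self, agents):
        self.agents = agents  # mapping of agent name -> Agent
        self.memory = []      # shared record of every step's result

    def run(self, plan, initial):
        value = initial
        for agent_name, tool_name in plan:
            value = self.agents[agent_name].run(tool_name, value)
            self.memory.append((agent_name, tool_name, value))
        return value


# Hypothetical toy tools standing in for real APIs or models.
researcher = Agent("researcher", {"search": lambda q: f"notes on {q}"})
writer = Agent("writer", {"summarize": lambda notes: notes.upper()})
team = Orchestrator({"researcher": researcher, "writer": writer})
result = team.run([("researcher", "search"), ("writer", "summarize")], "EXR workflows")
print(result)
```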

Next, here are the top 10 key takeaways from OpenAI’s guide:

(more…)

A.I., quotes, ves

Jeffrey Ian Wilson – The Hidden Risks of Using ChatGPT and Anonymous AI Tools in non-secured Confidential Workflows Outside Proper Production Pipelines

pIXELsHAM.com

May 9, 2025

https://www.linkedin.com/pulse/hidden-risks-using-chatgpt-anonymous-ai-tools-workflows-wilson-govcc

What You Can Do Today

If you’re serious about protecting your IP, client relationships, and professional credibility, you need to stop treating generative AI tools like consumer-grade apps. This isn’t about fear; it’s about operational discipline. Below are immediate steps you can take to reduce your exposure and stay in control of your creative pipeline.

Use ChatGPT via the API, not the public app, for any sensitive data.
Isolate ComfyUI in a sandboxed VM, Docker container, or offline machine.
Audit every custom node; don’t blindly trust GitHub links or ComfyUI workflows.
Educate your team; a single mistake can leak an unreleased game asset, a feature film script, or trade secrets.
Remember that open source does not mean secure.
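
As a starting point for auditing custom nodes, a script can flag Python source that imports networking or process-spawning modules or calls functions such as `eval`. The sketch below uses the standard `ast` module. The lists of risky names are assumptions chosen for illustration, and a clean result is not a security guarantee. It only helps decide which files deserve a manual read first.

```python
import ast

# Illustrative, incomplete lists; tune them to your own threat model.
RISKY_CALLS = {"eval", "exec", "system", "popen", "Popen", "urlopen"}
RISKY_MODULES = {"subprocess", "socket", "requests", "urllib", "ctypes"}


def audit_source(source):
    """Return sorted (line, finding) pairs for risky imports and calls in Python source."""
    findings = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            for alias in node.names:
                if alias.name.split(".")[0] in RISKY_MODULES:
                    findings.append((node.lineno, f"import {alias.name}"))
        elif isinstance(node, ast.ImportFrom):
            if (node.module or "").split(".")[0] in RISKY_MODULES:
                findings.append((node.lineno, f"from {node.module} import ..."))
        elif isinstance(node, ast.Call):
            func = node.func
            # Plain names (eval) and attribute calls (subprocess.Popen) are both checked.
            name = func.id if isinstance(func, ast.Name) else getattr(func, "attr", None)
            if name in RISKY_CALLS:
                findings.append((node.lineno, f"call to {name}()"))
    return sorted(findings)


# Hypothetical custom-node source, for illustration only.
sample_node = '''import subprocess
def run(x):
    subprocess.Popen(["curl", x])
    return eval(x)
'''
findings = audit_source(sample_node)
for line, message in findings:
    print(f"line {line}: {message}")
```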

A.I., commercials

The Rhythm Of Life – Vodafone full AI commercial

pIXELsHAM.com

May 6, 2025

A.I.

USTC Ev-DeblurVSR – Event-Enhanced Blurry Video Super-Resolution

pIXELsHAM.com

May 5, 2025

https://github.com/DachunKai/Ev-DeblurVSR

https://arxiv.org/pdf/2504.13042

A.I., ves

Disney Hacker Admits Using Malware-Laced AI Art App to Achieve Breach

pIXELsHAM.com

May 5, 2025

https://cyberinsider.com/disney-hacker-admits-using-malware-laced-ai-art-app-to-achieve-breach/

A 25-year-old Santa Clarita man has agreed to plead guilty to hacking a Disney employee’s personal computer, stealing login credentials, and exfiltrating 1.1 terabytes of confidential data from internal Slack channels.

The charges stem from a targeted cyberattack carried out in the spring and summer of 2024 that compromised Disney’s internal communications and led to the public leak of sensitive corporate data.

“Kramer, operating under the alias ‘NullBulge,’ created and distributed a malicious program disguised as an AI art generation tool. He uploaded this trojanized application to GitHub and other public repositories in early 2024, enticing users interested in generative AI. At least three victims, including one Disney employee, downloaded the program. Once executed, the software provided Kramer with remote access to the victims’ machines and stored credentials.”

After infiltrating the employee’s personal system, Kramer obtained corporate Slack credentials, used them to enter Disney’s internal Slack workspace, and downloaded around 1.1 terabytes of data from nearly 10,000 channels, including unreleased media projects, internal code, links to APIs, and credentials for internal web services.