pIXELsHAM – Page 10 – blog of links related to computer animation and production technology Sponsored by ReelMatters.com

General OCR Theory – Towards OCR-2.0 via a Unified End-to-end Model – HF Transformers implementation

pIXELsHAM.com

Feb 6, 2025

A.I.

https://huggingface.co/stepfun-ai/GOT-OCR-2.0-hf

GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.

Views : 9
Can AI art be copyrighted? not if there’s no human input, US office says

pIXELsHAM.com

Feb 6, 2025

A.I., quotes, ves

https://www.designboom.com/art/can-ai-art-be-copyrighted-not-if-theres-no-human-input-us-office-01-31-2025

https://www.techspot.com/news/106562-us-copyright-office-rules-out-copyright-ai-created.html

Views : 10
Beej – Guide to Git

pIXELsHAM.com

Feb 6, 2025

production

https://beej.us/guide/bggit

bggit_a4_c_2 Download

Views : 4
QNTM – Developer Philosophy

pIXELsHAM.com

Feb 6, 2025

production, quotes
https://qntm.org/devphilo
- Avoid, at all costs, arriving at a scenario where the ground-up rewrite starts to look attractive
- Aim to be 90% done in 50% of the available time
- Automate good practice
- Think about pathological data
- There is usually a simpler way to write it
- Write code to be testable
- It is insufficient for code to be provably correct; it should be obviously, visibly, trivially correct
Views : 11
Arminas Valunas – “Coca-Cola: Wherever you are.”

pIXELsHAM.com

Feb 4, 2025

A.I., commercials, design

Arminas created this using Juggernaut Xl model and QR Code Monster SDXL ControlNet.

His pipeline:
Static Images – Forge UI.
Upscaled with Leonardo AI universal upscaler.
Animated with Runway ML and Minimax.
Video upscale – Topaz Video AI.
Composited in Adobe Premiere.

Juggernaut Xl download here:
https://civitai.com/models/133005/juggernaut-xl

QR Code Monster SDXL:
https://civitai.com/models/197247?modelVersionId=221829

https://www.linkedin.com/posts/arminas-valunas-b4477255_my-spec-ad-for-coca-cola-wherever-you-are-ugcPost-7292470579431956480-VSBz

Views : 23
Realistic Product Lighting In Blender

pIXELsHAM.com

Feb 4, 2025

lighting

Views :
10
The ONLY Geometry Nodes Tutorial You’ll Ever Need!

pIXELsHAM.com

Feb 4, 2025

blender

Views :
15
Google AI Studio -This Free AI Changes How You Learn Software Forever

pIXELsHAM.com

Feb 4, 2025

A.I.

Views :
12
OpenAI releases o3-mini

pIXELsHAM.com

Feb 3, 2025

A.I.

https://openai.com/index/openai-o3-mini

OpenAI o3-mini is our first small reasoning model that supports highly requested developer features including function calling⁠(opens in a new window), Structured Outputs⁠(opens in a new window), and developer messages⁠(opens in a new window), making it production-ready out of the gate.

o3-mini does not support vision capabilities, so developers should continue using OpenAI o1 for visual reasoning tasks.

ChatGPT Plus, Team, and Pro users can access OpenAI o3-mini starting today, with Enterprise access coming in February. o3-mini will replace OpenAI o1-mini in the model picker, offering higher rate limits and lower latency, making it a compelling choice for coding, STEM, and logical problem-solving tasks.

As part of this upgrade, we’re tripling the rate limit for Plus and Team users from 50 messages per day with o1-mini to 150 messages per day with o3-mini.

Starting today, free plan users can also try OpenAI o3-mini by selecting ‘Reason’ in the message composer or by regenerating a response. This marks the first time a reasoning model has been made available to free users in ChatGPT.

Views : 14
Run DeepSeek R1 Locally

pIXELsHAM.com

Feb 3, 2025

A.I.

DeepSeek Gets an ‘F’ in Safety From Researchers https://gizmodo.com/deepseek-gets-an-f-in-safety-from-researchers-2000558645

Views : 41
The BEST Way to model cars in Blender | Mclaren Senna GTR

pIXELsHAM.com

Feb 3, 2025

blender, modeling

Views :
8
ComfyUI Tutorial – How To Create Consistent Images Using Flux Model in ComfyUI

pIXELsHAM.com

Feb 1, 2025

A.I.

Views :
27
Netflix Eyeline-Research Go-with-the-Flow – An easy and efficient way to control the motion patterns of video diffusion models

pIXELsHAM.com

Feb 1, 2025

A.I.

https://github.com/Eyeline-Research/Go-with-the-Flow

https://huggingface.co/Eyeline-Research/Go-with-the-Flow/tree/main

https://eyeline-research.github.io/Go-with-the-Flow

Views : 14
Hashem Al-Ghaili – Historical Icons Brought Back to Life using AI

pIXELsHAM.com

Feb 1, 2025

A.I.

Views :
21
Heather Cooper – 9 Video Models Comparison: Text to video

pIXELsHAM.com

Feb 1, 2025

A.I.

https://www.linkedin.com/posts/heatherbcooper_video-model-comparison-text-to-video-activity-7290822319407550464-QzUY

🔹 Google DeepMind Veo 2
🔹 OpenAI Sora
🔹 Hunyuan Video
🔹 Pika 2.1
🔹 Alibaba Cloud Wanx 2.1
🔹 Runway Gen-3
🔹 Kling AI 1.6
🔹 Luma AI Ray2
🔹 Hailuo T2V-01

Uncompressed video under the post

(more…)
Views : 9
Learn How to Set Up a Cartoony Facial Rig in Blender With Geometry Nodes

pIXELsHAM.com

Feb 1, 2025

animation, blender

https://80.lv/articles/learn-how-to-set-up-a-cartoony-facial-rig-in-blender-with-geometry-nodes

https://edwardurena.gumroad.com/l/wztetf

Views : 10
BEST 2D ANIMATION RIGGING SOFTWARE

pIXELsHAM.com

Jan 29, 2025

animation

Views :
6
Blender Comic Style Tutorial

pIXELsHAM.com

Jan 29, 2025

blender

Views :
11
Parsec – Remote desktop fast access, 4k, 60 fps app

pIXELsHAM.com

Jan 28, 2025

production, software

https://parsec.app

Views : 11
DimensionX – Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

pIXELsHAM.com

Jan 28, 2025

A.I.

https://chenshuo20.github.io/DimensionX

https://github.com/wenqsun/DimensionX

https://huggingface.co/spaces/fffiloni/DimensionX

https://huggingface.co/wenqsun/DimensionX/tree/main

Views : 9
Brian Gallagher – Why Almost Everybody Is Wrong About DeepSeek vs. All the Other AI Companies

pIXELsHAM.com

Jan 28, 2025

A.I., ves

https://lemalogic.com/post/why-almost-everybody-is-wrong-about-deepseek-vs-all-the-other-ai-companies

Benchmarks don’t capture real-world complexity like latency, domain-specific tasks, or edge cases. Enterprises often need more than raw performance, also needing reliability, ease of integration, and robust vendor support. Enterprise money will support the industries providing these services.

… it is also reasonable to assume that anything you put into the app or their website will be going to the Chinese government as well, so factor that in as well.

Views : 13
ComfyUI-CogVideoXWrapper – Control motion paths in ComfyUI

pIXELsHAM.com

Jan 27, 2025

A.I.

https://github.com/kijai/ComfyUI-CogVideoXWrapper

Views : 10
One-Prompt-One-Story – Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

pIXELsHAM.com

Jan 27, 2025

A.I.

https://byliutao.github.io/1Prompt1Story.github.io

Tneration models can create high-quality images from input prompts. However, they struggle to support the consistent generation of identity-preserving requirements for storytelling.

Our approach 1Prompt1Story concatenates all prompts into a single input for T2I diffusion models, initially preserving character identities.

Views : 15
What did DeepSeek figure out about reasoning with DeepSeek-R1?

pIXELsHAM.com

Jan 27, 2025

A.I.

https://www.seangoedecke.com/deepseek-r1

The Chinese AI lab DeepSeek recently released their new reasoning model R1, which is supposedly (a) better than the current best reasoning models (OpenAI’s o1- series), and (b) was trained on a GPU cluster a fraction the size of any of the big western AI labs.

DeepSeek uses a reinforcement learning approach, not a fine-tuning approach. There’s no need to generate a huge body of chain-of-thought data ahead of time, and there’s no need to run an expensive answer-checking model. Instead, the model generates its own chains-of-thought as it goes.

https://medium.com/@ShankarsPayana/how-deepseek-r1-using-fp8-instead-of-fp32-beat-openai-meta-gemini-and-claude-c105d94d0c39

The secret behind their success? A bold move to train their models using FP8 (8-bit floating-point precision) instead of the standard FP32 (32-bit floating-point precision).
…
By using a clever system that applies high precision only when absolutely necessary, they achieved incredible efficiency without losing accuracy.
…
The impressive part? These multi-token predictions are about 85–90% accurate, meaning DeepSeek R1 can deliver high-quality answers at double the speed of its competitors.

https://www.tweaktown.com/news/102798/chinese-ai-firm-deepseek-has-50-000-nvidia-h100-gpus-says-ceo-even-with-us-restrictions/index.html

Chinese AI firm DeepSeek has 50,000 NVIDIA H100 AI GPUs

Views : 11
Raphael AI – World’s First Unlimited Free AI Image Generator powered by FLUX.1-Dev model

pIXELsHAM.com

Jan 26, 2025

A.I., software

https://raphael.app

Views : 40
Texture Copilot – AI Copilot for 3D Texturing

pIXELsHAM.com

Jan 26, 2025

A.I.

https://ncsoft.github.io/ncresearch/3f0ba4889e331ddbed68c9dd48d845fa18d874de

Views : 16
CaPa – Carve-n-Paint Synthesisfor Efficient 4K Textured Mesh Generation

pIXELsHAM.com

Jan 26, 2025

A.I., modeling

https://ncsoft.github.io/CaPa

https://github.com/ncsoft/CaPa

a novel method for generating hyper-quality 4K textured mesh under only 30 seconds, providing 3D assets ready for commercial applications such as games, movies, and VR/AR.

Views : 14
NVidia DynOMo – Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction

pIXELsHAM.com

Jan 26, 2025

photogrammetry

https://jennyseidenschwarz.github.io/DynOMo.github.io

https://github.com/dvl-tum/DynOMo

Views : 13
LumaLabs Ray2 – A large–scale video generative model

pIXELsHAM.com

Jan 26, 2025

A.I.

https://lumalabs.ai/ray

Views : 7
SurFhead – Affine Rig Blending for Geometrically Accurate 2D Gaussian Surfel-based Head Avatars

pIXELsHAM.com

Jan 26, 2025

photogrammetry

https://summertight.github.io/SurFhead

https://github.com/surfhead2025/surfhead

Views : 9
Spell.Spline – 2D-to-3D generate entire 3D scenes or “Worlds” from an image

pIXELsHAM.com

Jan 26, 2025

A.I.

https://blog.spline.design/introducing-spell

https://spell.spline.design/explore/featured

Views : 18
The Best AI Animation Tool in 2025? (Prompt Battle)

pIXELsHAM.com

Jan 26, 2025

A.I.

Views :
9
Kim Jung Gi – 2020.04.16 Live Drawing

pIXELsHAM.com

Jan 26, 2025

design

Views :
5
Node-it Shading – Teaser for Blender

pIXELsHAM.com

Jan 26, 2025

blender

Views :
14
Fal Video Studio – The first open-source AI toolkit for video editing

pIXELsHAM.com

Jan 25, 2025

A.I., software
https://github.com/fal-ai-community/video-starter-kit

https://fal-video-studio.vercel.app
- 🎬 Browser-Native Video Processing: Seamless video handling and composition in the browser
- 🤖 AI Model Integration: Direct access to state-of-the-art video models through fal.ai
  
  Minimax for video generation
  
  Hunyuan for visual synthesis
  
  LTX for video manipulation
- 🎵 Advanced Media Capabilities:
  
  Multi-clip video composition
  
  Audio track integration
  
  Voiceover support
  
  Extended video duration handling
- 🛠️ Developer Utilities:
  
  Metadata encoding
  
  Video processing pipeline
  
  Ready-to-use UI components
  
  TypeScript support
Views : 18
Tencent Hunyuan3D – an advanced large-scale 3D synthesis system for generating high-resolution textured 3D assets

pIXELsHAM.com

Jan 25, 2025

A.I.

https://github.com/tencent/Hunyuan3D-2

Hunyuan3D 2.0, an advanced large-scale 3D synthesis system for generating high-resolution textured 3D assets. This system includes two foundation components: a large-scale shape generation model – Hunyuan3D-DiT, and a large-scale texture synthesis model – Hunyuan3D-Paint.

The shape generative model, built on a scalable flow-based diffusion transformer, aims to create geometry that properly aligns with a given condition image, laying a solid foundation for downstream applications. The texture synthesis model, benefiting from strong geometric and diffusion priors, produces high-resolution and vibrant texture maps for either generated or hand-crafted meshes. Furthermore, we build Hunyuan3D-Studio – a versatile, user-friendly production platform that simplifies the re-creation process of 3D assets.

It allows both professional and amateur users to manipulate or even animate their meshes efficiently. We systematically evaluate our models, showing that Hunyuan3D 2.0 outperforms previous state-of-the-art models, including the open-source models and closed-source models in geometry details, condition alignment, texture quality, and e.t.c.

Views : 27
Florent Poux – Create 3D Point Cloud Renderings with Blender

pIXELsHAM.com

Jan 25, 2025

blender

https://towardsdatascience.com/the-blender-handbook-for-3d-point-cloud-visualization-and-rendering-1700ebe69c7b

Views : 12
SLAM XCAM 8K VR180 3D Camera

pIXELsHAM.com

Jan 25, 2025

hardware, photography

https://www.kickstarter.com/projects/vr1803dcamera/slam-vr180-3d-ai-camera-smarter-smoother-sharper?ref=axcdoc

8K 30FPS VR180 3D Video | Dual 1/1.5″ CMOS Sensors | 10-bit Color | Snapdragon8 GN2 | Android13 | 6.67″AMOLED|5000mAh |100Mbps Data

Views : 19
Physical Open Waters Used To Create The Water Scenes In ‘Flow’ Is Now Available Publicly

pIXELsHAM.com

Jan 25, 2025

blender

https://www.cartoonbrew.com/tools/the-custom-blender-plug-in-that-was-used-to-create-the-water-scenes-in-flow-is-now-available-publicly-245166.html

https://blendermarket.com/products/physical-open-waters

Views : 8

COLLECTIONS

| Featured AI
| Design And Composition
| Explore posts

POPULAR SEARCHES

unreal | pipeline | virtual production | free | learn | photoshop | 360 | macro | google | nvidia | resolution | open source | hdri | real-time | photography basics | nuke

FEATURED POSTS

Social Links

DISCLAIMER – Links and images on this website may be protected by the respective owners’ copyright. All data submitted by users through this site shall be treated as freely available to share.

Subscribe to PixelSham.com RSS for free — Subscribe to PixelSham.com RSS for free

Views : 7,121