pIXELsHAM – blog of links related to computer animation and production technology Sponsored by ReelMatters.com

SynthLight – Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces

pIXELsHAM.com

Jan 20, 2025

A.I., lighting

https://vrroom.github.io/synthlight

Views : 20
Shapen – Pixels to polygons text-to-model

pIXELsHAM.com

Jan 20, 2025

A.I., modeling

https://shapen.com

Views : 20
Seaweed APT – Diffusion Adversarial Post-Training for One-Step Video Generation

pIXELsHAM.com

Jan 20, 2025

A.I.

https://seaweed-apt.com

https://cdn.seaweed-apt.com/assets/showreel/seaweed-apt.mp4

This work demonstrates large-scale text-to-video generation with a single neural function evaluation (1NFE) by using our proposed adversarial post-training technique. Our model generates 2 seconds of 1280×720, 24fps video in real time.

Views : 37
This ONE Step Makes CG Look Cinematic

pIXELsHAM.com

Jan 20, 2025

composition, lighting

Views : 22
Pyper – a flexible framework for concurrent and parallel data-processing in Python

pIXELsHAM.com

Jan 18, 2025

python

Pyper is a flexible framework for concurrent and parallel data-processing, based on functional programming patterns.

https://github.com/pyper-dev/pyper
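
To give a feel for what "concurrent data-processing based on functional programming patterns" can look like, here is a minimal sketch built only on the Python standard library. It does not use Pyper's API; the helper names `stage` and `pipeline` are hypothetical and exist only for this illustration. Each stage maps a plain function over its input with a thread pool, and stages are composed left to right.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce
from typing import Callable, Iterable, Iterator


def stage(func: Callable, workers: int = 4) -> Callable[[Iterable], Iterator]:
    """Wrap func as a pipeline stage (hypothetical helper, not Pyper's API)."""
    def run(items: Iterable) -> Iterator:
        with ThreadPoolExecutor(max_workers=workers) as pool:
            # pool.map runs calls in worker threads and preserves input order
            yield from pool.map(func, items)
    return run


def pipeline(*stages: Callable[[Iterable], Iterator]) -> Callable[[Iterable], Iterator]:
    """Compose stages left to right into a single callable."""
    return lambda items: reduce(lambda data, s: s(data), stages, items)


def parse(line: str) -> int:
    return int(line.strip())


def square(n: int) -> int:
    return n * n


process = pipeline(stage(parse), stage(square, workers=2))
results = list(process([" 1", "2 ", "3"]))
print(results)
```

Threads suit I/O-bound stages; for CPU-bound work a process pool would be the usual substitute. See the Pyper repository for the framework's actual interface.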

Views : 108
Jacob Bartlett – Apple is Killing Swift

pIXELsHAM.com

Jan 18, 2025

software, ves

https://blog.jacobstechtavern.com/p/apple-is-killing-swift

Jacob Bartlett argues that Swift, once envisioned as a simple and composable programming language by its creator Chris Lattner, has become overly complex due to Apple’s governance. Bartlett highlights that Swift now contains 217 reserved keywords, deviating from its original goal of simplicity. He contrasts Swift’s governance model, where Apple serves as the project lead and arbiter, with other languages like Python and Rust, which have more community-driven or balanced governance structures. Bartlett suggests that Apple’s control has led to Swift’s current state, moving away from Lattner’s initial vision.

Views : 20
Don’t Splat your Gaussians – Volumetric Ray-Traced Primitives for Modeling and Rendering Scattering and Emissive Media

pIXELsHAM.com

Jan 18, 2025

photogrammetry

https://arcanous98.github.io/projectPages/gaussianVolumes.html

We propose a compact and efficient alternative to existing volumetric representations for rendering such as voxel grids.

Views : 20
IPAdapter – Text Compatible Image Prompt Adapter for Text-to-Image Image-to-Image Diffusion Models and ComfyUI implementation

pIXELsHAM.com

Jan 17, 2025

A.I.

github.com/tencent-ailab/IP-Adapter

ip-adapter.github.io/

IPAdapter models are very powerful for image-to-image conditioning. The subject, or even just the style, of the reference image(s) can be easily transferred to a generation. Think of it as a 1-image LoRA. They are effective, lightweight adapters that add image-prompt capability to pre-trained text-to-image diffusion models. An IP-Adapter with only 22M parameters can achieve comparable or even better performance than a fine-tuned image prompt model.

Once the IP-Adapter is trained, it can be directly reused on custom models fine-tuned from the same base model.

The IP-Adapter is fully compatible with existing controllable tools, e.g., ControlNet and T2I-Adapter.

Views : 14
Sony – Diffusion Training from Scratch on a Micro-Budget

pIXELsHAM.com

Jan 17, 2025

A.I.

stability.ai/news/stable-point-aware-3d

huggingface.co/VSehwag24/MicroDiT

Views : 15
GAGA – Group Any Gaussians via 3D-aware Memory Bank segmentation

pIXELsHAM.com

Jan 17, 2025

A.I.

www.gaga.gallery/

https://github.com/weijielyu/Gaga

Views : 23
Curl – Client for URLs is a free and open source command line tool for transferring data using various protocols

pIXELsHAM.com

Jan 16, 2025

software

curl.se/

www.keycdn.com/support/popular-curl-examples

Views : 28
ComfyUI – Zero to hero with Cubiq (Matteo)

pIXELsHAM.com

Jan 15, 2025

A.I.

Views : 19
Kinetix.tech – Character motion control

pIXELsHAM.com

Jan 15, 2025

A.I.

www.kinetix.tech/

www.kinetix.tech/character-motion-control-for-video-generation-models

Views : 26
SPAR3D – Stable Point-Aware Reconstruction of 3D Objects from Single Images

pIXELsHAM.com

Jan 15, 2025

A.I., modeling, photogrammetry

SPAR3D is a fast single-image 3D reconstructor with intermediate point cloud generation, which allows for interactive user edits and achieves state-of-the-art performance.

https://github.com/Stability-AI/stable-point-aware-3d

https://static1.squarespace.com/static/6213c340453c3f502425776e/t/677e3bc1b9e5df16b60ed4fe/1736326093956/SPAR3D+Research+Paper.pdf

https://stability.ai/news/stable-point-aware-3d?utm_source=x&utm_medium=social&utm_campaign=SPAR3D

Views : 27
MiniMax-01 goes open source

pIXELsHAM.com

Jan 15, 2025

A.I.

MiniMax is thrilled to announce the release of the MiniMax-01 series, featuring two groundbreaking models:

MiniMax-Text-01: A foundational language model.
MiniMax-VL-01: A visual multi-modal model.

Both models are now open-source, paving the way for innovation and accessibility in AI development!

🔑 Key Innovations
1. Lightning Attention Architecture: Combines 7/8 Lightning Attention with 1/8 Softmax Attention, delivering unparalleled performance.
2. Massive Scale with MoE (Mixture of Experts): 456B parameters with 32 experts and 45.9B activated parameters.
3. 4M-Token Context Window: Processes up to 4 million tokens, 20–32x the capacity of leading models, redefining what’s possible in long-context AI applications.

💡 Why MiniMax-01 Matters
1. Innovative Architecture for Top-Tier Performance
The MiniMax-01 series introduces the Lightning Attention mechanism, a bold alternative to traditional Transformer architectures, delivering unmatched efficiency and scalability.

2. 4M Ultra-Long Context: Ushering in the AI Agent Era
With the ability to handle 4 million tokens, MiniMax-01 is designed to lead the next wave of agent-based applications, where extended context handling and sustained memory are critical.

3. Unbeatable Cost-Effectiveness
Through proprietary architectural innovations and infrastructure optimization, we’re offering the most competitive pricing in the industry:
$0.2 per million input tokens
$1.1 per million output tokens
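
As a rough illustration of what these rates mean in practice, the sketch below estimates the cost of a request from its token counts. The function name `estimate_cost_usd` is hypothetical, the rates are simply the two figures quoted above, and actual billing may differ.

```python
# Rates quoted in the announcement above (USD per million tokens).
INPUT_USD_PER_MILLION = 0.2
OUTPUT_USD_PER_MILLION = 1.1


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD (hypothetical helper, not an official API)."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Filling the full 4M-token context window, with no output tokens counted.
full_context_cost = estimate_cost_usd(4_000_000, 0)
# A smaller request: 100k tokens in, 2k tokens out.
small_request_cost = estimate_cost_usd(100_000, 2_000)

print(f"Full 4M-token context: ${full_context_cost:.2f}")
print(f"100k in / 2k out: ${small_request_cost:.4f}")
```

At the quoted input rate, a prompt that fills the entire 4M-token window comes to about $0.80 before any output tokens are counted.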

🌟 Experience the Future of AI Today
We believe MiniMax-01 is poised to transform AI applications across industries. Whether you’re building next-gen AI agents, tackling ultra-long context tasks, or exploring new frontiers in AI, MiniMax-01 is here to empower your vision.

✅ Try it now for free: hailuo.ai

📄 Read the technical paper: filecdn.minimax.chat/_Arxiv_MiniMax_01_Report.pdf

🌐 Learn more: minimaxi.com/en/news/minimax-01-series-2

💡API Platform: intl.minimaxi.com/

Views : 30
ComfyUI Tutorial Series Ep16 – How to Create Seamless Patterns & Tileable

pIXELsHAM.com

Jan 14, 2025

A.I.

Views : 23
Quantastic – Design inspirations

pIXELsHAM.com

Jan 14, 2025

design

Views : 36
Why Steven Spielberg Avoids a Wide Open Aperture

pIXELsHAM.com

Jan 12, 2025

composition, photography

Views : 16

DISCLAIMER – Links and images on this website may be protected by the respective owners’ copyright. All data submitted by users through this site shall be treated as freely available to share.