You CAN do some cool stuff by trying different scaling methods and dithering in post! Especially by going far smaller than you need, then re-expanding with any smoothing or optimization turned off (i.e. nearest-neighbor scaling), so it purely scales up into big square pixels.
I’m describing it badly but it’s a fun technique for pixelating stuff
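For anyone who wants to try it, here is a rough sketch of the shrink-then-re-expand idea in plain Python. The function names (`resize_nearest`, `pixelate`) and the toy 4x4 grayscale "image" are made up for illustration; a real workflow would more likely use an image library's nearest-neighbor resize.

```python
def resize_nearest(pixels, new_w, new_h):
    """Resize a 2D grid of pixel values with nearest-neighbor sampling (no smoothing)."""
    old_h, old_w = len(pixels), len(pixels[0])
    return [
        [pixels[y * old_h // new_h][x * old_w // new_w] for x in range(new_w)]
        for y in range(new_h)
    ]


def pixelate(pixels, factor):
    """Shrink by `factor`, then scale back up to the original size."""
    h, w = len(pixels), len(pixels[0])
    small = resize_nearest(pixels, max(1, w // factor), max(1, h // factor))
    return resize_nearest(small, w, h)


# Toy 4x4 grayscale gradient standing in for a real image (hypothetical data).
image = [[x + 4 * y for x in range(4)] for y in range(4)]
blocky = pixelate(image, 2)

for row in blocky:
    print(row)
```

Each 2x2 block in the result ends up with a single value, which is the "big square pixels" look. Going to a much smaller intermediate size (a larger `factor`) gives chunkier blocks.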
LayerDiffuse has been able to do transparent images for at least a year: [GitHub - lllyasviel/LayerDiffuse\_DiffusersCLI: LayerDiffuse in pure diffusers without any GUI](https://github.com/lllyasviel/LayerDiffuse_DiffusersCLI)
BEN is something else: it takes an existing image and attempts to detect what is background. It often erases a bit too much or needs to be fixed manually, and it's not much better than the tool already integrated into Krita.
True. Different tools have different uses. Where I think BEN excels is in wispy gradient shit like hair (I've been working with a lot of hair lately). It's the only one that gets it to the quality I need consistently.
I also use traditional segmentation pipelines when working on more complex masking setups or just plain ol' REMBG (when I need something fast). I do a lot of photogrammetry and 3DGS and these segmentation/masking tools have saved me countless hours of manual labor even compared to the initial learning curve.
If you need still images with transparency, SD Forge does it with a plugin. (I forgot the name of it)
But I remember installing it through the interface using the github link, and it worked as soon as I understood how to use it...
It was with SDXL models.
Now this is the kind of thing that can be used for specialized creative tools that artists will come to appreciate, at least those who have not been infected by the anti-AI mind virus.
Am I wrong or is it just randomly ignoring the prompt in the demo video?
If the prompt is "A forest floor being consumed by spreading magical fire" Then I would expect a forest floor somewhere.
If the prompt is "Water splattering in mid-air" Then I would expect some air.
Ask any other image or video model to portray air and it will portray something. This model (from the demo video, at least) seems to just make the largest object transparent. It is impressive, but it also seems difficult to get the video you want; perhaps on the next run it makes the water transparent and shows the air instead.
Most AI models that process video and photo can only produce RGB output. To produce/maintain transparency they have to output RGBA.
In simplified terms the reason for this is that adding an additional image channel that has to be processed adds complexity and processing work, regardless of whether the thing you are processing really needs transparency or not. And given that over 90% of images and video don't contain transparency, it makes sense that people training models would choose to exclude it.
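To put a rough number on the extra channel, here is a small sketch. The 1280x720 frame size is an assumption picked for illustration, and this only counts raw output values; actual training and inference cost will not scale in exactly this way. The second function shows what the alpha channel is used for, using the standard "over" blend with straight (non-premultiplied) alpha in the 0 to 1 range.

```python
def values_per_frame(width, height, channels):
    """Number of raw values a model has to produce for one frame."""
    return width * height * channels


def composite_over(fg_rgba, bg_rgb):
    """Blend one straight-alpha RGBA pixel over an opaque RGB background pixel."""
    r, g, b, a = fg_rgba
    return tuple(a * f + (1 - a) * c for f, c in zip((r, g, b), bg_rgb))


w, h = 1280, 720  # assumed frame size, not taken from any particular model
rgb = values_per_frame(w, h, 3)
rgba = values_per_frame(w, h, 4)
overhead = rgba / rgb - 1  # fraction of extra output values for RGBA

print(f"RGB: {rgb} values, RGBA: {rgba} values, overhead: {overhead:.0%}")

# Half-transparent red over solid blue gives an even mix of the two.
print(composite_over((1.0, 0.0, 0.0, 0.5), (0.0, 0.0, 1.0)))
```

So every frame carries a third more output values with alpha included, whether or not the content actually has any transparency in it.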