Breaking the music supply constraint
Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 265 comments
I just cancelled my music subscriptions to save some cash and wanted to share the self-hosted music supply chain that replaced them. I nice side effect of thus setup is breaking the constraint of a finite supply catalog that is tailed for the masses:
-
2 x DGX Spark linked via ConnectX 7 running Plex and multiple Ace-Step 1.5 XL models in parallel for music generation with GePa prompt optimization. Also holds my organic music that the models can remix. TODO: a reinforcement learning from human feedback interface.
-
iPad Pro running Prism as a Plex client for bitperfect and sample rate-matched audio.
-
Schiit stack -> Hifiman Arya Stealths
This effectively gives me an infinite supply of music for free, that is personalized and private. It's immensely satisfying listening to Shrimp Bizkit and Phlegminem on repeat (my own artist names), I much prefer this to the organic music created after 2011.
My only problem is the loss of community, I have noone to share my new favorite songs and artists with because they're generated for me. If anyone wants to hop on to my Plex share to discuss, let me know!
HitMachineHOTS@reddit
Is the remix Suno quality?
Dany0@reddit
This is kinda cyberpunk but more the dystopian kind
Tramagust@reddit
Nah because it's oss
Dany0@reddit
Just because it's OSS doesn't mean it was ethically trained. Plus, if you really do love music, then I don't think that you'll be happy with OP's setup. I can imagine actually doing something like this for myself, but only if it was fully trained on just my playing, that would be actually fun, and maybe I'd even use it from time to time you know?
entsnack@reddit (OP)
Organic music is the past. The vinyl of 2026. We will remember the days we sniffed human sweat watching humans perform on stage.
-dysangel-@reddit
I used to smell human sweat while doing everything, but then I realised I had pretty open access to water and soap, and so I just get soapy water and mist it into my nose every so often.
Conscious-content42@reddit
"It's the smell, if there is such a thing" https://www.youtube.com/watch?v=yL9Y24ciNWs
yourgamermomthethird@reddit
Didn’t imagine 2 of my interests would be on the same sub this is pretty cool
AbsoluteHedonn@reddit
This is garbage engagement bait
entsnack@reddit (OP)
What about it makes you think so? Is it not related to local LLMs broadly construed?
truthputer@reddit
Your post is just about AI slop and rage bait.
Stop wasting our time with this shit.
entsnack@reddit (OP)
Sir, this is /r/LocalLLaMA
Vicar_of_Wibbly@reddit
It’s parody of the finest kind. Subtle, funny, and completely over the heads of the cerebrally encumbered.
entsnack@reddit (OP)
stealing this to use instead of smoothbrain
KingMitsubishi@reddit
This is the best post ever
Correct-Boss-9206@reddit
Pretty sure it's a joke/parody, shrimp
Song-Historical@reddit
Can you link some of your creations
entsnack@reddit (OP)
You sound bitter, has AI replaced you or something?
Song-Historical@reddit
I'm genuinely interested! I haven't heard good AI generated music tbh. I was an early user of Aria and Suno.
entsnack@reddit (OP)
Ah you should genuinely try it out then, the model is free :-)
Song-Historical@reddit
I don't have DGX sparks to spare
know-your-enemy-92@reddit
You can run ace step 1.5 easily even on 3060ti
Song-Historical@reddit
I only have the 3070 on my laptop
No-Promotion4006@reddit
We are Charlie Kirk is good AI generated music, very famous example
audioen@reddit
I just listened to it and while I would hesitate to call it actually good music, I agree it is definitely a good example. Its sound quality is nearly good enough to not be immediately distracting or weird, and it even seems to have something like a musical composition, though I personally find it quite boring to listen to -- just not enough variation.
bnm777@reddit
You misread people if you think that comment sounds bitter.
entsnack@reddit (OP)
Turns out they were indeed replaced by AI. Can't blame them for calling it leftover casserole.
ThinDoughnut3617@reddit
Yeah no, this is dystopian. Music is art, art is what it means to express as a human. No machine will & can reproduce that. AI can do whatever, but it should not touch art.
entsnack@reddit (OP)
There's a reason "struggling artist" is a trope, and my setup frees humans to do something more lucrative than making music.
ThinDoughnut3617@reddit
Like I said, music making is about expression and so much more than making a profit. If making music is meant to be lucrative and one does it for that sole reason, to seek profit, then that's a problem in itself and apart from what making music truly stands for.
entsnack@reddit (OP)
Well you sound European and maybe you should lobby for more worker protections there (or just move there). I'm enjoying listening to Chester Bennington sing new songs to me and won't give that up because someone thinks it's nOt TrUE aRt.
ThinDoughnut3617@reddit
I mean at this point you're just rage baiting. If not, you do you and enjoy it.
HeywoodJaBlessMe@reddit
That's it. AI generated music has almost zero value except as a curiosity.
vanonym_@reddit
I uuuuh... I'm not sure I would enjoy that listening experience. But good for you I guess?
What's the output format and resolution of your system and how does that play, don't you have a huge quality drop from lossless formats provided by other listening platforms?
entsnack@reddit (OP)
The base audiobquality is low resolution. However, you can take the image of the waveform and upsample it using WAN or any AI upsampler, then convert it back into audio.
NandaVegg@reddit
>The base audio quality is low resolution. However, you can take the image of the waveform and upsample it using WAN or any AI upsampler, then convert it back into audio.
Is this really a thing?! Assuming visual waveform upscaling actually works, you will still need to 10000+ visual chunks for 2-3 minutes audio just to do that interpolation visually.
entsnack@reddit (OP)
most definitely not a thing, neither are cables.
upsampling is tricky but we are making progress, it's already done well commercially, just needs to trickle down to open weights.
IrisColt@reddit
ca-cables? Like literally physical cables?
entsnack@reddit (OP)
Yes, this one from Alo Audio (now Campfire in Portland, OR) is particularly good: https://aloaudio.com/products/gold-16
Jester14@reddit
lmfao I still can't tell if this is a troll post
entsnack@reddit (OP)
I mean the cable-shilling of a $1000 cable should have been a giveaway.
IrisColt@reddit
heh
Capt_Blahvious@reddit
Ok, now I know you're either shitting us or you are smelling your own audiophile farts.
gtrak@reddit
Only a true audiophile will hear the difference
IrisColt@reddit
Thanks!!!
vanonym_@reddit
Thank you for mentioning upsampling. I think that cables are like the very last thing I would consider important when trying to get hifi AI audio, the generation part seems to be the bottleneck but eh i didn't test it.
--Spaci--@reddit
Ive never understood AI music, not only was it never really good it obviously just had no soul
the320x200@reddit
When recording was invented there was a movement against it. "Canned music has no soul!" How could it compare to live music. Just a mechanical reproduction of the real thing!
Pretty much every new technology is seen as soulless at first.
--Spaci--@reddit
This isn't even made by a human, I think there's a pretty large difference here. You are comparing two different ways the listen to music made by humans to listening to music made by AI
Recoil42@reddit
Tbh, I've been playing around with Suno lately and there's some pretty good stuff out there. I wouldn't say we're best-of-human-level yet, but listenable? Definitely.
--Spaci--@reddit
That does have quality but still no soul, a human didn't make it. Say like im listening to jcole for example, I feel like I understand this person in like a semi parasocial way through their bars and lyrics, I don't listen to music just because it sounds good
Recoil42@reddit
Frankly if you define soulful as human-made, then that's tautological and running the discussion immediately off the rails. But I didn't say soulful, I said listenable. The contention was only that the technology is good and getting better, not that that it's ready to replace the best of human art.
SpicyWangz@reddit
It’s not ready to replace the best of human art, because it will only ever be an imitation of art.
Borkato@reddit
I thought this was a pro AI sub
funkdialout@reddit
Ah yes, please keep the echo chamber clear of dissenting opinions!
SpicyWangz@reddit
I love AI. I use it every day.
the320x200@reddit
Photography is also not art. The human didn't make the image. All they did was press a button. /s
SpicyWangz@reddit
Definitely a lesser form of art.
Recoil42@reddit
Most of the best of human art is functionally imitation of other art. That's like... the entire history of art.
SpicyWangz@reddit
All of the best human art is good because there is real emotion and story behind it.
the320x200@reddit
What happens if an artist makes a piece as a commission and their heart isn't in it but they draw it anyway? Is that not art anymore?
What if you don't know the artist's intentions or emotions and have to just judge the piece in isolation?
Recoil42@reddit
The Mona Lisa was a commission piece. Perhaps the most famous piece of artwork in history was just a rich guy's wife doing what essentially amounted to a contemporaneous version of what would today be at best, a photo studio session.
Recoil42@reddit
When John Lennon was singing about walruses it's because he really was both the eggman AND the walrus. Nailed it. The art understander has logged on.
AmphibianFrog@reddit
A lot of works of art and imitations or inspired by other works of art. But in a lot of ways "AI Art" is an imitation of "Art".
Recoil42@reddit
If art is imitative then asserting that AI art is imitiative does not make AI art not art.
AmphibianFrog@reddit
Well it depends on your definition of art! If the clouds in the sky happened to make the most beautiful image ever, it still wouldn't be art because it wasn't made with the kind of intent that makes it art.
I think the current definition of art involves a human, it at least some kind of sentient being.
Recoil42@reddit
That's the wonderful thing, I'm using your definition of art. You very explicitly said art can be imitative. If art is imitative then media produced with the assistance of an ML model is not disqualified from being art simply by having an imitative component.
I don't think that's true, nor would it fundamentally disqualify AI art from being art even if it were true. Machine learning models are created by humans and used by humans. The inputs are human. They do not have the property of being human-free.
FYI: Warhol, Picasso, Cage, Ono, Duchamp, Pollock, and many more spent most of their careers exploring the boundaries this stuff. I don't expect r/Locallama to be art school central, but this really is Art 101 — most of the 20th century's great movements are "is art still art if we peel the preconceptions of what art is away from art?" and that's what Dadaism and Fluxus were both about.
AmphibianFrog@reddit
This is really interesting. I have some ideas in my head about what constitutes art and what doesn't, but the more precisely I try to write them down, the more they seem vague and poorly defined.
I don''t necessarily think that the fact that AI imitates human art is relevant in determining if it is art to be honest. It could be art without imitating existing art, and it could imitate existing art and not be considered art in and of itself.
Also, I see what you're saying about how humans created AI therefore AI created art is human created art, but I'm not sure that's meaningful either.
But then again, there is a lot of human created "art" that I don't consider to be particularly artistic either!
I am still not sure of any definition of art that doesn't involve some kind of intelligent creator though. It seems like the original intent is important to some degree.
So basically, I have no answer or retort to anything in your response. But it was fun trying to find a counter example and realising how thin they all were!
Recoil42@reddit
Yoko Ono is famously good at exploring the boundaries of this. Immediately coming to mind — Instructions for Paintings (1962), which is not actually a set of paintings but what you would now recognize as resembling a book of prompts for creating paintings. The paintings themselves do not exist, only the descriptions of them. See also Grapefruit (1964) which was the same, but for performances.
--Spaci--@reddit
Inspiration and imitation is different from generative AI
Recoil42@reddit
Special pleading and distinction without a difference both apply here. Just saying something is 'different' doesn't make it not art or a valid functional replacement for entirely human-produced art. It isn't even really an argument at all.
AmphibianFrog@reddit
I'm not the one who originally said "soul" but I think I know what he means.
For me personally, human made music is a lot more interesting. Human creativity comes out of lived experience.
Take for example that Dire Straits song about the shitty band in the pub. The Sultans of Swing! That song is cool on its own, but there's something about the fact that he actually walked into a pub and saw a real band playing.
There's a human connection.
Even with instrumental music, normally the creator at least liked the way it made them feel.
MindlesslyBrowsing@reddit
As a hobby musician, I'm afraid the masses don't care about that.
We still got the performance aspect I guess
AmphibianFrog@reddit
I don't think "the masses" are listening to AI generated music...
MindlesslyBrowsing@reddit
Some of it has already mainstream, sometimes as a "meme". For example BBL Drizzy
AmphibianFrog@reddit
If you take a random sample of the population in your country, on average, what percentage of the music that they listen to by choice do you think would be AI generated?
I have never met anyone in real life who listens to AI music.
And I'm really talking about pieces of music where the AI generated it completely from humans prompting it, not when AI was used to master it or add some little details to an existing piece of music.
MindlesslyBrowsing@reddit
Well i caught my parents listening to it and as I said it's been circulating through memes.
I'm not saying it's the most dominant form of music, I'm saying it's already here
AmphibianFrog@reddit
You said it's "for the masses". I don't really think that's true.
MindlesslyBrowsing@reddit
Fair, that's not what I see. To each their own 🤷
IrisColt@reddit
This song is awesome, THANKS!!! I love AI music in general (I started listening to my own generated pieces 15 years ago, just like D. Cope), but this one takes it to a whole other level.
ArtfulGenie69@reddit
You haven't heard what is possible and being generated today. When we get something that does as good at music as fish audio s2 does at voices you'll be done with the riaa as well. I'm not saying this as a ai chud either, this is coming from a dude who practices his guitar every day.
The other day I heard a Charlie Kirk ai song but they had done better lyrics and they had done it as a reggae/ska style. The song worked even though I hate Charlie. It was on a steam and it wasn't just me having a bad ear, the crowd watching was amazed by the quality.
lukistellar@reddit
And even in the better creations, you clearly can hear that this is generated by AI. It's in the highs, it sound kinda like undersampling, especially in the vocal. Also the design of the room is weird in most AI track, like strange delays and reverbs.
Here is also a good example: https://www.youtube.com/watch?v=pAXKptUYf8M
The artifacts are way stronger in this one, but it's the first AI track I actually like musically.
ArtfulGenie69@reddit
The fedelity on charlie kirk is definitely not there fully, I can hear the ai hiss over instruments and the singers voice and the piano/guitar combo that in actual reggae would be more defined. Quick strums with a mute after and the stab from the piano at the same time. The ai doesn't get the idea of both those instruments it just is guess at the sound they make and it shows. Thinking about the song as a whole, it has a start, the singer nails his lyrics, it has the call and response between the singer and the horns, the end also has a slow down ending.
I was able to get better representation of the instruments in bands when I demuxed my training tracks giving it instrument solo's, but it over trained and would add more crackle. It would sound just like surf guitar though even though the original surf guitar that it produced was utter garbage. When I found this out I also noticed that the starter and endings would also be garbled without unique descriptions, when the end was uniquely described it would actually do a sudden stop or a break down of the band into a hissing noise. Same with the starts they have to be tagged with unique instructions. Ace step is really close, hopefully there will be something magnificent and new this year in music gen.
stl314159@reddit
I listen to it while I work specifically because it has no soul. It’s just repetitive and consistent and allows my ADHD brain to get into a flow state.
Frequent-Mud8705@reddit
ever heard of jungle/DnB?
AdministrativeMeat3@reddit
Literally throw on any UKF dnb mega playlist from the last 30 years and you've got yourself an immediate all day flow state
illforgetsoonenough@reddit
I used to mix it in the 90s :D
Adidat@reddit
Hook a brotha up. My adhd ass needs this. Also any advice for some soulless music for luxury real estate videos?
I’m so curious the kind of mashups op would be able to do with a large library like that.
stl314159@reddit
I can't help with the real estate videos but these are some things from my recently played:
https://youtu.be/dAfRWzh66yY?si=3L25EXyxCRV2iqU_
https://youtu.be/dAfRWzh66yY?si=1mWtNjvbEVtWxnT_
https://youtu.be/GISyZNwpprU?si=CLdi2qXvI5x-CYn1
https://youtu.be/9ic3a7SO6cA?si=wYia1VUzBwa99kSx
https://youtu.be/pL-90ThONJs?si=88org9pWQIlBUUCy
halr9000@reddit
Stable audio 3 is really good for this. It can't even do lyrics but it takes 2 seconds to make 2 minutes of audio.
IrisColt@reddit
This.
MaruluVR@reddit
Its not bad if you finetune it on exactly the music you like
entsnack@reddit (OP)
It does need good training data and prompting. Commercial setups are constrained by copyright laws. I have a large FLAC library to train on.
vanonym_@reddit
I don't really enjoy it either for similar reasons, let alone the fact that the output quality usually is terrible, that's why I'm asking OP about it.
But to each their own
GeologistPutrid2657@reddit
the fuck happens when you wanna hear something again? have we got unlimited storage and recall as well?
tmvr@reddit
You don't need to hear it again dude, carpe diem, if it's gone, just let it go, something new will come along 😃
entsnack@reddit (OP)
storage was cheap until very recently, effectively unlimited once prices come back down.
Main_Problem_2696@reddit
Two DGX Sparks running parallel AceStep models just for infinite Shrimp Bizkit is the most overkill local setup I've seen. The RLHF interface idea is legit though. Loss of community is the real tradeoff. Generated music has no band to follow or weird subreddit to argue in. Used Runable to document my own music gen experiments. Clean dashboard in 20 minutes. Made sharing specific outputs easier. Would love to hear what Phlegminem sounds like.
HavenTerminal_com@reddit
shrimp bizkit and phlegminem are the best artist names I've ever heard and I'm a little devastated they're not real bands
TheThoccnessMonster@reddit
Can you share your configs and loadout. I’ve got a couple sparks I’d love to try this in.
cleverusernametry@reddit
Oh yes weve always had a shortage of music. Ffs.
L064N@reddit
This is so interesting wow. Do you have some favorite songs that its made?
awsom82@reddit
Ridiculous, you dont AI for music 🎶
entsnack@reddit (OP)
You do need it for an infinite supply of personalized music on demand. :-)
JonMcElyea@reddit
I think it's weird you are being down voted in a local-first community. You are describing what I consider the best personal music creation workflow using Ace-step, and for some reason, people HERE are criticizing that effort. Baffled.
Keep up the cool experiment and the satire. It's all great to read about.
xquarx@reddit
Its not hard to get hold of more music than hours in your life.
entsnack@reddit (OP)
Find me more nu metal with a whiny vocalist bro, do it, punk
MiElas-hehe@reddit
That's the beauty of music; some are one in a kind. When you do find a similar artist, boy does it feel good. If not, you make it yourself! Without AI of course, the good old authentic real human effort way. Though it might sound shit in the beginning, but that is what it gives it SOUL.
NelisMakrelis@reddit
As a musician you saying you prefer ai music over anything made after 2011 makes me both incredibly sad and dissapointed as well as straight up annoyed because that’s just simply bullshit, if you don’t like what’s on the mainstream just stream any of the other 10k songs that are uploaded by hardworking people you have something in common with; being human and knowing what that’s like.
entsnack@reddit (OP)
You misunderstood me: I prefer my AI music. Why: because it sounds better to my ears! Artists don't produce my type of music anymore because it does not sell.
Have you ever built a chair yourself? DIYed a gadget? Baked your own cake? This is the same, and it means no disrespect to people who produce these things for a living and require mass appeal to survive.
youcloudsofdoom@reddit
This analogy only makes sense if by building your own chair you also consume an insane amount of energy and resources every time you sit on it
entsnack@reddit (OP)
This entire cluster consumes about the same power as a 4090 (JUST the GPU). Go yell at the gamers for their INSANE energy consumption everytime they game lol.
youcloudsofdoom@reddit
No, I'm yelling at you for composing the most consumptive way to listen to music in the history of humanity, during a climate crisis.
entsnack@reddit (OP)
What? My entire stack consumes less power than the average 5.1 speaker system. Do you have a source for your claim or just vibes?
youcloudsofdoom@reddit
The problem isn't amplification, it's generation.
The embodied carbon cost of what you're doing is substantial; it's not simply the cost to run, but the embodied carbon in the Sparks is massive, as is that involved in training and distributing the models themselves. If you're using this system to generate new music each time you play something (which seems implied in your post), it's like buying an album on a physical format, listening to it once, then throwing it away and buying another (similar) one, and doing the same over and over again. Only what you're doing involves a LOT more rare earth metals and non-recyclable future e-waste.
Streaming music alone was already more consumptive than previous formats (https://www.cambridge.org/core/journals/popular-music/article/abs/cost-of-music/DEC6AA100C191D510213F9086CF094CC). But what you've deployed here is orders of magnitude more consumptive than the typical user streaming to their phone; data centers are at least reasonably efficient at their (ecologically catastrophic) jobs in terms of co2/stream, which your setup is far from. You've created an additional, unwarranted step in the consumption of music, using what is widely understood to be one of the most consumptive technological infrastructures (generative AI) that humanity has ever produced, and you're doing it during an unfolding climate crisis.
StewedAngelSkins@reddit
And yet he's still causing less harm than your average gamer. Seriously, if you actually care about this and aren't just using it as a proxy to be mad about AI then go after the stuff that makes the biggest impact first.
rorykoehler@reddit
Also I made music myself. This isn't remotely similar. I guess you are joking but if you aren't... i have no words.
rorykoehler@reddit
Post your slop so we can judge you!
JazzlikeLeave5530@reddit
Any time someone says "XYZ nowadays is all bad" I just assume they're lazy. Mainstream is always gonna be crap but there are so many artists nowadays everywhere making amazing songs and with like 1,000 listens on their stuff. This reads to me like "I'm so glad now that I can eat nothing but gray paste that I made myself."
StewedAngelSkins@reddit
I mean it's a cool concept but you couldn't pay me to listen to that slop. It's worse than the radio. Wait until OP finds out about soulseek...
entsnack@reddit (OP)
Ha I'm a massive "soul seeker" let's just say, I have about 10 TB of FLACs and my primary portable device is a flashmodded iPod Classic. I am actually trying to use these FLACs for training. I don't get the deal with vinyl rips on soulseek though, I keep downloading them by mistake because 192 Khz and they have a high noise floor and crackling.
StewedAngelSkins@reddit
How is this saving you money then? You already have an infinite supply of music for free.
entsnack@reddit (OP)
I want Fred Durst singing to me for a couple of hours everyday, there's only so many times I can re-listen to Hot Dog.
StewedAngelSkins@reddit
At least we agree on one thing. For me that number is 1.
StewedAngelSkins@reddit
So you want to keep listening to imitations of the exact same artists forever, to the point where even finding similar artists won't do it for you, but not the exact same songs? You have interesting taste in music.
wrathfulrapier@reddit
There are thousands of artists making millions of songs, not to mention the pre-existing millions of songs with just about any combination you could ask for if you just look it up. Especially considering that you can only generate music off of pre-existing music, it's an interesting set up but I'm not seeing the appeal (especially with the loss of community)
entsnack@reddit (OP)
find me a numetal band with a whiny male vocalist, do it, has to be whiny + numetal rap/rock. show me one in your thousands of artists doing this.
draetheus@reddit
Check out Hellhills. A little light on the rap but damn if that isn't some early 00s whiny numetal.
wrathfulrapier@reddit
Also Daron Malakian System of a Down
entsnack@reddit (OP)
I've heard ALL his songs. Simpler: find me a Chester Bennington song I haven't heard. He's not making new ones last I checked. My setup generates infinitely many Chester songs.
AmphibianFrog@reddit
They're not really Chester Bennington songs if he didn't make them though...
entsnack@reddit (OP)
Doesn't matter if what I want to hear is his voice. Look, I understand this isn't for you, I'm not sure why you don't understand this is what I need and enjoy.
AmphibianFrog@reddit
Why do you think I don't understand that? I'm expressing my opinion, not trying to decide yours!
entsnack@reddit (OP)
Cool, maybe try to create instead of opining and you'll have more interesting takes than GPT 5.5. ✌️
AmphibianFrog@reddit
I create plenty. I create plenty of music too.
But I don't really consider asking Suno to make some music to be an act of creation. I also don't consider asking Claude code to write something to be an act of creation either. Which, as a software engineer, is definitely making me think some things...
wrathfulrapier@reddit
You can't get your multi-thousand dollar setup to do this for you?
entsnack@reddit (OP)
Yes which is why I built this lol. I have 20TB of FLACs that didn't fix my need.
wrathfulrapier@reddit
I highly recommend getting hooked up with last.fm and listen brains and getting your tastes punched in for some fine tuned recommendations, you may be surprised! I would be absolutely heartbroken if I wasn't able to link up with other people about the music I loved, it's like 20% of the enjoyment for me. And as a side benefit it would free up compute power for you to focus on other tasks
entsnack@reddit (OP)
My lastfm account is 20 years old lol, I have scrobbled from Winamp on Windows 95 and displayed my listening status on MSN Messenger. Thanks for the recommendation tho.
audioen@reddit
https://www.youtube.com/watch?v=8Zzz2c0uhR0 I'm not entirely sure about it. Clearly, this is human-AI collaboration, but something like forest doomcore is underserved genre for sure. While the music is just fairly generic death metal type of thing, there is wonderful undertone of silliness about the whole production.
I like AI in a similar way that I like cartoons -- they can just draw whatever weird stuff that would be impractical to film, and thus there is unleashing of a kind of raw creation that is based on mixing existing stuff, yes, but in a way that might not have been done before.
mintybadgerme@reddit
This is simultaneously genius and the music industry's biggest nightmare. Once this happens for real, it's game over.
Recoil42@reddit
This post both rocks and is also completely indistinguishable from parody, I love it.
EvilPencil@reddit
I know right? I was hoping to see an AI music recommendation engine that plugs into the *arr stack and curates the playlists, etc. Because then the actual music would be real.
entsnack@reddit (OP)
Takes a scientist to know one. 🫡
tmvr@reddit
dangerous_inference@reddit
This is a pet peeve of mine. Zero means nothing. It's not an item, it the lack of an item. WHY ARE YOU LIKE THIS
Conscious-content42@reddit
Whoooa loading this using old.reddit.com, changed the index to 1-3, and new Reddit (www.reddit.com) is 0-2 indexed. I was like what is he talking about index of 0. Old Reddit knows the trickery.
zuraken@reddit
Same, i had looked again and saw no 0
MathmoKiwi@reddit
Favorite part of your post was starting your list from 0
Which then broke the reddit formatting
Spiritual_Trick_6655@reddit
The litmus test for satire.
xquarx@reddit
I did this, but the opposite. Build a massive FLAC collection of historical bangers, run STEM processing on them (vocal, melody, bass, drums), use AudioMuse-AI to embedd the music (track similarity), fork Mixxx and vibe integrate it all (distance, harmonics), then become a DJ. It is so much fun.
rorykoehler@reddit
That's genius. Did you post the code?
xquarx@reddit
Sorry I've not shared it anywhere, its quite involved and custom tailored with gluing various FOSS projects together with little modifications. Very clean workflow for me, but not a quick setup. More a proof of concept that could be refined into a solution.
entsnack@reddit (OP)
This sounds fun! I've done this at a small scale with demucs for a karaoke setup. I love the DJing idea!
xquarx@reddit
Yes basically that. I did small scale but then I enjoyed it so much I automated my entire library.
Photochromism@reddit
Is Ace-Step actually any good? I tried it a year ago and it sounded like shit.
Nicking0413@reddit
Can’t even tell if this is parody or not. Maybe I’m just stupid
Desperate-List442@reddit
Might be bottom 10 posts OAT
entsnack@reddit (OP)
I've never been a bottom so I can't really tell. What's it like?
Desperate-List442@reddit
Is that supposed to be like an insult or something
Neither-Ad-8957@reddit
beautiful!
gufhHX@reddit
I want to marry who ever owns this apartment and stuff. Take the ring right now I tells ya
Raredisarray@reddit
Github Repo?
entsnack@reddit (OP)
sorry I'm not a vibecoder
rorykoehler@reddit
Lol
perfopt@reddit
"the loss of community" create a website and post there - invite people to listen, comment etc.
Southern_Sun_2106@reddit
Somebody a while ago posted a public repo project of a radio staton that generates endless talkshows. It was very well done. I took that and converted it to a music/talk show station; played with different styles; then ended up having it make songs based on real poetry of several favorite poets (copyright free, from 18-19-early 20th century, about love and such, some other topics). It was great for some time, some songs were really good. Family members reported - better than paid services, and they preferred the 'home' radio station. Also tweaked talked shows to match family interests.
Everything was powered by M3 Ultra plus qwen 27B and Kokoro.
Issue - you know how you come to 'learn' the models eventually? Even 'frontier' models like Claude become stale at some point. Anyway, it all becomes feeling 'same' after a while. I ended up just doing a 'music tool' for the kids - they write their own lyrics, M3 Ultra makes music, songs, etc. That simple tool ended up having a much longer life and is still used and preferred to paid online services, surprisingly.
devilandall@reddit
Dude, that’s neeeeeeeet. Have you shared Shrimp Bizkit anywhere? I would really like to listen it, don’t have Aryas unfortunately, but good old 400s should do it for me for now
cmndr_spanky@reddit
That is wild, can I get a link to listen somehow ? You can PM if you prefer
andrew45lt@reddit
So you just listen to how the tokens are generated?
Mamaun30@reddit
Op: "I just cancelled my music subscriptions to save some cash"
Also op: "2 x DGX Spark linked via ConnectX 7..."
entsnack@reddit (OP)
I'll breakeven in no time because I'm also saving on concerts and merch. Power bills are a problem but my old neighbor doesn't mind me borrowing some of her electricity for my music consumption.
meganoob1337@reddit
the more you buy the more you save?
goobuddy@reddit
https://youtu.be/cSOHAM-ADzc?si=VkwqWjE_TwO4Xyi1&t=3
Ok-Buffalo2450@reddit
Jensen loves you.
JLeonsarmiento@reddit
bnm777@reddit
"I'm saving on concerts"
As he sits, alone, listening to machine generated tones.
What a totally non-miserable existence :/
mivog49274@reddit
yeah this guy is just way too much in the future which makes him wrong. he just speedran all of the flaws and ills produced by super capable automated systems, a global atomised system of entertainment in a dystopian society...
what's goofy though is imagining someone being satisfied with the current state of (local !) generative AI on music and presenting himself as a music lover (but not a sweat lover obviously)
The setup is neat though ! But the speakers... would rather be called sloppers here ;)
entsnack@reddit (OP)
Hey hey these are Hifiman Arya Stealth PLANAR magnetic headphones! They sound good to me!
I like your framing of the future dystopian society, we will all have tokens tailored to our preferences injected into all our 5 senses all the time. I like building things resembling that future, and also collect things from the past that were flawed attempts at present-day technology.
draconic_tongue@reddit
do redditors actually feel helpless about the future or are they just farming karma?
Hypilein@reddit
The way we’re speedrunning into some of this dystopian sci-fi shit I’m not so sure about that anymore.
Frequent-Mud8705@reddit
I used to be exited about dynamic generative music till I realized it all sounds tinny & incredibly generic. Much more fun to curate my own library of artists that I have a larger connection to
SkyFeistyLlama8@reddit
"new miserable experience"
"congratulations I'm sorry"
entsnack@reddit (OP)
I love my time alone in my listening corner ngl, knowing that I can mashup Nickelback and Korn in an instant is freeing.
Technically, all the tones you hear in played back music are machine generated. A machine reproduces the recording of humans creating music. The only difference here is the absence of human input, which has been pretty trash since 2011 IMHO.
Miserable-Dare5090@reddit
Agree on music since 2011, disagree on the essence of hearing a human create something from their experience out into the universe that other humans hear and relate to.
Shrimp Bizkit won’t be remembered in 10 years and neither will Limp Bizkit.
entsnack@reddit (OP)
Memories are what you choose to create for yourself, it does not matter what the world remembers if it does not evoke emotions in you. I for one will remember both Limp and Shrimp well into my old age. And the fact is, no one elsw can remember Shrimp, because it's for my ears only. :)
StewedAngelSkins@reddit
I genuinely can't tell if you're fucking with us. How can you possibly be spending so much money on music when your taste is "artists which are indistiguishable from procedurally generated elevator music"?
entsnack@reddit (OP)
sounds like a skill issue tbh, do you even prompt bro?
Pro-editor-1105@reddit
Cause 2 nvidia dgx sparks replaces a concert
entsnack@reddit (OP)
It does not replace a concert. What I meant is I can't watch someone perform my generated music live in concert because that musician does not exist. So I end up saving on concert tickets, at least until live musicians start performing AI generated music.
splix@reddit
I mean, few projectors to cover the walls and a few more Sparks to generate the video and you can enjoy their concert right at home
entsnack@reddit (OP)
one can dream man
TheRealMasonMac@reddit
"I decided to sell my lamborghini to save some cash, and so I bought a private jet.”
Ollie_IDE@reddit
This comment is on point haha.
DoorStuckSickDuck@reddit
You should hook up a net of lavalamps that influence the random seed you use in your music generation ;^)
entsnack@reddit (OP)
I actually have an eInk display that will eventually plot the waveform, SINAD, FFT, jitter, etc.. Once I have that I can get rid of my headphones and focus on making sure my gear measures well.
MrPecunius@reddit
Fucking bravo 👏
As a live sound engineer (just got home from a gig), musician, and fellow owner of a pile of Schiit, I absolutely see what you did there.
relentlesshack@reddit
Ceema is this you lol
saltyourhash@reddit
Hit me with the Plex deets. We need some death metal yodelling bagpipe mixes.
entsnack@reddit (OP)
you have inspired my next prompt
kodewerx@reddit
Maybe some EBM sea shanties while you are at it.
a_beautiful_rhind@reddit
Suno made me some bangers.. is ace-step actually any good? Need that sweet sweet copyrighted content in the training data.
entsnack@reddit (OP)
The copyrighted content is in my hard drive, and training on it is legal since it is for noncommercial use.
a_beautiful_rhind@reddit
How much did you have to train it?
entsnack@reddit (OP)
I only post-train and do DPO right now, still need to figure out SFT. I don't want to fuck it up to mimic my existing collection so it's tricky.
Zulfiqaar@reddit
Now you've got me interested. Looking at the way most proprietary song generators are going the past year, I'm actively preparing my distillation dataset for when I finally get around to training ACE-Step or something. Got any good tuning guides by any chance?
a_beautiful_rhind@reddit
Having to do that put me off from it. I heard it was meh by default. But maybe you will release a tune or someone will. Then I have suno at home.. I think it asks for phone# now so I don't even have suno online.
WithoutReason1729@reddit
Your post is getting popular and we just featured it on our Discord! Come check it out!
You've also been given a special flair for your contribution. We appreciate your post!
I am a bot and this action was performed automatically.
Porespellar@reddit
FINALLY a solid shitpost. Thanks OP! NGL, I would probably buy some Shrimp Biscuit merch.
entsnack@reddit (OP)
it's biZkit u lil punk
lemondrops9@reddit
Its going to be a lonely world when we all have are "own" music
FirePit45@reddit
A+. Brilliant. No notes.
You had me until Shrimp Bizkit and then I lost it.
BrianJThomas@reddit
Can you share some examples of your music?
llama-impersonator@reddit
i messed with ace-step for a couple days, then i listened to aphex twin and was like, holy shit all of that stuff i was listening to is just complete garbage. not even slop.
entsnack@reddit (OP)
I messed with barbells a few days at the gym, then I saw arnold schwazzeneger lift and was like, holy shit all the excercise i was just doing is complete garbage.
llama-impersonator@reddit
yeah watching arnold lift would be as demotivational as listening to aphex twin after slapping retro synthwave anthem with female singer griping about deez nuts 100 times into the gradio demo running acestep-v15-xl-sft
LatentSpacer@reddit
Let’s pretend the crap you’re getting out of Ace-Step is as good as the music produced over the past decades.
entsnack@reddit (OP)
Why so mad at your own lack of skill?
regnus418@reddit
Schiitty setup.
CorpusculantCortex@reddit
I want to hear shrimp bizkits latest single
Foreign_Hand4619@reddit
Now you pay your electric bill instead. Pretty sure it costs more than music subscription 😄
entsnack@reddit (OP)
It costs $60/mo on full load but my old neighbor is on state disability support so I've spliced into her panel and get it for free.
brenden77@reddit
Wild.
Dazzling_Equipment_9@reddit
This is the most interesting post I saw today, including the comment area.
AmphibianFrog@reddit
I do think the technology is very interesting, even if I don't see any appeal in it whatsoever.
SpaceTraveler2084@reddit
i see so many people who get a 10k GPU to just ask what they should do with it, this at least sound funny and ridiculous at the same point
awizemann@reddit
If you’re serious I’d want to listen in.
tavirabon@reddit
Lol, what?
If you can budget a DGX Spark, you were never at risk of whatever you think you're solving.
entsnack@reddit (OP)
You can't solve a finite supply of music with $10,000.
I'd love to sign a band feeding Limp Bizkit-like music into my brain on-demand for hours at a time. And also Eminem-like music. The cost of an on-demand band like that would be exorbitant.
tavirabon@reddit
A finite supply isn't something that needs solving any more than sunlight being finite. It's ridiculous to hear, all musicians could be teleported from Earth and you could still have a lifetime supply of music that speaks to you on a personal level. Adding AI does not help here, it makes finding good music harder if anything.
Equal_Giraffe8866@reddit
For some genres - shitty house, psytrance, gabber, chiptunes; anything on the pounding brainless electro-scale - there's simply no way a human could tell the difference at this point.
entsnack@reddit (OP)
I mean most music sucks in all genres. Which is why its plentiful and cheap. A handful or artists sell out arenas. The handful I like don't the make music I like anymore because it doesn't sell or they're dead.
Equal_Giraffe8866@reddit
People will hate on you but who will have the courage to hate on all the dogshit artists? You're doing the right thing. Good luck.
shadowmage666@reddit
Lmao what is this 😂
1millionnotameme@reddit
Share some of it, I'm curious what you think is good music
entsnack@reddit (OP)
Limp Bizkit, you can find some albums at your pawn shop.
Foreign_Risk_2031@reddit
Mmm 16khz audio
entsnack@reddit (OP)
The base audio quality is low resolution. However, you can take the image of the waveform and upsample it using WAN or any AI upsampler, then convert it back into audio.
yes_i_tried_google@reddit
This is one of the few AI Projects out there that make me go “wow, I want this in my life”.
Now going to see if my 3090 ti can pull anything off like this. Please hit me up with your Plex deets while I go and make Zoombie by the loganberries
orinoco_w@reddit
I thought Kombi by the Vanberries was better
jebenasarma@reddit
Retardmaxing
Toastti@reddit
Gotta share one of the songs itade with us
seamonn@reddit
What is this? Where can I learn about this?
entsnack@reddit (OP)
https://gepa-ai.github.io/gepa/blog/2026/02/18/introducing-optimize-anything/
complexminded@reddit
Nice Schiitt! A person of culture. Similar stack going on here
DAlmighty@reddit
I had to scroll too far to see the Schiit name. I need to upgrade my stack.
entsnack@reddit (OP)
I'm getting this piece of Schiit next week, bought it from the creator: https://www.reddit.com/r/Schiit/s/NhyBuA79of
learn_and_learn@reddit
r/music is either gonna hate this or they're gonna love this. No in betweens
the__storm@reddit
I would bet everything I own that they would hate it lol
learn_and_learn@reddit
I wanna see how it does in r/music and r/audiophile
DAlmighty@reddit
Oh Schiit
ArtfulGenie69@reddit
This is a fun idea that just about all of us could do. How good is that ace step model do you think in comparison to the previous? I had trained on the first ace step and was getting very close to something actually good. Just haven't gotten back around to test the set on the new model. It would be really cool to have the old school 90's trance and rock bands and have new stuff all the time. I also hate the new produced music. All they do is rip off some of the techno trap simple 4/4 beat and then throw whatever on top. Fake country, fake folk, and rock is just dead. They do it to every genre now. Like that dumb simple beat and then some shit studio singing on top with lyrics that are generally trash.
Tried lora on Meat is murder as well as Slipknot (for my bro haha). It had way more trouble learning metal like slipknot than Morrissey.
jcdoe@reddit
Share some tracks?
elzZza@reddit
very cool thought to use llm for music generation like this
f5alcon@reddit
How long is your music gen take? I'm using a 5070ti and even with some cpu offload can do 4 versions of the same song in about real time, most of the time one version is acceptable and I move on to the next one. I did a whole album this way in an afternoon, and lyrics and style were done by Claude who created a complete narrative for the album so it tells a story.
Currently working on image to video with ltx 2.3 to create music videos for the songs.
Tartness3491@reddit
One of the most creative uses I've seen in here. Can you share some sample remixes?
lamardoss@reddit
This is something I do with Ace also. Love it! I also have a small 2B LLM name each rendered clip before it gets played and it goes into two folders, one for the queue and one for keeping. I keep the most recent 300 clips for each Ace stream I have going (different music genres) and I can then go back and listen to them whenever I need that GPU resource to do other things and need to temporarily pause generation or if I love a clip so much that I want to keep it. Really love that stack you have there!
BobLobLawsLawBlawg@reddit
This is the best use of AI ever.
entsnack@reddit (OP)
tokens straight into my brain
Finanzamt_Endgegner@reddit
You are insane, i love it 🤣
boston101@reddit
Op you are genius and ridiculous - I love it.
entsnack@reddit (OP)
inb4 mods see this 😆
SoupSuey@reddit
Well, I never thought of that to be honest. Maybe you should share some of the best musics on SoundCloud, I’m particularly interested in hearing Shrimp Bizkit. Also, SoundCloud could be the conmunity you’re looking for.
uti24@reddit
why would you need two DGX Spark for that? Even on a single one Ace-Step 1.5 XL already works probably twice as fast as realtime
entsnack@reddit (OP)
With 240GB of unified memory I can generate roughly 12 3-4 minute songs in parallel. A lot of these songs are crappy and need to be thrown away for the next evolutionary optimization step, and the Spark is slow, so I need the VRAM and parallelism to generate good music at a reasonable rate.
Once generation quality gets better, I'll have fewer bad generations and can probably repurpose the GPU for other things.