Breaking the music supply constraint

[-]

Dany0@reddit

This is kinda cyberpunk but more the dystopian kind

[-]

Just because it's OSS doesn't mean it was ethically trained. Plus, if you really do love music, then I don't think that you'll be happy with OP's setup. I can imagine actually doing something like this for myself, but only if it was fully trained on just my playing, that would be actually fun, and maybe I'd even use it from time to time you know?

[-]

entsnack@reddit (OP)

Organic music is the past. The vinyl of 2026. We will remember the days we sniffed human sweat watching humans perform on stage.

[-]

-dysangel-@reddit

I used to smell human sweat while doing everything, but then I realised I had pretty open access to water and soap, and so I just get soapy water and mist it into my nose every so often.

[-]

Conscious-content42@reddit

"It's the smell, if there is such a thing" https://www.youtube.com/watch?v=yL9Y24ciNWs

[-]

yourgamermomthethird@reddit

Didn’t imagine 2 of my interests would be on the same sub this is pretty cool

[-]

AbsoluteHedonn@reddit

This is garbage engagement bait

[-]

entsnack@reddit (OP)

What about it makes you think so? Is it not related to local LLMs broadly construed?

[-]

truthputer@reddit

Your post is just about AI slop and rage bait.

Stop wasting our time with this shit.

[-]

entsnack@reddit (OP)

Sir, this is /r/LocalLLaMA

[-]

Vicar_of_Wibbly@reddit

It’s parody of the finest kind. Subtle, funny, and completely over the heads of the cerebrally encumbered.

[-]

entsnack@reddit (OP)

cerebrally encumbered

stealing this to use instead of smoothbrain

[-]

KingMitsubishi@reddit

This is the best post ever

[-]

Correct-Boss-9206@reddit

Pretty sure it's a joke/parody, shrimp

[-]

Song-Historical@reddit

Can you link some of your creations

[-]

entsnack@reddit (OP)

You sound bitter, has AI replaced you or something?

[-]

Song-Historical@reddit

I'm genuinely interested! I haven't heard good AI generated music tbh. I was an early user of Aria and Suno.

[-]

entsnack@reddit (OP)

Ah you should genuinely try it out then, the model is free :-)

[-]

Song-Historical@reddit

I don't have DGX sparks to spare

[-]

know-your-enemy-92@reddit

You can run ace step 1.5 easily even on 3060ti

[-]

Song-Historical@reddit

I only have the 3070 on my laptop

[-]

No-Promotion4006@reddit

We are Charlie Kirk is good AI generated music, very famous example

[-]

audioen@reddit

I just listened to it and while I would hesitate to call it actually good music, I agree it is definitely a good example. Its sound quality is nearly good enough to not be immediately distracting or weird, and it even seems to have something like a musical composition, though I personally find it quite boring to listen to -- just not enough variation.

[-]

bnm777@reddit

You misread people if you think that comment sounds bitter.

[-]

entsnack@reddit (OP)

Turns out they were indeed replaced by AI. Can't blame them for calling it leftover casserole.

[-]

ThinDoughnut3617@reddit

Yeah no, this is dystopian. Music is art, art is what it means to express as a human. No machine will & can reproduce that. AI can do whatever, but it should not touch art.

[-]

entsnack@reddit (OP)

There's a reason "struggling artist" is a trope, and my setup frees humans to do something more lucrative than making music.

[-]

ThinDoughnut3617@reddit

Like I said, music making is about expression and so much more than making a profit. If making music is meant to be lucrative and one does it for that sole reason, to seek profit, then that's a problem in itself and apart from what making music truly stands for.

[-]

entsnack@reddit (OP)

Well you sound European and maybe you should lobby for more worker protections there (or just move there). I'm enjoying listening to Chester Bennington sing new songs to me and won't give that up because someone thinks it's nOt TrUE aRt.

[-]

ThinDoughnut3617@reddit

I mean at this point you're just rage baiting. If not, you do you and enjoy it.

[-]

HeywoodJaBlessMe@reddit

loss of community

That's it. AI generated music has almost zero value except as a curiosity.

[-]

vanonym_@reddit

I uuuuh... I'm not sure I would enjoy that listening experience. But good for you I guess?

What's the output format and resolution of your system and how does that play, don't you have a huge quality drop from lossless formats provided by other listening platforms?

[-]

entsnack@reddit (OP)

The base audio quality is low resolution. However, you can take the image of the waveform and upsample it using WAN or any AI upsampler, then convert it back into audio.

[-]

NandaVegg@reddit

>The base audio quality is low resolution. However, you can take the image of the waveform and upsample it using WAN or any AI upsampler, then convert it back into audio.

Is this really a thing?! Assuming visual waveform upscaling actually works, you will still need 10000+ visual chunks for 2-3 minutes of audio just to do that interpolation visually.
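A rough back-of-the-envelope check of that chunk count, as a sketch only. It assumes one audio sample per pixel column, CD-rate audio (44.1 kHz), and a hypothetical image width; none of these numbers come from OP's setup, and `waveform_image_chunks` is just an illustrative name.

```python
import math


def waveform_image_chunks(duration_s: float,
                          sample_rate_hz: int = 44_100,
                          samples_per_image: int = 1024) -> int:
    """Count the images needed if each pixel column encodes one audio sample.

    All defaults are assumptions for illustration, not a real pipeline.
    """
    total_samples = int(duration_s * sample_rate_hz)
    return math.ceil(total_samples / samples_per_image)


if __name__ == "__main__":
    for minutes in (2, 3):
        for width in (512, 1024):
            chunks = waveform_image_chunks(minutes * 60, samples_per_image=width)
            print(f"{minutes} min, {width}px-wide images: {chunks} chunks")
```

Under these assumptions, 2 minutes at 512 samples per image already lands just above 10000 chunks, so the estimate is in the right order of magnitude.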

[-]

entsnack@reddit (OP)

most definitely not a thing, neither are cables.

upsampling is tricky but we are making progress, it's already done well commercially, just needs to trickle down to open weights.

[-]

IrisColt@reddit

ca-cables? Like literally physical cables?

[-]

entsnack@reddit (OP)

Yes, this one from Alo Audio (now Campfire in Portland, OR) is particularly good: https://aloaudio.com/products/gold-16

[-]

Jester14@reddit

lmfao I still can't tell if this is a troll post

[-]

entsnack@reddit (OP)

I mean the cable-shilling of a $1000 cable should have been a giveaway.

[-]

IrisColt@reddit

heh

[-]

Capt_Blahvious@reddit

Ok, now I know you're either shitting us or you are smelling your own audiophile farts.

[-]

gtrak@reddit

Only a true audiophile will hear the difference

[-]

IrisColt@reddit

Thanks!!!

[-]

vanonym_@reddit

Thank you for mentioning upsampling. I think that cables are like the very last thing I would consider important when trying to get hifi AI audio, the generation part seems to be the bottleneck but eh i didn't test it.

[-]

--Spaci--@reddit

I've never understood AI music, not only was it never really good, it obviously just had no soul

[-]

the320x200@reddit

When recording was invented there was a movement against it. "Canned music has no soul!" How could it compare to live music. Just a mechanical reproduction of the real thing!

Pretty much every new technology is seen as soulless at first.

[-]

--Spaci--@reddit

This isn't even made by a human, I think there's a pretty large difference here. You are comparing two different ways to listen to music made by humans to listening to music made by AI

[-]

Recoil42@reddit

Tbh, I've been playing around with Suno lately and there's some pretty good stuff out there. I wouldn't say we're best-of-human-level yet, but listenable? Definitely.

[-]

--Spaci--@reddit

That does have quality but still no soul, a human didn't make it. Say like im listening to jcole for example, I feel like I understand this person in like a semi parasocial way through their bars and lyrics, I don't listen to music just because it sounds good

[-]

Recoil42@reddit

Frankly if you define soulful as human-made, then that's tautological and running the discussion immediately off the rails. But I didn't say soulful, I said listenable. The contention was only that the technology is good and getting better, not that it's ready to replace the best of human art.

[-]

SpicyWangz@reddit

It’s not ready to replace the best of human art, because it will only ever be an imitation of art.

[-]

Borkato@reddit

I thought this was a pro AI sub

[-]

funkdialout@reddit

Ah yes, please keep the echo chamber clear of dissenting opinions!

[-]

SpicyWangz@reddit

I love AI. I use it every day.

[-]

the320x200@reddit

Photography is also not art. The human didn't make the image. All they did was press a button. /s

[-]

SpicyWangz@reddit

Definitely a lesser form of art.

[-]

Recoil42@reddit

Most of the best of human art is functionally imitation of other art. That's like... the entire history of art.

[-]

SpicyWangz@reddit

All of the best human art is good because there is real emotion and story behind it.

[-]

the320x200@reddit

What happens if an artist makes a piece as a commission and their heart isn't in it but they draw it anyway? Is that not art anymore?

What if you don't know the artist's intentions or emotions and have to just judge the piece in isolation?

[-]

Recoil42@reddit

The Mona Lisa was a commission piece. Perhaps the most famous piece of artwork in history was just a rich guy's wife doing what essentially amounted to a contemporaneous version of what would today be at best, a photo studio session.

[-]

Recoil42@reddit

When John Lennon was singing about walruses it's because he really was both the eggman AND the walrus. Nailed it. The art understander has logged on.

[-]

AmphibianFrog@reddit

A lot of works of art are imitations of or inspired by other works of art. But in a lot of ways "AI Art" is an imitation of "Art".

[-]

Recoil42@reddit

If art is imitative then asserting that AI art is imitative does not make AI art not art.

[-]

AmphibianFrog@reddit

Well it depends on your definition of art! If the clouds in the sky happened to make the most beautiful image ever, it still wouldn't be art because it wasn't made with the kind of intent that makes it art.

I think the current definition of art involves a human, or at least some kind of sentient being.

[-]

Recoil42@reddit

>Well it depends on your definition of art!

That's the wonderful thing, I'm using your definition of art. You very explicitly said art can be imitative. If art is imitative then media produced with the assistance of an ML model is not disqualified from being art simply by having an imitative component.

>I think the current definition of art involves a human

I don't think that's true, nor would it fundamentally disqualify AI art from being art even if it were true. Machine learning models are created by humans and used by humans. The inputs are human. They do not have the property of being human-free.

FYI: Warhol, Picasso, Cage, Ono, Duchamp, Pollock, and many more spent most of their careers exploring the boundaries of this stuff. I don't expect r/Locallama to be art school central, but this really is Art 101: most of the 20th century's great movements are "is art still art if we peel the preconceptions of what art is away from art?" and that's what Dadaism and Fluxus were both about.

[-]

AmphibianFrog@reddit

This is really interesting. I have some ideas in my head about what constitutes art and what doesn't, but the more precisely I try to write them down, the more they seem vague and poorly defined.

I don't necessarily think that the fact that AI imitates human art is relevant in determining if it is art to be honest. It could be art without imitating existing art, and it could imitate existing art and not be considered art in and of itself.

Also, I see what you're saying about how humans created AI therefore AI created art is human created art, but I'm not sure that's meaningful either.

But then again, there is a lot of human created "art" that I don't consider to be particularly artistic either!

I am still not sure of any definition of art that doesn't involve some kind of intelligent creator though. It seems like the original intent is important to some degree.

So basically, I have no answer or retort to anything in your response. But it was fun trying to find a counter example and realising how thin they all were!

[-]

Recoil42@reddit

>I am still not sure of any definition of art that doesn't involve some kind of intelligent creator though.

Yoko Ono is famously good at exploring the boundaries of this. Immediately coming to mind — Instructions for Paintings (1962), which is not actually a set of paintings but what you would now recognize as resembling a book of prompts for creating paintings. The paintings themselves do not exist, only the descriptions of them. See also Grapefruit (1964) which was the same, but for performances.

[-]

--Spaci--@reddit

Inspiration and imitation is different from generative AI

[-]

Recoil42@reddit

Special pleading and distinction without a difference both apply here. Just saying something is 'different' doesn't make it not art or a valid functional replacement for entirely human-produced art. It isn't even really an argument at all.

[-]

AmphibianFrog@reddit

I'm not the one who originally said "soul" but I think I know what he means.

For me personally, human made music is a lot more interesting. Human creativity comes out of lived experience.

Take for example that Dire Straits song about the shitty band in the pub. The Sultans of Swing! That song is cool on its own, but there's something about the fact that he actually walked into a pub and saw a real band playing.

There's a human connection.

Even with instrumental music, normally the creator at least liked the way it made them feel.

[-]

MindlesslyBrowsing@reddit

As a hobby musician, I'm afraid the masses don't care about that.

We still got the performance aspect I guess

[-]

AmphibianFrog@reddit

I don't think "the masses" are listening to AI generated music...

[-]

MindlesslyBrowsing@reddit

Some of it has already gone mainstream, sometimes as a "meme". For example BBL Drizzy

[-]

AmphibianFrog@reddit

If you take a random sample of the population in your country, on average, what percentage of the music that they listen to by choice do you think would be AI generated?

I have never met anyone in real life who listens to AI music.

And I'm really talking about pieces of music where the AI generated it completely from humans prompting it, not when AI was used to master it or add some little details to an existing piece of music.

[-]

MindlesslyBrowsing@reddit

Well i caught my parents listening to it and as I said it's been circulating through memes.

I'm not saying it's the most dominant form of music, I'm saying it's already here

[-]

AmphibianFrog@reddit

You said it's "for the masses". I don't really think that's true.

[-]

MindlesslyBrowsing@reddit

Fair, that's not what I see. To each their own 🤷

[-]

IrisColt@reddit

This song is awesome, THANKS!!! I love AI music in general (I started listening to my own generated pieces 15 years ago, just like D. Cope), but this one takes it to a whole other level.

[-]

ArtfulGenie69@reddit

You haven't heard what is possible and being generated today. When we get something that does as well at music as fish audio s2 does at voices you'll be done with the riaa as well. I'm not saying this as an ai chud either, this is coming from a dude who practices his guitar every day.

The other day I heard a Charlie Kirk ai song but they had done better lyrics and they had done it as a reggae/ska style. The song worked even though I hate Charlie. It was on a stream and it wasn't just me having a bad ear, the crowd watching was amazed by the quality.

[-]

lukistellar@reddit

And even in the better creations, you clearly can hear that this is generated by AI. It's in the highs, it sounds kinda like undersampling, especially in the vocals. Also the design of the room is weird in most AI tracks, like strange delays and reverbs.

Here is also a good example: https://www.youtube.com/watch?v=pAXKptUYf8M
The artifacts are way stronger in this one, but it's the first AI track I actually like musically.

[-]

ArtfulGenie69@reddit

The fidelity on charlie kirk is definitely not there fully, I can hear the ai hiss over the instruments and the singer's voice, and the piano/guitar combo that in actual reggae would be more defined: quick strums with a mute after and the stab from the piano at the same time. The ai doesn't get the idea of both those instruments, it is just guessing at the sound they make and it shows. Thinking about the song as a whole, it has a start, the singer nails his lyrics, it has the call and response between the singer and the horns, and the end also has a slow down ending.

I was able to get better representation of the instruments in bands when I demuxed my training tracks, giving it instrument solos, but it overtrained and would add more crackle. It would sound just like surf guitar though, even though the original surf guitar that it produced was utter garbage. When I found this out I also noticed that the starts and endings would also be garbled without unique descriptions; when the end was uniquely described it would actually do a sudden stop or a breakdown of the band into a hissing noise. Same with the starts, they have to be tagged with unique instructions. Ace step is really close, hopefully there will be something magnificent and new this year in music gen.

[-]

stl314159@reddit

I listen to it while I work specifically because it has no soul. It’s just repetitive and consistent and allows my ADHD brain to get into a flow state.

[-]

Frequent-Mud8705@reddit

ever heard of jungle/DnB?

[-]

AdministrativeMeat3@reddit

Literally throw on any UKF dnb mega playlist from the last 30 years and you've got yourself an immediate all day flow state

[-]

illforgetsoonenough@reddit

I used to mix it in the 90s :D

[-]

Adidat@reddit

Hook a brotha up. My adhd ass needs this. Also any advice for some soulless music for luxury real estate videos?

I’m so curious the kind of mashups op would be able to do with a large library like that.

[-]

stl314159@reddit

I can't help with the real estate videos but these are some things from my recently played:

https://youtu.be/dAfRWzh66yY?si=3L25EXyxCRV2iqU_
https://youtu.be/dAfRWzh66yY?si=1mWtNjvbEVtWxnT_
https://youtu.be/GISyZNwpprU?si=CLdi2qXvI5x-CYn1
https://youtu.be/9ic3a7SO6cA?si=wYia1VUzBwa99kSx
https://youtu.be/pL-90ThONJs?si=88org9pWQIlBUUCy

[-]

halr9000@reddit

Stable audio 3 is really good for this. It can't even do lyrics but it takes 2 seconds to make 2 minutes of audio.

[-]

IrisColt@reddit

This.

[-]

MaruluVR@reddit

It's not bad if you finetune it on exactly the music you like

[-]

entsnack@reddit (OP)

It does need good training data and prompting. Commercial setups are constrained by copyright laws. I have a large FLAC library to train on.

[-]

vanonym_@reddit

I don't really enjoy it either for similar reasons, let alone the fact that the output quality usually is terrible, that's why I'm asking OP about it.

But to each their own

[-]

GeologistPutrid2657@reddit

the fuck happens when you wanna hear something again? have we got unlimited storage and recall as well?

[-]

tmvr@reddit

You don't need to hear it again dude, carpe diem, if it's gone, just let it go, something new will come along 😃

[-]

entsnack@reddit (OP)

storage was cheap until very recently, effectively unlimited once prices come back down.

[-]

Main_Problem_2696@reddit

Two DGX Sparks running parallel AceStep models just for infinite Shrimp Bizkit is the most overkill local setup I've seen. The RLHF interface idea is legit though. Loss of community is the real tradeoff. Generated music has no band to follow or weird subreddit to argue in. Used Runable to document my own music gen experiments. Clean dashboard in 20 minutes. Made sharing specific outputs easier. Would love to hear what Phlegminem sounds like.

[-]

HavenTerminal_com@reddit

shrimp bizkit and phlegminem are the best artist names I've ever heard and I'm a little devastated they're not real bands

[-]

TheThoccnessMonster@reddit

Can you share your configs and loadout. I’ve got a couple sparks I’d love to try this in.

[-]

cleverusernametry@reddit

Oh yes we've always had a shortage of music. Ffs.

[-]

L064N@reddit

This is so interesting wow. Do you have some favorite songs that its made?

[-]

awsom82@reddit

Ridiculous, you don't need AI for music 🎶

[-]

entsnack@reddit (OP)

You do need it for an infinite supply of personalized music on demand. :-)

[-]

JonMcElyea@reddit

I think it's weird you are being down voted in a local-first community. You are describing what I consider the best personal music creation workflow using Ace-step, and for some reason, people HERE are criticizing that effort. Baffled.

Keep up the cool experiment and the satire. It's all great to read about.

[-]

xquarx@reddit

It's not hard to get hold of more music than hours in your life.

[-]

entsnack@reddit (OP)

Find me more nu metal with a whiny vocalist bro, do it, punk

[-]

MiElas-hehe@reddit

That's the beauty of music; some are one of a kind. When you do find a similar artist, boy does it feel good. If not, you make it yourself! Without AI of course, the good old authentic real human effort way. It might sound shit in the beginning, but that is what gives it SOUL.

[-]

NelisMakrelis@reddit

As a musician, you saying you prefer ai music over anything made after 2011 makes me both incredibly sad and disappointed as well as straight up annoyed, because that’s just simply bullshit. If you don’t like what’s on the mainstream just stream any of the other 10k songs that are uploaded by hardworking people you have something in common with: being human and knowing what that’s like.

[-]

entsnack@reddit (OP)

You misunderstood me: I prefer my AI music. Why: because it sounds better to my ears! Artists don't produce my type of music anymore because it does not sell.

Have you ever built a chair yourself? DIYed a gadget? Baked your own cake? This is the same, and it means no disrespect to people who produce these things for a living and require mass appeal to survive.

[-]

youcloudsofdoom@reddit

This analogy only makes sense if by building your own chair you also consume an insane amount of energy and resources every time you sit on it

[-]

entsnack@reddit (OP)

>insane amount of energy

This entire cluster consumes about the same power as a 4090 (JUST the GPU). Go yell at the gamers for their INSANE energy consumption every time they game lol.

[-]

youcloudsofdoom@reddit

No, I'm yelling at you for composing the most consumptive way to listen to music in the history of humanity, during a climate crisis.

[-]

entsnack@reddit (OP)

What? My entire stack consumes less power than the average 5.1 speaker system. Do you have a source for your claim or just vibes?

[-]

youcloudsofdoom@reddit

The problem isn't amplification, it's generation.

The embodied carbon cost of what you're doing is substantial; it's not simply the cost to run, but the embodied carbon in the Sparks is massive, as is that involved in training and distributing the models themselves. If you're using this system to generate new music each time you play something (which seems implied in your post), it's like buying an album on a physical format, listening to it once, then throwing it away and buying another (similar) one, and doing the same over and over again. Only what you're doing involves a LOT more rare earth metals and non-recyclable future e-waste.

Streaming music alone was already more consumptive than previous formats (https://www.cambridge.org/core/journals/popular-music/article/abs/cost-of-music/DEC6AA100C191D510213F9086CF094CC). But what you've deployed here is orders of magnitude more consumptive than the typical user streaming to their phone; data centers are at least reasonably efficient at their (ecologically catastrophic) jobs in terms of co2/stream, which your setup is far from. You've created an additional, unwarranted step in the consumption of music, using what is widely understood to be one of the most consumptive technological infrastructures (generative AI) that humanity has ever produced, and you're doing it during an unfolding climate crisis.

[-]

StewedAngelSkins@reddit

And yet he's still causing less harm than your average gamer. Seriously, if you actually care about this and aren't just using it as a proxy to be mad about AI then go after the stuff that makes the biggest impact first.

[-]

rorykoehler@reddit

Also I made music myself. This isn't remotely similar. I guess you are joking but if you aren't... i have no words.

[-]

rorykoehler@reddit

Post your slop so we can judge you!

[-]

JazzlikeLeave5530@reddit

Any time someone says "XYZ nowadays is all bad" I just assume they're lazy. Mainstream is always gonna be crap but there are so many artists nowadays everywhere making amazing songs and with like 1,000 listens on their stuff. This reads to me like "I'm so glad now that I can eat nothing but gray paste that I made myself."

[-]

StewedAngelSkins@reddit

I mean it's a cool concept but you couldn't pay me to listen to that slop. It's worse than the radio. Wait until OP finds out about soulseek...

[-]

entsnack@reddit (OP)

Ha I'm a massive "soul seeker" let's just say, I have about 10 TB of FLACs and my primary portable device is a flashmodded iPod Classic. I am actually trying to use these FLACs for training. I don't get the deal with vinyl rips on soulseek though, I keep downloading them by mistake because they're 192 kHz, and they have a high noise floor and crackling.

[-]

StewedAngelSkins@reddit

How is this saving you money then? You already have an infinite supply of music for free.

[-]

entsnack@reddit (OP)

I want Fred Durst singing to me for a couple of hours everyday, there's only so many times I can re-listen to Hot Dog.

[-]

StewedAngelSkins@reddit

>there's only so many times I can re-listen to Hot Dog

At least we agree on one thing. For me that number is 1.

[-]

StewedAngelSkins@reddit

So you want to keep listening to imitations of the exact same artists forever, to the point where even finding similar artists won't do it for you, but not the exact same songs? You have interesting taste in music.

[-]

wrathfulrapier@reddit

There are thousands of artists making millions of songs, not to mention the pre-existing millions of songs with just about any combination you could ask for if you just look it up. Especially considering that you can only generate music off of pre-existing music, it's an interesting set up but I'm not seeing the appeal (especially with the loss of community)

[-]

entsnack@reddit (OP)

find me a numetal band with a whiny male vocalist, do it, has to be whiny + numetal rap/rock. show me one in your thousands of artists doing this.

[-]

draetheus@reddit

Check out Hellhills. A little light on the rap but damn if that isn't some early 00s whiny numetal.

[-]

wrathfulrapier@reddit

Also Daron Malakian System of a Down

[-]

entsnack@reddit (OP)

I've heard ALL his songs. Simpler: find me a Chester Bennington song I haven't heard. He's not making new ones last I checked. My setup generates infinitely many Chester songs.

[-]

AmphibianFrog@reddit

They're not really Chester Bennington songs if he didn't make them though...

[-]

entsnack@reddit (OP)

Doesn't matter if what I want to hear is his voice. Look, I understand this isn't for you, I'm not sure why you don't understand this is what I need and enjoy.

[-]

AmphibianFrog@reddit

Why do you think I don't understand that? I'm expressing my opinion, not trying to decide yours!

[-]

entsnack@reddit (OP)

Cool, maybe try to create instead of opining and you'll have more interesting takes than GPT 5.5. ✌️

[-]

AmphibianFrog@reddit

I create plenty. I create plenty of music too.

But I don't really consider asking Suno to make some music to be an act of creation. I also don't consider asking Claude code to write something to be an act of creation either. Which, as a software engineer, is definitely making me think some things...

[-]

wrathfulrapier@reddit

You can't get your multi-thousand dollar setup to do this for you?

[-]

entsnack@reddit (OP)

Yes which is why I built this lol. I have 20TB of FLACs that didn't fix my need.

[-]

wrathfulrapier@reddit

I highly recommend getting hooked up with last.fm and ListenBrainz and getting your tastes punched in for some fine tuned recommendations, you may be surprised! I would be absolutely heartbroken if I wasn't able to link up with other people about the music I loved, it's like 20% of the enjoyment for me. And as a side benefit it would free up compute power for you to focus on other tasks

[-]

entsnack@reddit (OP)

My lastfm account is 20 years old lol, I have scrobbled from Winamp on Windows 95 and displayed my listening status on MSN Messenger. Thanks for the recommendation tho.

[-]

audioen@reddit

https://www.youtube.com/watch?v=8Zzz2c0uhR0 I'm not entirely sure about it. Clearly, this is human-AI collaboration, but something like forest doomcore is underserved genre for sure. While the music is just fairly generic death metal type of thing, there is wonderful undertone of silliness about the whole production.

I like AI in a similar way that I like cartoons -- they can just draw whatever weird stuff that would be impractical to film, and thus there is unleashing of a kind of raw creation that is based on mixing existing stuff, yes, but in a way that might not have been done before.

[-]

mintybadgerme@reddit

This is simultaneously genius and the music industry's biggest nightmare. Once this happens for real, it's game over.

[-]

Recoil42@reddit

This post both rocks and is also completely indistinguishable from parody, I love it.

[-]

EvilPencil@reddit

I know right? I was hoping to see an AI music recommendation engine that plugs into the *arr stack and curates the playlists, etc. Because then the actual music would be real.

[-]

entsnack@reddit (OP)

Takes a scientist to know one. 🫡

[-]

tmvr@reddit

[-]

dangerous_inference@reddit

This is a pet peeve of mine. Zero means nothing. It's not an item, it's the lack of an item. WHY ARE YOU LIKE THIS

[-]

Conscious-content42@reddit

Whoooa loading this using old.reddit.com, changed the index to 1-3, and new Reddit (www.reddit.com) is 0-2 indexed. I was like what is he talking about index of 0. Old Reddit knows the trickery.

[-]

zuraken@reddit

Same, i had looked again and saw no 0

[-]

MathmoKiwi@reddit

Favorite part of your post was starting your list from 0

Which then broke the reddit formatting

[-]

Spiritual_Trick_6655@reddit

The litmus test for satire.

[-]

xquarx@reddit

I did this, but the opposite. Build a massive FLAC collection of historical bangers, run STEM processing on them (vocal, melody, bass, drums), use AudioMuse-AI to embed the music (track similarity), fork Mixxx and vibe integrate it all (distance, harmonics), then become a DJ. It is so much fun.
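A minimal sketch of what the "distance, harmonics" part of such a workflow could look like, not the actual code described above. The embeddings here are made-up two-dimensional vectors (real ones would come from an embedding tool and be much longer), keys are assumed to be in Camelot notation like `8A`, and the names `cosine_similarity`, `camelot_compatible`, and `next_tracks` are hypothetical helpers, not AudioMuse-AI or Mixxx APIs.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def camelot_compatible(key_a, key_b):
    """Common Camelot mixing rule: same number (either letter),
    or one step around the 12-position wheel with the same letter."""
    num_a, letter_a = int(key_a[:-1]), key_a[-1]
    num_b, letter_b = int(key_b[:-1]), key_b[-1]
    if num_a == num_b:
        return True
    step = (num_a - num_b) % 12
    return letter_a == letter_b and step in (1, 11)


def next_tracks(current, library, top_k=3):
    """Harmonically compatible tracks, most similar embedding first."""
    candidates = [
        t for t in library
        if t["title"] != current["title"]
        and camelot_compatible(current["key"], t["key"])
    ]
    candidates.sort(
        key=lambda t: cosine_similarity(current["embedding"], t["embedding"]),
        reverse=True,
    )
    return candidates[:top_k]


# Toy library with invented values, for illustration only.
LIBRARY = [
    {"title": "A", "key": "8A", "embedding": [1.0, 0.0]},
    {"title": "B", "key": "9A", "embedding": [0.9, 0.1]},
    {"title": "C", "key": "8B", "embedding": [0.0, 1.0]},
    {"title": "D", "key": "3A", "embedding": [1.0, 0.0]},
]

if __name__ == "__main__":
    print([t["title"] for t in next_tracks(LIBRARY[0], LIBRARY)])
```

Filtering on key first and ranking by similarity second is just one possible design choice; a weighted score combining both would be an equally reasonable variant.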

[-]

rorykoehler@reddit

That's genius. Did you post the code?

[-]

xquarx@reddit

Sorry I've not shared it anywhere, its quite involved and custom tailored with gluing various FOSS projects together with little modifications. Very clean workflow for me, but not a quick setup. More a proof of concept that could be refined into a solution.

[-]

entsnack@reddit (OP)

This sounds fun! I've done this at a small scale with demucs for a karaoke setup. I love the DJing idea!

[-]

xquarx@reddit

Yes basically that. I did small scale but then I enjoyed it so much I automated my entire library.

[-]

Photochromism@reddit

Is Ace-Step actually any good? I tried it a year ago and it sounded like shit.

[-]

Nicking0413@reddit

Can’t even tell if this is parody or not. Maybe I’m just stupid

[-]

Desperate-List442@reddit

Might be bottom 10 posts OAT

[-]

entsnack@reddit (OP)

I've never been a bottom so I can't really tell. What's it like?

[-]

Desperate-List442@reddit

Is that supposed to be like an insult or something

[-]

Neither-Ad-8957@reddit

beautiful!

[-]

gufhHX@reddit

I want to marry who ever owns this apartment and stuff. Take the ring right now I tells ya

[-]

Raredisarray@reddit

Github Repo?

[-]

entsnack@reddit (OP)

sorry I'm not a vibecoder

[-]

rorykoehler@reddit

Lol

[-]

perfopt@reddit

"the loss of community" create a website and post there - invite people to listen, comment etc.

[-]

Southern_Sun_2106@reddit

Somebody a while ago posted a public repo project of a radio station that generates endless talk shows. It was very well done. I took that and converted it to a music/talk show station; played with different styles; then ended up having it make songs based on real poetry of several favorite poets (copyright free, from the 18th, 19th, and early 20th centuries, about love and such, some other topics). It was great for some time, some songs were really good. Family members reported it was better than paid services, and they preferred the 'home' radio station. Also tweaked the talk shows to match family interests.

Everything was powered by M3 Ultra plus qwen 27B and Kokoro.

Issue - you know how you come to 'learn' the models eventually? Even 'frontier' models like Claude become stale at some point. Anyway, it all starts feeling the 'same' after a while. I ended up just doing a 'music tool' for the kids - they write their own lyrics, M3 Ultra makes music, songs, etc. That simple tool ended up having a much longer life and is still used and preferred to paid online services, surprisingly.

[-]

devilandall@reddit

Dude, that’s neeeeeeeet. Have you shared Shrimp Bizkit anywhere? I would really like to listen to it, don’t have Aryas unfortunately, but good old 400s should do it for me for now

[-]

cmndr_spanky@reddit

That is wild, can I get a link to listen somehow ? You can PM if you prefer

[-]

andrew45lt@reddit

So you just listen to how the tokens are generated?

[-]

Mamaun30@reddit

Op: "I just cancelled my music subscriptions to save some cash"

Also op: "2 x DGX Spark linked via ConnectX 7..."

[-]

entsnack@reddit (OP)

I'll break even in no time because I'm also saving on concerts and merch. Power bills are a problem but my old neighbor doesn't mind me borrowing some of her electricity for my music consumption.

[-]

meganoob1337@reddit

the more you buy the more you save?

[-]

goobuddy@reddit

https://youtu.be/cSOHAM-ADzc?si=VkwqWjE_TwO4Xyi1&t=3

[-]

Ok-Buffalo2450@reddit

Jensen loves you.

[-]

bnm777@reddit

"I'm saving on concerts"

As he sits, alone, listening to machine generated tones.

What a totally non-miserable existence :/

[-]

mivog49274@reddit

yeah this guy is just way too much in the future which makes him wrong. he just speedran all of the flaws and ills produced by super capable automated systems, a global atomised system of entertainment in a dystopian society...

what's goofy though is imagining someone being satisfied with the current state of (local !) generative AI on music and presenting himself as a music lover (but not a sweat lover obviously)

The setup is neat though ! But the speakers... would rather be called sloppers here ;)

[-]

entsnack@reddit (OP)

Hey hey these are Hifiman Arya Stealth PLANAR magnetic headphones! They sound good to me!

I like your framing of the future dystopian society, we will all have tokens tailored to our preferences injected into all our 5 senses all the time. I like building things resembling that future, and also collect things from the past that were flawed attempts at present-day technology.

[-]

draconic_tongue@reddit

do redditors actually feel helpless about the future or are they just farming karma?

[-]

Hypilein@reddit

The way we’re speedrunning into some of this dystopian sci-fi shit I’m not so sure about that anymore.

[-]

Frequent-Mud8705@reddit

I used to be excited about dynamic generative music till I realized it all sounds tinny & incredibly generic. Much more fun to curate my own library of artists that I have a larger connection to

[-]

SkyFeistyLlama8@reddit

"new miserable experience"

"congratulations I'm sorry"

[-]

entsnack@reddit (OP)

I love my time alone in my listening corner ngl, knowing that I can mashup Nickelback and Korn in an instant is freeing.

Technically, all the tones you hear in played back music are machine generated. A machine reproduces the recording of humans creating music. The only difference here is the absence of human input, which has been pretty trash since 2011 IMHO.

[-]

Miserable-Dare5090@reddit

Agree on music since 2011, disagree on the essence of hearing a human create something from their experience out into the universe that other humans hear and relate to.

Shrimp Bizkit won’t be remembered in 10 years and neither will Limp Bizkit.

[-]

entsnack@reddit (OP)

Memories are what you choose to create for yourself, it does not matter what the world remembers if it does not evoke emotions in you. I for one will remember both Limp and Shrimp well into my old age. And the fact is, no one else can remember Shrimp, because it's for my ears only. :)

[-]

StewedAngelSkins@reddit

I genuinely can't tell if you're fucking with us. How can you possibly be spending so much money on music when your taste is "artists which are indistinguishable from procedurally generated elevator music"?

[-]

entsnack@reddit (OP)

sounds like a skill issue tbh, do you even prompt bro?

[-]

Pro-editor-1105@reddit

Cause 2 nvidia dgx sparks replaces a concert

[-]

entsnack@reddit (OP)

It does not replace a concert. What I meant is I can't watch someone perform my generated music live in concert because that musician does not exist. So I end up saving on concert tickets, at least until live musicians start performing AI generated music.

[-]

splix@reddit

I mean, few projectors to cover the walls and a few more Sparks to generate the video and you can enjoy their concert right at home

[-]

entsnack@reddit (OP)

one can dream man

[-]

TheRealMasonMac@reddit

"I decided to sell my lamborghini to save some cash, and so I bought a private jet.”

[-]

Ollie_IDE@reddit

This comment is on point haha.

[-]

DoorStuckSickDuck@reddit

You should hook up a net of lavalamps that influence the random seed you use in your music generation ;^)
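For anyone who wants to take the lava lamp idea literally, here is a minimal sketch of turning a snapshot of something unpredictable into a generation seed. Everything in it is an assumption for illustration: `seed_from_entropy` is a made-up helper, and `fake_frame` stands in for a real camera frame of the lamps. How the seed is then passed to a music model depends on that model and is not shown.

```python
import hashlib
import random


def seed_from_entropy(frame_bytes: bytes) -> int:
    """Hash raw bytes (e.g. a camera frame of the lamps) into a 64-bit seed."""
    digest = hashlib.sha256(frame_bytes).digest()
    # Keep the first 8 bytes of the digest as an unsigned 64-bit integer.
    return int.from_bytes(digest[:8], "big")


# Hypothetical stand-in for a camera capture; any bytes-like snapshot would do.
fake_frame = bytes(range(256)) * 4

seed = seed_from_entropy(fake_frame)
rng = random.Random(seed)  # a seeded RNG to drive whatever sampling follows
```

The same frame always hashes to the same seed, so a generation stays reproducible as long as the frame is saved alongside it.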

[-]

entsnack@reddit (OP)

I actually have an eInk display that will eventually plot the waveform, SINAD, FFT, jitter, etc. Once I have that I can get rid of my headphones and focus on making sure my gear measures well.

[-]

MrPecunius@reddit

Fucking bravo 👏

As a live sound engineer (just got home from a gig), musician, and fellow owner of a pile of Schiit, I absolutely see what you did there.

[-]

relentlesshack@reddit

Ceema is this you lol

[-]

saltyourhash@reddit

Hit me with the Plex deets. We need some death metal yodelling bagpipe mixes.

[-]

entsnack@reddit (OP)

you have inspired my next prompt

[-]

kodewerx@reddit

Maybe some EBM sea shanties while you are at it.

[-]

a_beautiful_rhind@reddit

Suno made me some bangers.. is ace-step actually any good? Need that sweet sweet copyrighted content in the training data.

[-]

entsnack@reddit (OP)

The copyrighted content is in my hard drive, and training on it is legal since it is for noncommercial use.

[-]

a_beautiful_rhind@reddit

How much did you have to train it?

[-]

entsnack@reddit (OP)

I only post-train and do DPO right now, still need to figure out SFT. I don't want to fuck it up to mimic my existing collection so it's tricky.

[-]

Zulfiqaar@reddit

Now you've got me interested. Looking at the way most proprietary song generators are going the past year, I'm actively preparing my distillation dataset for when I finally get around to training ACE-Step or something. Got any good tuning guides by any chance?

[-]

a_beautiful_rhind@reddit

Having to do that put me off from it. I heard it was meh by default. But maybe you will release a tune or someone will. Then I have suno at home.. I think it asks for phone# now so I don't even have suno online.

[-]

WithoutReason1729@reddit

Your post is getting popular and we just featured it on our Discord! Come check it out!

You've also been given a special flair for your contribution. We appreciate your post!

I am a bot and this action was performed automatically.

[-]

Porespellar@reddit

FINALLY a solid shitpost. Thanks OP! NGL, I would probably buy some Shrimp Biscuit merch.

[-]

entsnack@reddit (OP)

it's biZkit u lil punk

[-]

lemondrops9@reddit

It's going to be a lonely world when we all have our "own" music

[-]

FirePit45@reddit

A+. Brilliant. No notes.

You had me until Shrimp Bizkit and then I lost it.

[-]

BrianJThomas@reddit

Can you share some examples of your music?

[-]

llama-impersonator@reddit

i messed with ace-step for a couple days, then i listened to aphex twin and was like, holy shit all of that stuff i was listening to is just complete garbage. not even slop.

[-]

entsnack@reddit (OP)

I messed with barbells a few days at the gym, then I saw arnold schwarzenegger lift and was like, holy shit all the exercise i was just doing is complete garbage.

[-]

llama-impersonator@reddit

yeah watching arnold lift would be as demotivational as listening to aphex twin after slapping retro synthwave anthem with female singer griping about deez nuts 100 times into the gradio demo running acestep-v15-xl-sft

[-]

LatentSpacer@reddit

Let’s pretend the crap you’re getting out of Ace-Step is as good as the music produced over the past decades.

[-]

entsnack@reddit (OP)

Why so mad at your own lack of skill?

[-]

regnus418@reddit

Schiitty setup.

[-]

CorpusculantCortex@reddit

I want to hear shrimp bizkits latest single

[-]

Foreign_Hand4619@reddit

Now you pay your electric bill instead. Pretty sure it costs more than music subscription 😄

[-]

entsnack@reddit (OP)

It costs $60/mo on full load but my old neighbor is on state disability support so I've spliced into her panel and get it for free.

[-]

brenden77@reddit

Wild.

[-]

Dazzling_Equipment_9@reddit

This is the most interesting post I saw today, including the comment area.

[-]

AmphibianFrog@reddit

I do think the technology is very interesting, even if I don't see any appeal in it whatsoever.

[-]

SpaceTraveler2084@reddit

i see so many people who get a 10k GPU just to ask what they should do with it, this at least sounds funny and ridiculous at the same time

[-]

awizemann@reddit

If you’re serious I’d want to listen in.

[-]

tavirabon@reddit

Lol, what?

music supply constraint

If you can budget a DGX Spark, you were never at risk of whatever you think you're solving.

[-]

entsnack@reddit (OP)

You can't solve a finite supply of music with $10,000.

I'd love to sign a band feeding Limp Bizkit-like music into my brain on-demand for hours at a time. And also Eminem-like music. The cost of an on-demand band like that would be exorbitant.

[-]

tavirabon@reddit

A finite supply isn't something that needs solving any more than sunlight being finite. It's ridiculous to hear, all musicians could be teleported from Earth and you could still have a lifetime supply of music that speaks to you on a personal level. Adding AI does not help here, it makes finding good music harder if anything.

[-]

Equal_Giraffe8866@reddit

For some genres - shitty house, psytrance, gabber, chiptunes; anything on the pounding brainless electro-scale - there's simply no way a human could tell the difference at this point.

[-]

entsnack@reddit (OP)

I mean most music sucks in all genres. Which is why it's plentiful and cheap. A handful of artists sell out arenas. The handful I like don't make the music I like anymore because it doesn't sell or they're dead.

[-]

Equal_Giraffe8866@reddit

People will hate on you but who will have the courage to hate on all the dogshit artists? You're doing the right thing. Good luck.

[-]

shadowmage666@reddit

Lmao what is this 😂

[-]

1millionnotameme@reddit

Share some of it, I'm curious what you think is good music

[-]

entsnack@reddit (OP)

Limp Bizkit, you can find some albums at your pawn shop.

[-]

Foreign_Risk_2031@reddit

Mmm 16khz audio

[-]

entsnack@reddit (OP)

The base audio quality is low resolution. However, you can take the image of the waveform and upsample it using WAN or any AI upsampler, then convert it back into audio.

[-]

yes_i_tried_google@reddit

This is one of the few AI Projects out there that make me go “wow, I want this in my life”.

Now going to see if my 3090 ti can pull anything off like this. Please hit me up with your Plex deets while I go and make Zoombie by the loganberries

[-]

orinoco_w@reddit

I thought Kombi by the Vanberries was better

[-]

jebenasarma@reddit

Retardmaxing

[-]

Toastti@reddit

Gotta share one of the songs it made with us

[-]

seamonn@reddit

GePa prompt optimization

What is this? Where can I learn about this?

[-]

entsnack@reddit (OP)

https://gepa-ai.github.io/gepa/blog/2026/02/18/introducing-optimize-anything/

[-]

complexminded@reddit

Nice Schiit! A person of culture. Similar stack going on here

[-]

DAlmighty@reddit

I had to scroll too far to see the Schiit name. I need to upgrade my stack.

[-]

entsnack@reddit (OP)

I'm getting this piece of Schiit next week, bought it from the creator: https://www.reddit.com/r/Schiit/s/NhyBuA79of

[-]

learn_and_learn@reddit

r/music is either gonna hate this or they're gonna love this. No in betweens

[-]

the__storm@reddit

I would bet everything I own that they would hate it lol

[-]

learn_and_learn@reddit

I wanna see how it does in r/music and r/audiophile

[-]

DAlmighty@reddit

Oh Schiit

[-]

ArtfulGenie69@reddit

This is a fun idea that just about all of us could do. How good do you think that Ace-Step model is compared to the previous one? I had trained on the first Ace-Step and was getting very close to something actually good. Just haven't gotten back around to testing the set on the new model. It would be really cool to have the old school 90's trance and rock bands and have new stuff all the time. I also hate the newly produced music. All they do is rip off some of the techno trap simple 4/4 beat and then throw whatever on top. Fake country, fake folk, and rock is just dead. They do it to every genre now. Like that dumb simple beat and then some shit studio singing on top with lyrics that are generally trash.

Tried lora on Meat is murder as well as Slipknot (for my bro haha). It had way more trouble learning metal like slipknot than Morrissey.

[-]

jcdoe@reddit

Share some tracks?

[-]

elzZza@reddit

very cool thought to use llm for music generation like this

[-]

f5alcon@reddit

How long does your music gen take? I'm using a 5070ti and even with some cpu offload can do 4 versions of the same song in about real time, most of the time one version is acceptable and I move on to the next one. I did a whole album this way in an afternoon, and lyrics and style were done by Claude who created a complete narrative for the album so it tells a story.

Currently working on image to video with ltx 2.3 to create music videos for the songs.

[-]

Tartness3491@reddit

One of the most creative uses I've seen in here. Can you share some sample remixes?

[-]

lamardoss@reddit

This is something I do with Ace also. Love it! I also have a small 2B LLM name each rendered clip before it gets played and it goes into two folders, one for the queue and one for keeping. I keep the most recent 300 clips for each Ace stream I have going (different music genres) and I can then go back and listen to them whenever I need that GPU resource to do other things and need to temporarily pause generation or if I love a clip so much that I want to keep it. Really love that stack you have there!
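A rough sketch of the queue/keep layout described above, in case it helps anyone copy the idea. This is not the commenter's code: the helper names `add_clip` and `keep_clip`, the folder names, and the zero-padded file names are all assumptions, and the LLM naming step is left out. Only the 300-clip cap comes from the comment.

```python
import shutil
import tempfile
from pathlib import Path

MAX_CLIPS = 300  # per-stream cap mentioned in the comment above


def add_clip(queue_dir: Path, clip: Path, max_clips: int = MAX_CLIPS) -> None:
    """Move a freshly rendered clip into the queue, then delete the oldest extras."""
    queue_dir.mkdir(parents=True, exist_ok=True)
    shutil.move(str(clip), str(queue_dir / clip.name))
    # Assumption: file names sort chronologically (e.g. a zero-padded counter).
    clips = sorted(queue_dir.iterdir(), key=lambda p: p.name)
    for old in clips[:-max_clips]:
        old.unlink()


def keep_clip(queue_dir: Path, keep_dir: Path, name: str) -> Path:
    """Copy a clip worth saving into the 'keep' folder so pruning never touches it."""
    keep_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.copy2(queue_dir / name, keep_dir / name))


# Tiny demo with a cap of 3 instead of 300 and placeholder "audio" files.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for i in range(5):
        clip = root / f"clip_{i:04d}.wav"
        clip.write_bytes(b"\0")
        add_clip(root / "queue", clip, max_clips=3)
    kept = keep_clip(root / "queue", root / "keep", "clip_0004.wav").name
    remaining = sorted(p.name for p in (root / "queue").iterdir())
```

Running one queue folder per stream keeps the genres separate, and playback can continue from the queue while the GPU is busy with something else.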

[-]

BobLobLawsLawBlawg@reddit

This is the best use of AI ever.

[-]

entsnack@reddit (OP)

tokens straight into my brain

[-]

Finanzamt_Endgegner@reddit

You are insane, i love it 🤣

[-]

boston101@reddit

Op you are genius and ridiculous - I love it.

[-]

entsnack@reddit (OP)

inb4 mods see this 😆

[-]

SoupSuey@reddit

Well, I never thought of that to be honest. Maybe you should share some of the best music on SoundCloud, I’m particularly interested in hearing Shrimp Bizkit. Also, SoundCloud could be the community you’re looking for.

[-]

uti24@reddit

why would you need two DGX Spark for that? Even on a single one Ace-Step 1.5 XL already works probably twice as fast as realtime

[-]

entsnack@reddit (OP)

With 240GB of unified memory I can generate roughly 12 3-4 minute songs in parallel. A lot of these songs are crappy and need to be thrown away for the next evolutionary optimization step, and the Spark is slow, so I need the VRAM and parallelism to generate good music at a reasonable rate.

Once generation quality gets better, I'll have fewer bad generations and can probably repurpose the GPU for other things.
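The generate-many, throw-most-away loop described above can be sketched like this. It is only an illustration under loud assumptions: `generate_song` and `score` are hypothetical stand-ins (no real music model or rating is called), and the batch of 12 is taken from the "roughly 12 songs in parallel" figure. The actual setup and its optimizer are not shown in the thread.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def generate_song(seed: int) -> dict:
    """Hypothetical stand-in for a music model call; returns a fake candidate."""
    rng = random.Random(seed)
    return {"seed": seed, "quality": rng.random()}


def score(song: dict) -> float:
    """Hypothetical scorer; a real one might be a listener rating or a learned metric."""
    return song["quality"]


def generation_step(seeds, keep: int, workers: int = 12) -> list:
    """Generate candidates in parallel and keep only the top `keep` by score."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        candidates = list(pool.map(generate_song, seeds))
    # Everything below the cut is discarded before the next optimization step.
    return sorted(candidates, key=score, reverse=True)[:keep]


survivors = generation_step(range(12), keep=3)
```

The fewer candidates fail the cut, the smaller the batch needs to be, which is the point made above about freeing up the GPU once generation quality improves.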