seth_'s comments

Riffusion - Generative AI for Music | Research Scientist, Research Engineer | San Francisco | Full-time

Riffusion is a small team training foundation models for music generation and building products that create more musicians in the world. We strive to create and deploy models that are expressive, fast, controllable, and inspiring at scale.

We’re establishing our founding research team and looking for individuals who love music and are excited to build a more creative future with us. Experience with large-scale generative model training and diffusion architectures is preferred. Very strong software engineering and computer science fundamentals are required. We’re backed by top investors and have substantial compute at the ready.

You can make music with us at https://riffusion.com and reach me at { seth at riffusion dot com }.


love the deep dive here


Those are awesome! Sorry if the app is down for you or anyone else right now btw. Our servers are a little overloaded but we'll be back up in no time


We would have an iOS app if we were better mobile engineers... hopefully someday we'll make one!


I highly suggest Flutter if you are open to ideas!


Nice! It is best at pronouncing in English, but we've had a bunch of fun trying to get other languages too. Sometimes you can make things happen phonetically.

Even for English words that it doesn't get right the first time haha


It produces this beautiful nonsense in Japanese too, and it even correctly extracted the topic from the Japanese lyrics: https://www.riffusion.com/riffs/71fa41f3-1488-4e9e-8f99-5f81...


Very cool. The latent space is a wild place.


We're happy to be building a toy!

Your comment reminds me of this post: https://cdixon.org/2010/01/03/the-next-big-thing-will-start-...

It's still really early innings for this technology, so we're happy to be learning and building fun technology that helps people do creative things. The main thing we're focused on is turning it from a toy you enjoy once into something you come back to and dive deeper into (even if it's still just for fun).


I agree, but my question is: what are specific companies that pulled this off well that you consider as role models?

Not meaning to put you in a corner, just really curious about this - I love Riffusion and hope you guys succeed.


This one! Was a wild day for us :)

https://news.ycombinator.com/item?id=33999162


nice


Would be neat to see the distribution of time that people stare at this... I imagine some will stare for hours


I've definitely fallen into that trap before lol


Author here: fwiw we are running the app on A10G GPUs, which can generally turn around a 512x512 image in 3.5s with 50 inference steps. This time includes converting the image into audio, which should be done on the GPU as well for real-time purposes. We did some optimization, such as a traced UNet, fp16, and removing autocast. There are lots of ways it could be sped up further, I'm sure!
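
For anyone who wants to play with those numbers, here is a rough back-of-the-envelope sketch, not the app's actual code. The 3.5s total and the 50 steps come from the comment above; `clip_seconds` (how much audio one image decodes to) is an assumption made up for illustration, and the class and method names are hypothetical too.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LatencyBudget:
    total_seconds: float   # end-to-end time per image (3.5 in the comment)
    inference_steps: int   # denoising steps per image (50 in the comment)
    clip_seconds: float    # ASSUMPTION: audio duration one image decodes to

    def seconds_per_step(self) -> float:
        # Upper bound per step: the total also covers image-to-audio conversion.
        return self.total_seconds / self.inference_steps

    def keeps_up_with_playback(self) -> bool:
        # Real-time means the next clip is ready before the current one ends.
        return self.total_seconds <= self.clip_seconds

    def max_steps_for_realtime(self) -> int:
        # Largest step count that still fits inside one clip, assuming the
        # cost scales linearly with the number of steps.
        return int(self.clip_seconds * self.inference_steps / self.total_seconds)


# clip_seconds=5.0 is a placeholder value, not a figure from the comment.
budget = LatencyBudget(total_seconds=3.5, inference_steps=50, clip_seconds=5.0)
print(f"per step: {budget.seconds_per_step():.3f}s")
print(f"keeps up with playback: {budget.keeps_up_with_playback()}")
print(f"max steps for real time: {budget.max_steps_for_realtime()}")
```

The linear-scaling assumption is a simplification: fixed costs such as the audio conversion do not shrink with fewer steps, so treat the step budget as a rough ceiling.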

