More

seth_ · on May 1, 2024

Riffusion - Generative AI for Music | Research Scientist, Research Engineer | San Francisco | Full-time Riffusion is a small team training foundation models for music generation and building products that create more musicians in the world. We strive to create and deploy models that are expressive, fast, controllable, and inspiring at scale.

We’re establishing our founding research team and looking for individuals who love music and are excited to build a more creative future with us. Experience with large scale generative model training and diffusion architectures is preferred. Very strong software engineering and computer science fundamentals required. We’re backed by top investors and have substantial compute at the ready.

You can make music with us https://riffusion.com and reach me at { seth at riffusion dot com }.

seth_ · on April 1, 2024

Riffusion - Generative AI for Music | Research Scientist, Research Engineer | San Francisco | Full-time

Riffusion is a small team training foundation models for music generation and building products that create more musicians in the world. We strive to create and deploy models that are expressive, fast, controllable, and inspiring at scale.

We’re establishing our founding research team and looking for individuals who love music and are excited to build a more creative future with us. Experience with large scale generative model training and diffusion architectures is preferred. Very strong software engineering and computer science fundamentals required. We’re backed by top investors and have substantial compute at the ready.

You can make music with us https://riffusion.com and reach me at { seth at riffusion dot com }.

seth_ · on Nov 20, 2023

love the deep dive here

seth_ · on Oct 17, 2023

Those are awesome! Sorry if the app is down for you or anyone else right now btw. Our servers are a little overloaded but we'll be back up in no time

seth_ · on Oct 17, 2023

We would have an iOS app if we were better mobile engineers... hopefully someday we'll make one!

Alifatisk · on Oct 17, 2023

I highly suggest Flutter if you are open for ideas!

seth_ · on Oct 17, 2023

Nice! It is best at pronouncing in English, but we've had a bunch of fun trying to get other languages too. Sometimes you can make things happen phonetically.

Even for english words that it doesn't get right the first time haha

rhythmofrest · on Oct 17, 2023

It produces this beautiful nonsense in Japanese also, and even correctly extracted the topic from the Japanese lyrics : https://www.riffusion.com/riffs/71fa41f3-1488-4e9e-8f99-5f81...

seth_ · on Oct 17, 2023

Very cool. The latent space is a wild place.

seth_ · on Oct 17, 2023

We're happy to be building a toy!

Your comment reminds me of this post: https://cdixon.org/2010/01/03/the-next-big-thing-will-start-...

It's still really early innings for this technology, so we're happy to be learning and building fun technology helps people to do creative things. Main thing we're focused on is turning it from a toy you enjoy once, to something you come back to and dive deeper into (even if it's still just for fun).

earthnail · on Oct 18, 2023

I agree, but my question is: what are specific companies that pulled this off well that you consider as role models?

Not meaning to put you in a corner, just really curious about this - I love Riffusion and hope you guys succeed.

seth_ · on Oct 17, 2023

This one! Was a wild day for us :)

https://news.ycombinator.com/item?id=33999162

al_be_back · on Oct 17, 2023

seth_ · on May 22, 2023

Would be neat to see the distribution of time that people stare at this... I imagine some will stare for hours

ck019 · on May 23, 2023

I've definitely fallen into that trap before lol

seth_ · on Dec 15, 2022

Author here: fwiw we are running the app on a10g GPUs, which generally can turn around a 512x512 in 3.5s with 50 inference steps. This time includes converting the image into audio which should be done on the GPU as well for real-time purposes. We did some optimization such as a traced unet, fp16 and removing autocast. There are lots of ways it could be sped up further I'm sure!