This is interesting. My first thought was to wonder how a LSTM would do. Once mi...

Matumio · on Aug 2, 2017

An architecture like WaveNet could also be interesting here: https://deepmind.com/blog/wavenet-generative-model-raw-audio... (HN thread: https://news.ycombinator.com/item?id=12455510)

cwingrav · on Aug 2, 2017

I could be totally off on this, but his encoding is an image and LSTM is for time series, which would require a different representation.

I completely agree LSTM would be useful as it would by default require a different representation. I think most commenters agree this representation is overly simplistic. Amazed it works as well as it does!