Inspiration

We were inspired by deepart.io, an art generator that creates new art from two input images, producing a unique piece in a style you choose. Our focus on music came from the fact that we love music (one of us opens Spotify almost every two minutes). We thought that if this works for art, it could be applied to music, and this project is the fruit of our labor.

What it does

It uses machine learning to synthesize audio remixes of one song in the style of another. The algorithm takes in two audio files and generates an output audio file: the first song rendered in the musical style of the second. It can generate seamless remixes for listeners to enjoy and for music artists to build on.

How we built it

The machine learning core is a 19-layer deep convolutional neural network (VGG-19, from the Visual Geometry Group). The algorithm works the same way as deepart.io, following the scientific paper it is based on. Deep convolutional neural networks work fantastically on images, so we converted the audio to spectrograms, a representation the machine learning community has found very effective for training audio models. The algorithm converts the two audio inputs to spectrogram images, feeds them through the convolutional neural network, and converts the output (which the network is trained to produce as a spectrogram) back to audio.
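The audio-to-spectrogram step can be sketched in plain numpy (a minimal illustration, not our actual code; the function name and FFT parameters here are our own choices):

```python
import numpy as np

def stft_mag(signal, n_fft=1024, hop=256):
    """Magnitude spectrogram via a hand-rolled short-time Fourier transform.

    Each windowed frame of the signal becomes one column of the
    spectrogram 'image' that the network consumes.
    """
    window = np.hanning(n_fft)
    frames = []
    for start in range(0, len(signal) - n_fft + 1, hop):
        frame = signal[start:start + n_fft] * window
        frames.append(np.abs(np.fft.rfft(frame)))
    return np.array(frames).T  # shape: (n_fft // 2 + 1, n_frames)

# Sanity check: a 440 Hz sine at 22050 Hz should concentrate its energy
# in one frequency bin (bin spacing = sr / n_fft ~ 21.5 Hz, so near bin 20).
sr = 22050
t = np.arange(sr) / sr
spec = stft_mag(np.sin(2 * np.pi * 440 * t))
peak_bin = int(spec.mean(axis=1).argmax())
```

In practice a library such as librosa handles the loading and STFT, but the idea is the same: magnitude over time becomes a 2-D array the network can treat like an image.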

Challenges we ran into

  1. Training multiple neural networks in such a short span of time.
  2. Converting a spectrogram image back into an audio file in .wav format.
  3. Regulating the volume of the result, for reasons we still do not fully understand.
  4. Properly formatting the website in an aesthetic manner.
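Challenge 2 is hard because a magnitude spectrogram discards phase, so there is no direct inverse transform. A standard remedy is Griffin-Lim phase estimation; the sketch below (numpy only; function names and parameters are ours, not the hackathon code) iteratively re-estimates the missing phase:

```python
import numpy as np

def stft(x, n_fft=1024, hop=256):
    """Complex short-time Fourier transform with a Hann window."""
    w = np.hanning(n_fft)
    n = 1 + (len(x) - n_fft) // hop
    return np.stack(
        [np.fft.rfft(w * x[i * hop:i * hop + n_fft]) for i in range(n)], axis=1
    )

def istft(S, n_fft=1024, hop=256):
    """Inverse STFT via windowed overlap-add."""
    w = np.hanning(n_fft)
    n = S.shape[1]
    x = np.zeros(n_fft + hop * (n - 1))
    norm = np.zeros_like(x)
    for i in range(n):
        frame = np.fft.irfft(S[:, i], n_fft)
        x[i * hop:i * hop + n_fft] += frame * w
        norm[i * hop:i * hop + n_fft] += w ** 2
    return x / np.maximum(norm, 1e-8)

def griffin_lim(mag, n_iter=32, n_fft=1024, hop=256):
    """Recover a waveform from a magnitude-only spectrogram.

    Start from random phase, then alternate between the time domain and
    the frequency domain, keeping the known magnitudes and the latest
    phase estimate each round.
    """
    rng = np.random.default_rng(0)
    phase = np.exp(2j * np.pi * rng.random(mag.shape))
    for _ in range(n_iter):
        x = istft(mag * phase, n_fft, hop)
        phase = np.exp(1j * np.angle(stft(x, n_fft, hop)))
    return istft(mag * phase, n_fft, hop)
```

librosa ships a production-quality version of this as `librosa.griffinlim`, which is the sensible choice outside of a sketch.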

Accomplishments that we're proud of

  1. Creating a functional machine learning algorithm in 24 hours.
  2. Managing to modify a pre-trained neural network.
  3. Training it down to 7.5% error within a 24-hour period.
  4. Creating a nice-looking website displaying our data.

What we learned

  1. Audio machine learning requires a lot of learning and hard work.
  2. Never listen directly to the audio from a first-attempt spectrogram conversion.
  3. How to create a visually appealing website.
  4. How to approach audio machine learning using spectrograms.

What's next for deepmusic

  1. Wire the algorithm into our static website so the trained network can run feed-forward on user input.
  2. Move from a static website to a dynamic website on a proper server for users.
  3. Rewrite the algorithm in a lower-level language for better optimization and faster response to user requests.
  4. Continue training the algorithm to drive the error rate down further.
  5. Bring the application to market: charging for API calls, offering a monthly subscription for aspiring music artists to create remixes, and opening it to investment, since the implementation is unique enough to adapt to other problems.
