Binaural Audio Rendering

While it is an essence for the binaural audio technology to render the spatial perception to the audio, I wanted to render it in a more efficient manner. While there are many binaural audio rendering models, I’ve been working to reduce in the number of parameters and operations showing on-par performance to the baseline models in terms of perceptual spatiality. At the end of the day, what’s important for binaural technology is efficient enough performance to enable real-time behavior. Similar to learning an acoustic field, I designed the neural net to adapt its magnitude and phase response based on the source’s location and orientation. Here are some binaural samples rendered by my model.

Visualization of the proposed idea

Bill Evans & Jim Hall - I Hear a Rhapsody

BTS - Dynamite

Jacob Collier - You and I

For the third one, a music by Jacob Collier was sampled and re-rendered in binaural version. I remember the 12-note vocal harmonizer that Jacob showed at his performance at MIT, which became a motivation for me to work on binaural version of his music.