RESEARCH
User-Centric AI-Based Multitrack Music Mixing Informed by Expert Practice
Music Mixing is a multi-faceted task that is technical, creative, and collaborative in nature. Once the audio tracks are recorded, these are handed over to a mixing engineer to create a cohesive mix that is well-balanced and representative of the artist's vision and identity. The task involves collaborative aspects like talking to the artist to understand their vision for the mix and exchanging demo mixes and reference songs, followed by engineering aspects where the engineer makes technical decisions to fix frequency masking, establish balance and panorama, and more. This is followed by more creative tasks that involve further transformations of audio tracks to get the desired sound. As a part of our PhD research, we have begun by understanding the intricacies of the process beyond the engineering aspect, so that we can build tools that are inspired and informed by the professional practice. To achieve this, we have so far conducted interviews and studies with professional and pro-am mixing engineers with significant experience and advanced skills in mixing to understand the workflow of mixing from the collaborative perspective. Our studies have shown that mixing engineers are more inclined towards technology that encourages assistance, collaboration, and easy integration into current workflows. We also found that mix engineers use reference songs and demo mixes as some of the directors to get a better understanding of the client's goals. Further, we are working on a deep learning-based model that uses the available expert knowledge about the engineering aspects of mixing informed by the collaborative standpoint to build a system that takes in stems and reference songs to predict parameters for the various audio effects in the mixing console for stems based on the mixing style of the reference song. This could help mix engineers create boilerplate mixes that they can further fine-tune to create the desired mix.
PUBLICATIONS
Conference
Adoption of AI technology in music mixing workflows: An investigation
Soumya Sai Vanka, Maryam Safi, Jean-Baptiste Rolland, George Fazekas
AES Europe, May 2023
Journal
Soumya Sai Vanka, Maryam Safi, Jean-Baptiste Rolland, George Fazekas
Journal of Audio Engineering Society, Nov 2023
Book
Deep learning for automatic mixing
Christian Steinmetz, Soumya Sai Vanka, Marco Martinez, Gary Bromham
ISMIR, Dec 2022
Workshop
Deep learning for automatic mixing
Christian Steinmetz, Soumya Sai Vanka, Marco Martinez, Gary Bromham
ISMIR, Dec 2022
Soumya Sai Vanka, Christian Steinmetz, Marco Martinez, Gary Bromham, Junghyun Koo, Brecht DeMan, Angeliki Mourgella
AES Convention NYC, Oct 2023
Presentations
Music Production Style Transfer and Mix Similarity
Soumya Sai Vanka
DMRN, Dec 2021
Adoption of AI technology in music mixing workflows: An investigation
Soumya Sai Vanka
Harmon.ai Talks, May 2023
Designing an AI Multitrack Mixing System
Soumya Sai Vanka
Sound Recording and Production Techniques Module (Queen Mary, London), Dec 2023
Previous Work: M.Sc. and B.Sc. (Physics)
I had the opportunity to conduct a few experiments in the field of sound and vibration during my undergraduate years when I was pursuing a Bachelor's in Physics [Hons.] Another time, I worked as a researcher (intern) at the Cryogenics department of IIT Kharagpur during the summer of 2019, where we devised a sensor to measure the Liq. Nitrogen levels in cylinders using superconductors. Click on the files to view the documentation of the experiments.
Development of an inductive based level sensor for LN2 level measurement in a tank
(May- June 2019, IIT Kharagpur, West Bengal, India)
A low-cost copper wire wound YBCO single crystal-based level sensor was devised and its performance was evaluated.
Fourier Analysis for woodwind instruments
(Oct 2017- April 2018, SSSIHL, Anantapur, Andhra Pradesh, India)
In this project, we analyzed the frequency spectrum of woodwinds, namely saxophone, flute, and clarinet to understand why they have distinctive timbre. We also studied the vibration modes of a clarinet in three different registers.
Study of the forced vibrations of a circular membrane
(Oct 2017- April 2018, SSSIHL, Anantapur, Andhra Pradesh, India)
Evidence of different modes of vibrations of the circular membrane with a change in frequency was explored and compared with the theoretical results given by Bessel’s function.