Professor Simon Dixon, AMusA LMusA BSc(Hons) PhD

Professor of Computer Science
Email: s.e.dixon@qmul.ac.uk
Telephone: +44 20 7882 7681
Room Number: Engineering, Eng 406
Website: http://www.eecs.qmul.ac.uk/~simond/
Office Hours: Tuesday 16:00-17:00, Wednesday 15:00-16:00
Teaching
Music Informatics (Postgraduate/Undergraduate)
This module introduces students to state-of-the-art methods for the analysis of music data, with a focus on music audio. It presents in-depth studies of general approaches to the low-level analysis of audio signals, and follows these with specialised methods for the high-level analysis of music signals, including the extraction of information related to the rhythm, melody, harmony, form and instrumentation of recorded music. This is followed by an examination of the most important methods for extracting high-level musical content, separating sound sources, and analysing multimodal music data.
Research
Publications
-
Shanin I, Dixon S (2023). Annotating Jazz Recordings Using Lead Sheet Alignment with Deep Chroma Features. 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
-
Zhang H, Dixon S (2023). Disentangling the Horowitz Factor: Learning Content and Style From Expressive Piano Performance. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
-
Edwards D, Dixon S, Benetos E (2023). PiJAMA: Piano Jazz with Automatic MIDI Annotations.
DOI: 10.5334/tismir.162
-
Luo YJ, Ewert S, Dixon S (2022). Towards Robust Unsupervised Disentanglement of Sequential Data - A Case Study Using Music Audio. International Joint Conference on Artificial Intelligence
-
Proutskova P, Wolff D, Fazekas G et al. (2022). The Jazz Ontology: A semantic model and large-scale RDF repositories for jazz.
-
Agrawal R, Wolff D, Dixon S (2021). A Convolutional-Attentional Neural Framework for Structure-Aware Performance-Score Synchronization.
-
Vianna Lordelo C, Benetos E, Dixon S et al. (2021). Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes. 22nd International Society for Music Information Retrieval Conference (ISMIR)
-
Foster D, Dixon S (2021). Filosax: A Dataset of Annotated Jazz Saxophone Recordings. 22nd International Society for Music Information Retrieval Conference
-
O'Hanlon K, Benetos E, Dixon S (2021). Detecting cover songs with pitch class key-invariant networks. IEEE International Workshop on Machine Learning for Signal Processing (MLSP)
-
Demirel E, Ahlbäck S, Dixon S (2021). Computational Pronunciation Analysis in Sung Utterances. European Signal Processing Conference
-
Demirel E, Ahlbäck S, Dixon S (2021). Low resource audio-to-lyrics alignment from polyphonic music recordings. IEEE International Conference on Acoustics, Speech and Signal Processing
-
Agrawal R, Wolff D, Dixon S (2021). Structure-Aware Audio-to-Score Alignment Using Progressively Dilated Convolutional Neural Networks. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
-
Zhang Y, Xia G, Levy M et al. (2021). COSMIC: A Conversational Interface for Human-AI Music Co-Creation. New Interfaces for Musical Expression
-
Vianna Lordelo C, Benetos E, Dixon S et al. (2021). Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation.
-
O'Connor B, Dixon S, Fazekas G (2020). An Exploratory Study on Perceptual Spaces of the Singing Voice. The 2020 Joint Conference on AI Music Creativity
-
Agrawal R, Dixon S. Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment. 2020 28th European Signal Processing Conference
-
Mishra S, Benetos E, Sturm B et al. (2020). Reliable Local Explanations for Machine Listening. International Joint Conference on Neural Networks (IJCNN)
-
Demirel E, Ahlbäck S, Dixon S (2020). Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention.
-
Stoller D, Tian M, Ewert S et al. (2020). Seq-U-Net: A one-dimensional causal U-net for efficient sequence modelling.
-
Kudumakis P, Dixon S. DMRN+14: Digital Music Research Network Workshop Proceedings 2019. DMRN+14: Digital Music Research Network Workshop 2019
-
Frieler K, Basaran D, Höger F et al. (2019). Don't hide in the frames: Note-and pattern-based evaluation of automated melody extraction algorithms.
-
Vianna Lordelo C, Benetos E, Dixon S et al. (2019). Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
-
Mehrabi A, Dixon S, Sandler M (2019). Erratum: Vocal imitation of percussion sounds: On the perceptual similarity between imitations and imitated sounds (PLOS ONE (2019)14:8 (e0221722) DOI:10.1371/journal.pone.0219955).
-
Rodríguez-Algarra F, Sturm BL, Dixon S (2019). Characterising Confounding Effects in Music Classification Experiments through Interventions.
DOI: 10.5334/tismir.24
-
Dai J, Dixon S (2019). Intonation trajectories within tones in unaccompanied soprano, alto, tenor, bass quartet singing.
DOI: 10.1121/1.5120483
-
Mehrabi A, Dixon S, Sandler M (2019). Vocal imitation of percussion sounds: On the perceptual similarity between imitations and imitated sounds.
-
Agrawal R, Dixon S (2019). A Hybrid Approach to Audio-to-Score Alignment. Machine Learning for Music Discovery Workshop at International Conference on Machine Learning (ICML)
-
Mishra S, Stoller D, Benetos E et al. (2019). GAN-based Generation and Automatic Selection of Explanations for Neural Networks. SafeML ICLR 2019 Workshop
-
Nakamura E, Nishikimi R, Dixon S et al. (2019). Probabilistic Sequential Patterns for Singing Transcription. 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference
-
Dai J, Dixon S (2019). Singing together: Pitch accuracy and interaction in unaccompanied unison and duet singing.
DOI: 10.1121/1.5087817
-
Benetos E, Dixon S, Duan Z et al. (2019). Automatic Music Transcription: An Overview.
-
Dai J, Dixon S (2019). Understanding intonation trajectories and patterns of vocal notes.
-
Kudumakis P, Dixon S (2018). DMRN+13: Digital Music Research Network Workshop Proceedings 2018. DMRN+13: Digital Music Research Network Workshop 2018
-
Mishra S, Sturm BL, Dixon S (2018). “What are you listening to?” Explaining predictions of deep machine listening systems.
-
Stoller D, Akkermans V, Dixon S (2018). Detection of cut-points for automatic music rearrangement. IEEE International Workshop on Machine Learning for Signal Processing, MLSP
-
Li S, Dixon S, Plumbley MD (2018). A Demonstration of Hierarchical Structure Usage in Expressive Timing Analysis by Model Selection Tests.
-
Stoller D, Ewert S, Dixon S (2018). Adversarial Semi-Supervised Audio Source Separation Applied to Singing Voice Extraction. IEEE International Conference on Acoustics, Speech and Signal Processing
-
Mehrabi A, Choi K, Dixon S et al. (2018). Similarity Measures for Vocal-Based Drum Sample Retrieval Using Deep Convolutional Auto-Encoders. ICASSP 2018
-
Stoller D, Ewert S, Dixon S (2018). Jointly detecting and separating singing voice: a multi-task approach. 14th International Conference on Latent Variable Analysis and Signal Separation
-
Nakamura E, BENETOS E, Yoshii K et al. (2018). Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization. IEEE International Conference on Acoustics, Speech and Signal Processing
-
Weiß C, Mauch M, Dixon S et al. (2019). Investigating style evolution of Western classical music: A computational approach.
-
Panteli M, Benetos E, Dixon S (2018). A review of manual and computational approaches for the study of world music corpora.
-
Dixon S, Gómez E, Volk A (2018). Editorial: Introducing the Transactions of the International Society for Music Information Retrieval.
DOI: 10.5334/tismir.22
-
Stoller D, Ewert S, Dixon S (2018). Wave-U-Net: A multi-scale neural network for end-to-end audio source separation.
-
Panteli M, Benetos E, Dixon S (2017). A computational study on outliers in world music.
-
Mohamad Z, Dixon S, Harte C (2017). Pickup position and plucking point estimation on an electric guitar via autocorrelation.
DOI: 10.1121/1.5016815
-
Di Giorgi B, Dixon S, Zanoni M et al. (2017). A data-driven model of tonal chord sequence complexity.
-
Wang S, Ewert S, Dixon S (2017). Identifying Missing and Extra Notes in Piano Recordings Using Score-Informed Dictionary Learning.
-
Nakamura E, Yoshii K, Dixon S (2017). Note Value Recognition for Piano Transcription Using Markov Random Fields.
-
Quinton E, Dixon S, Sandler M et al. (2017). Tracking Metrical Structure Changes with Sparse-NMF. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing
-
Mohamad Z, Dixon S, Harte C (2017). Pickup position and plucking point estimation on an electric guitar. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
-
Panteli M, Bittner R, Bello JP et al. (2017). Towards the characterization of singing styles in world music. IEEE International Conference on Acoustics, Speech and Signal Processing
-
Mehrabi A, Harte C, Baume C et al. (2017). Music thumbnailing for radio podcasts: A listener evaluation.
-
Mehrabi A, Dixon S, Sandler M (2017). Vocal imitation of synthesised sounds varying in pitch, loudness and spectral centroid.
DOI: 10.1121/1.4974825
-
Sandler MB, Quinton E, O'Hanlon K et al. (2016). Automatic Detection of Metrical Structure Changes. DMRN+11: Digital Music Research Network One-day Workshop 2016
-
Mishra S, Sturm B, Dixon S (2016). Explaining Predictions of Machine Listening Systems. DMRN+11: Digital Music Research Network One-day Workshop 2016
-
Wang S, Ewert S, Dixon S (2016). Robust and efficient joint alignment of multiple musical performances.
-
Mehrabi A, Dixon S, Sandler M (2016). Towards a comprehensive dataset of vocal imitations of drum sounds. 2nd AES Workshop on Intelligent Music Production
-
Cheng T, Mauch M, Benetos E et al. (2016). An attack/decay model for piano transcription. 17th International Society for Music Information Retrieval Conference
-
Panteli M, Benetos E, Dixon S (2016). Learning a feature space for similarity in world music. 17th International Society for Music Information Retrieval Conference
-
Panteli M, Benetos E, Dixon S (2016). Automatic detection of outliers in world music collections. Fourth International Conference on Analytical Approaches to World Music (AAWM 2016)
-
Quinton E, Sandler M, Dixon S (2016). Estimation of the Reliability of Multiple Rhythm Features Extraction from a Single Descriptor.
-
Sigtia S, Benetos E, Dixon S (2016). An End-to-End Neural Network for Polyphonic Piano Music Transcription.
-
Stoller D, Dixon S (2016). Analysis and classification of phonation modes in singing.
-
Panteli M, Dixon S (2016). On the evaluation of rhythmic and melodic descriptors for music similarity.
-
Song Y, Dixon S, Pearce MT et al. (2016). Perceived and Induced Emotion Responses to Popular Music: Categorical and Dimensional Models.
-
Cheng T, Dixon S, Mauch M (2015). Improving piano note tracking by HMM smoothing.
-
Mohamad Z, Dixon S, Harte C (2015). Digitally Moving an Electric Guitar Pickup. International Conference on Digital Audio Effects (DAFx-15)
-
Mehrabi A, Dixon S, Sandler M (2015). Vocal Imitation of Pitch, Spectral Shape and Loudness Envelopes. International Society for Music Information Retrieval Conference
-
Foster P, Dixon S, Klapuri A (2015). Identifying Cover Songs Using Information-Theoretic Measures of Similarity.
-
Sigtia S, Benetos E, Boulanger-Lewandowski N et al. (2015). A Hybrid Recurrent Neural Network for Music Transcription. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
-
Tubb R, Dixon S (2015). An Evaluation of Multidimensional Controllers for Sound Design Tasks.
-
Wilmering T, Fazekas G, Dixon S et al. (2015). Automating Annotation of Media with Linked Data Workflows.
-
Wang S, Ewert S, Dixon S (2015). Compensating for Asynchronies between Musical Voices in Score-Performance Alignment.
-
Mauch M, Cannam C, Bittner R et al. (2015). Computer-aided Melody Note Transcription Using the Tony Software: Accuracy and Efficiency.
-
Cheng T, Dixon S, Mauch M (2015). Modelling the Decay of Piano Sounds. IEEE International Conference on Acoustics Speech and Signal Processing
-
Tidhar D, Dixon S, Benetos E et al. (2014). The temperament police.
DOI: 10.1093/em/cau101
-
Weyde T, Cottrell S, Dykes J et al. (2014). Big Data for Musicology. 1st International Digital Libraries for Musicology workshop
-
Mauch M, Frieler K, Dixon S (2014). Intonation in unaccompanied singing: accuracy, drift, and a model of reference pitch memory.
DOI: 10.1121/1.4881915
-
Cheng T, Dixon S, Mauch M (2014). A Comparison of Extended Source-Filter Models for Musical Signal Reconstruction.
-
Tubb R, Dixon S (2014). A Zoomable Mapping of a Musical Parameter Space Using Hilbert Curves.
DOI: 10.1162/COMJ_a_00254
-
Weyde T, Cottrell S, Benetos E et al. (2014). Digital Music Lab - A Framework for Analysing Big Music Data. European Conference on Data Analysis (ECDA)
-
Thompson L, Mauch M, Dixon S (2014). Drum Transcription via Classification of Bar-level Rhythmic Patterns. 15th International Society for Music Information Retrieval Conference
-
Sigtia S, Dixon S (2014). Improved music feature learning with deep neural networks.
-
Stowell D, Dixon S (2014). Integration of informal music technologies in secondary school music lessons.
-
Sigtia S, Benetos E, Cherla S et al. (2014). RNN-based Music Language Models for Improving Automatic Music Transcription.
-
Wang S, Ewert S, Dixon S (2014). Robust Joint Alignment of Multiple Versions of a Piece of Music.
-
Foster P, Mauch M, Dixon S (2014). Sequential Complexity as a Descriptor for Musical Similarity.
-
Weiss C, Mauch M, Dixon S (2014). Timbre-invariant Audio Features for Style Analysis of Classical Music.
-
Kirchhoff H, Badeau R, Dixon S (2014). Towards complex matrix decomposition of spectrograms based on the relative phase offsets of harmonic sounds.
-
Mauch M, Dixon S (2014). pYIN: a Fundamental Frequency Estimator Using Probabilistic Threshold Distributions.
-
Benetos E, Dixon S, Giannoulis D et al. (2013). Automatic music transcription: Challenges and future directions.
-
Foster P, Dixon S, Klapuri A (2013). Identification of cover songs using information theoretic measures of similarity.
-
Kirchhoff H, Dixon S, Klapuri A (2013). Missing template estimation for user-assisted music transcription.
-
Chudy M, Dixon S (2013). Recognising Cello Performers Using Timbre Models.
-
Chudy M, Carrillo AP, Dixon S (2013). On the relation between gesture, tone production and perception in classical cello performance.
DOI: 10.1121/1.4801077
-
Chudy M, Pérez Carrillo A, Dixon S (2013). On the relation between gesture, tone production, and perception in classical cello performance.
DOI: 10.1121/1.4805316
-
Benetos E, Dixon S (2013). Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model.
DOI: 10.1121/1.4790351
-
Cheng T, Dixon S, Mauch M (2013). A Deterministic Annealing EM Algorithm for Automatic Music Transcription.
-
Allik A, Fazekas G, Dixon S et al. (2013). A Shared Vocabulary for Audio Features.
-
Allik A, Fazekas G, Dixon S et al. (2013). Facilitating Music Information Research with Shared Open Vocabularies.
-
Chudy M, Dixon S (2013). Recognising cello performers using timbre models.
-
Benetos E, Dixon S (2012). Temporally-constrained convolutive probabilistic latent component analysis for multi-pitch detection.
-
Benetos E, Dixon S (2012). A Shift-Invariant Latent Variable Model for Automatic Music Transcription.
DOI: 10.1162/COMJ_a_00146
-
Dixon S, Mauch M, Tidhar D (2012). Estimation of harpsichord inharmonicity and temperament from musical recordings.
DOI: 10.1121/1.3651238
-
Kirchhoff H, Dixon S, Klapuri A (2012). Shift-variant non-negative matrix deconvolution for music transcription. IEEE International Conference on Acoustics, Speech, and Signal Processing
-
Benetos E, Dixon S (2011). Joint multi-pitch detection using harmonic envelope estimation for polyphonic music transcription.
-
Benetos E, Dixon S (2011). A Temporally-Constrained Convolutive Probabilistic Model for Pitch Detection.
-
Benetos E, Dixon S (2011). Polyphonic music transcription using note onset and offset detection.
-
Dixon S, Mauch M, Anglade A (2011). Probabilistic and Logic-Based Modelling of Harmony.
-
Macrae R, Neumann J, Anguera X et al. (2011). Real-Time Synchronisation of Multimedia Streams in a Mobile Device.
-
Tidhar D, Fazekas G, Mauch M et al. (2011). Tempest - harpsichord temperament estimation in a Semantic Web environment.
-
Macrae R, Dixon S (2010). A Guitar Tablature Score Follower. IEEE International Conference on Multimedia & Expo
-
Mearns L, Dixon S (2010). Characterisation of Composer Style Using High Level Musical Features.
-
Tidhar D, Mauch M, Dixon S (2010). High Precision Frequency Estimation for Harpsichord Tuning Classification. Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
-
Anglade A, Benetos E, Mauch M et al. (2010). Improving music genre classification using automatically induced harmony rules.
-
Mauch M, Dixon S (2010). Simultaneous Estimation of Chords and Musical Context from Audio.
-
Dixon S, Sandler M, d'Inverno M et al. (2010). Towards a Distributed Research Environment for Music Informatics and Computational Musicology.
-
Khosrow-Pour M, Dixon S (2009). Audio Analysis Applications for Music.
-
Arzt A, Widmer G, Dixon S (2008). Automatic Page Turning for Musicians via Real-Time Machine Listening. European Conference on Artificial Intelligence
-
Dixon S (2007). Evaluation of the audio beat tracking system BeatRoot.
-
Mauch M, Dixon S, Harte C et al. (2007). Discovering chord idioms through Beatles and Real Book songs.
DOI: 10.1111/josi.12045
-
Gouyon F, Dixon S, Widmer G (2007). Evaluating low-level features for beat classification and tracking.
-
Gouyon F, Klapuri A, Dixon S et al. (2006). An experimental comparison of audio tempo induction algorithms.
-
Dixon S, Goebl W, Cambouropoulos E (2006). Perceptual smoothness of tempo in expressively performed music.
-
Gouyon F, Dixon S (2005). A review of automatic rhythm description systems.
-
Pampalk E, Dixon S, Widmer G (2004). Exploring music collections by browsing different views.
-
Widmer G, Dixon S, Goebl W et al. (2003). In search of the Horowitz factor.
-
Dixon S (2003). On the analysis of musical expression in audio signals.
DOI: 10.1117/12.476314
-
Dixon SE, Goebl W, Widmer G (2002). Real-time Tracking and Visualisation of Musical Expression. Proceedings of the 2nd International Conference on Music and Artificial Intelligence (ICMAI'02), Edinburgh, Scotland, Springer, Berlin
-
Dixon SE, Goebl W, Widmer G (2002). The Performance Worm: Real Time Visualisation of Expression Based on Langer's Tempo-Loudness Animation. Proceedings of the 2002 International Computer Music Conference (ICMC'2002), Gothenburg, Sweden, edited by M Nordahl (International Computer Music Association, San Francisco)
-
Goebl W, Dixon SE (2001). Analysis of Tempo Classes in Performances of Mozart Sonatas. Proceedings of the 7th International Symposium on Systematic and Comparative Musicology and the 3rd International Conference on Cognitive Musicology, Jyvaeskylae, Finland, 16-19 August 2001
-
Dixon S (2001). Automatic extraction of tempo and beat from expressive performances.
-
Dixon S (2000). A Lightweight Multi-agent Musical Beat Tracking System.