Skip to content

MTG/RISE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RISE - Raga Independent Svara Encoder

Self-supervised learning for raga independent svara representation, primarily aimed at Carnatic music transcription and related tasks such as performance analysis and melodic pattern recognition. This codebase accompanies the DLFM 2026 submission:

A Raga Independent Encoder for Svara Representation in Carnatic Music - Vivek Vijayan, Thomas Nuttall, Xavier Serra

Setup

git clone https://github.com/vivekvjyn/svara-representation.git
cd svara-representation
pip install -r requirements.txt
pip install -e .

Run experiments

  1. Prepare dataset:
./scripts/pitch.sh

We extract pitch contours from both the Carnatic Music Rhythm (CMR) and Carnatic Varnam datasets. For the CMR dataset, we sample plausible svara candidates from the pitch contours using the beat annotations, while for the Carnatic Varnam dataset, we use the provided svara annotations.

  1. Pre-train the model on Carnatic Music Rhythm (CMR) dataset:
./scripts/ssl.sh

We pretrain an InceptionTime encoder using the InfoNCE loss on unannotated pitch contours from the CMR dataset. Positive pairs are created by applying data augmentations such as time warping and pitch drifting.

simclr

  1. Fine-tune the pretrained model on annotated Carnatic Varnam dataset using LoRA and report F1 score:
./scripts/svaras.sh

We finetune the pretrained model on annotated data using cross-entropy loss for svara classification. Low-rank adaptation (LoRA) is used for efficient fine-tuning. we report F1 score for baseline and fine-tuned models.

simclr

  1. Cluster svara embeddings on the Carnatic Varnam dataset using HDBSCAN and report Normalized Mutual Information (NMI):
./scripts/gamakas.sh

We evaluate the learned representations by clustering svara embeddings to identify distinct svara forms (gamaka realizations). HDBSCAN is applied independently for each svara, and the resulting clusters are compared against expert-provided svara-form annotations using the Normalized Mutual Information (NMI) score.

About

Accompanying code for accepted submission at DLFM 2026 - A Raga Independent Encoder for Svara Representation in Carnatic Music

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors