Benchmarking symbolic music features

This is the code used for benchmarking different feature sets, including musif. Please cite us:

Simonetta F., Llorens A., Serrano M., García-Portugués E., Torrente A., "Optimizing Feature Extraction for Symbolic Music", ISMIR 2023.

Dependencies

Python 3.10 (e.g. via conda or pyenv)
pdm, for which you have three options:
- pipx install pdm (need pipx, recommended)
- pip install pdm (environment specific)
- see https://pdm.fming.dev/latest/ for other alternatives
Run pdm sync to create the environment and install the Python packages
As an alternative to pdm, see cluster.md for a bare venv approach
MuseScore: download the AppImage (4.0.1 has a bug; use 3.6.2 instead)
Java: install using your OS package manager and check that the java command is available in the PATH
jSymbolic 2.2: download and unzip
GCC and make: install using your OS package manager
humdrum:
1. git submodule update
2. cd humdrum-tools
3. make update
4. make

In symbolic_features/settings.py set the paths to MuseScore and jSymbolic executables.
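
Before running anything, it can help to confirm that the command-line dependencies are reachable. The following is a minimal sketch, not part of this repository: the function name and the list of commands are assumptions taken from the dependency list above (java, gcc, make). MuseScore and jSymbolic are not checked here because their locations are configured in symbolic_features/settings.py.

```python
import shutil

# Commands that the dependency list above expects to find on the PATH.
REQUIRED_COMMANDS = ("java", "gcc", "make")


def missing_commands(commands=REQUIRED_COMMANDS, which=shutil.which):
    """Return the commands that cannot be found on the PATH."""
    return [cmd for cmd in commands if which(cmd) is None]


if __name__ == "__main__":
    missing = missing_commands()
    if missing:
        print("Missing from PATH:", ", ".join(missing))
    else:
        print("All required commands were found.")
```

The `which` parameter exists only so the lookup can be replaced in tests; in normal use the default `shutil.which` is enough.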

Datasets

Download the following datasets and set the path to the root of each one in symbolic_features/settings.py:

Josquin - La Rue
ASAP
Didone
EWLD
String quartets:

Haydn
Mozart
Beethoven
Unzip the above three archives into one directory, e.g. quartets/haydn, quartets/mozart, quartets/beethoven
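
The quartets layout described above can be produced with a short script. This is a sketch under assumptions: the function name is illustrative, and the archive file names in the usage comment are hypothetical placeholders for whatever the downloaded zips are actually called.

```python
import zipfile
from pathlib import Path


def unpack_quartets(archives, target_root):
    """Extract each composer's zip into its own subdirectory of target_root.

    `archives` maps a subdirectory name (e.g. "haydn") to the path of the
    corresponding downloaded zip file. Returns the sorted subdirectory names.
    """
    target_root = Path(target_root)
    for name, archive in archives.items():
        destination = target_root / name
        destination.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(destination)
    return sorted(p.name for p in target_root.iterdir() if p.is_dir())


# Example call (the zip file names are hypothetical):
# unpack_quartets(
#     {"haydn": "haydn.zip", "mozart": "mozart.zip", "beethoven": "beethoven.zip"},
#     "quartets",
# )
```

The resulting quartets directory is the root to set in symbolic_features/settings.py.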

Preprocessing

Fix invalid file names: pdm fix_names. This renames files whose names contain , or ; because those characters cause errors in CSV files.
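
To illustrate the kind of renaming involved, here is a minimal sketch. It is not the implementation behind pdm fix_names: the function names, the underscore replacement, and the choice to rename files only (not directories) are all assumptions made for this example.

```python
from pathlib import Path

# Characters that the text above names as problematic in CSV files.
BAD_CHARS = ",;"


def safe_name(name, replacement="_"):
    """Replace every comma and semicolon in a file name."""
    return "".join(replacement if ch in BAD_CHARS else ch for ch in name)


def rename_unsafe_files(root, dry_run=True):
    """Rename files under `root` whose names contain ',' or ';'.

    Returns a list of (old_path, new_path) pairs. With dry_run=True the
    pairs are only reported and nothing on disk is changed.
    """
    changes = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        new_name = safe_name(path.name)
        if new_name != path.name:
            target = path.with_name(new_name)
            changes.append((path, target))
            if not dry_run:
                path.rename(target)
    return changes
```

The dry run is the default so that the planned renames can be inspected first. The sketch does not handle the case where two different names map to the same sanitized name.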

Convert any file to MIDI: pdm convert2midi. If you are running without a display (e.g. in a remote SSH session), first run Xvfb :99 & export DISPLAY=:99
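
When the conversion is driven from a Python script rather than an interactive shell, the virtual display can be set up in the same way. The sketch below is an assumption-laden illustration, not code from this repository: the function names are made up, and it does not wait for Xvfb to finish starting.

```python
import os
import shutil
import subprocess


def needs_virtual_display(environ=os.environ):
    """Return True when no X display is configured in the environment."""
    return not environ.get("DISPLAY")


def start_virtual_display(display=":99"):
    """Start Xvfb on `display` and point DISPLAY at it.

    Returns the Xvfb process, or None when a display is already configured.
    """
    if not needs_virtual_display():
        return None
    if shutil.which("Xvfb") is None:
        raise RuntimeError("Xvfb was not found on the PATH")
    process = subprocess.Popen(["Xvfb", display])
    os.environ["DISPLAY"] = display
    return process
```

If a process is returned, the caller is responsible for calling its terminate() method once the conversion has finished.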

Feature extraction

Reproduce experiments: ./extract_all.sh

Detailed commands:

jSymbolic: pdm extract --jsymbolic --extension .mid
musif:

pdm extract --musif --extension .mid
pdm extract --musif --extension .xml
pdm extract --musif --extension .krn

music21:

pdm extract --music21 --extension .mid
pdm extract --music21 --extension .xml
pdm extract --music21 --extension .krn
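
The detailed commands above follow a regular pattern (one feature set, one extension), so they can be generated rather than typed. This is a sketch only: it reproduces the seven commands listed above and makes no claim about what extract_all.sh does internally. The function names and the dry-run switch are assumptions.

```python
import subprocess

# Feature sets and the file extensions listed for each one above.
EXTENSIONS = {
    "jsymbolic": (".mid",),
    "musif": (".mid", ".xml", ".krn"),
    "music21": (".mid", ".xml", ".krn"),
}


def extraction_commands(extensions=EXTENSIONS):
    """Build one `pdm extract` argument list per feature set and extension."""
    return [
        ["pdm", "extract", f"--{featureset}", "--extension", ext]
        for featureset, exts in extensions.items()
        for ext in exts
    ]


def run_all(dry_run=True):
    """Print every command and, unless dry_run is set, execute it."""
    for command in extraction_commands():
        print(" ".join(command))
        if not dry_run:
            subprocess.run(command, check=True)
```

Calling run_all() with its default only prints the commands; run_all(dry_run=False) executes them in order and stops at the first failure because of check=True.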

Classification accuracy

Reproduce experiments: pdm validation

Detailed commands:

pdm classification: run all experiments with original features
pdm classification --use_first_10_pc: run all experiments with first 10 Principal Components from each task (where a task is a combination of dataset, feature set, and extension)
pdm plot: plot the AutoML optimization score over time
pdm classification --featureset='music21' --dataset='EWLD' --extension='mid' --automl_time=60: run an experiment on a single task for 60 seconds
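
Since a task is a combination of dataset, feature set, and extension, single-task runs lend themselves to scripting. The helper below is a hypothetical sketch that only assembles the argument list shown in the last command above; which dataset and feature set names the command accepts beyond 'EWLD' and 'music21' is not stated here, so any other values are the caller's assumption.

```python
import shlex


def classification_command(featureset, dataset, extension, automl_time=None):
    """Build the argument list for a single-task `pdm classification` run."""
    command = [
        "pdm",
        "classification",
        f"--featureset={featureset}",
        f"--dataset={dataset}",
        f"--extension={extension}",
    ]
    if automl_time is not None:
        command.append(f"--automl_time={automl_time}")
    return command


if __name__ == "__main__":
    # Print the 60-second example from the text in shell-ready form.
    print(shlex.join(classification_command("music21", "EWLD", "mid", 60)))
```

The list can be passed directly to subprocess.run; the quotes in the shell form above are consumed by the shell and so do not appear in the arguments.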

