TY - GEN
T1 - Multidimensional Detection of Musical Similarity and Plagiarism in MultiTrack MIDI Files Using Clustering and Dynamic Programming
AU - Llerena Zuñiga, Andrea
AU - Zegarra Aguilar, Flavio
AU - Suni-Lopez, Franci
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/11/22
Y1 - 2025/11/22
N2 - This study presents a novel methodology for detecting musical plagiarism through multidimensional analysis that combines MIDI2Vec representations with dynamic programming algorithms and multithreading optimization applied to MIDI MultiTrack files. Unlike prior work that focuses primarily on melody and rhythm, our method is the first to integrate these three computational approaches for comprehensive similarity detection across multiple musical dimensions simultaneously. MIDI files, which store musical events and parameters rather than audio signals, enable detailed analysis of harmonic, rhythmic, and structural features that may indicate both direct and indirect plagiarism. To address limitations in multichannel compositions, we implement a musical edit distance algorithm using dynamic programming. The approach is validated on a dataset comprising original works, accused compositions, and simulated plagiarism cases across various genres. With a plagiarism threshold of 70%, the method achieves 96.55% precision, 100% accuracy, 94.12% recall, and a 96.97% F1 score, demonstrating its effectiveness for copyright protection in musical compositions.
AB - This study presents a novel methodology for detecting musical plagiarism through multidimensional analysis that combines MIDI2Vec representations with dynamic programming algorithms and multithreading optimization applied to MIDI MultiTrack files. Unlike prior work that focuses primarily on melody and rhythm, our method is the first to integrate these three computational approaches for comprehensive similarity detection across multiple musical dimensions simultaneously. MIDI files, which store musical events and parameters rather than audio signals, enable detailed analysis of harmonic, rhythmic, and structural features that may indicate both direct and indirect plagiarism. To address limitations in multichannel compositions, we implement a musical edit distance algorithm using dynamic programming. The approach is validated on a dataset comprising original works, accused compositions, and simulated plagiarism cases across various genres. With a plagiarism threshold of 70%, the method achieves 96.55% precision, 100% accuracy, 94.12% recall, and a 96.97% F1 score, demonstrating its effectiveness for copyright protection in musical compositions.
KW - Computational music analysis
KW - Computational musicology
KW - Detection of musical similarities
KW - Dynamic programming
KW - Hungarian Algorithm
KW - Midi Files
KW - Multidimensional Graphs
KW - Musical plagiarism
UR - https://www.scopus.com/pages/publications/105025374088
U2 - 10.1145/3771678.3771689
DO - 10.1145/3771678.3771689
M3 - Articulo (Contribución a conferencia)
AN - SCOPUS:105025374088
T3 - Proceedings of 8th International Conference on Systems Engineering - Cybersecurity and AI: Building a reliable digital future, CIIS 2025
SP - 80
EP - 86
BT - Proceedings of 8th International Conference on Systems Engineering - Cybersecurity and AI
PB - Association for Computing Machinery, Inc
T2 - 8th International Conference on Systems Engineering, CIIS 2025
Y2 - 1 October 2025 through 3 October 2025
ER -