Search for collections on Unika Repository

A COMPARATIVE ANALYSIS OF PRE- PROCESSING TECHNIQUES FOR OPTICAL MUSIC RECOGNITION: ASSESSING THE EFFICACY OF STAFFLINE REMOVAL

PAKPAHAN, KORNELIUS BOAZ APRILIO (2025) A COMPARATIVE ANALYSIS OF PRE- PROCESSING TECHNIQUES FOR OPTICAL MUSIC RECOGNITION: ASSESSING THE EFFICACY OF STAFFLINE REMOVAL. S1 thesis, UNIVERSITAS KATOLIK SOEGIJAPRANATA SEMARANG.

[img]
Preview
Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-COVER_a.pdf

Download (860kB) | Preview
[img] Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-BAB_I_a.pdf
Restricted to Registered users only

Download (803kB)
[img] Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-BAB_II_a.pdf
Restricted to Registered users only

Download (696kB)
[img] Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-BAB_III_a.pdf
Restricted to Registered users only

Download (1MB)
[img] Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-BAB_IV_a.pdf
Restricted to Registered users only

Download (1MB)
[img] Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-BAB_V_a.pdf
Restricted to Registered users only

Download (726kB)
[img]
Preview
Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-DAPUS_a.pdf

Download (721kB) | Preview
[img] Text
21.K1.0038-KORNELIUS BOAZ APRILIO PAKPAHAN-LAMP_a.pdf
Restricted to Registered users only

Download (1MB)

Abstract

Standard Optical Music Recognition (OMR) systems use staff-line removal as part of the pre-processing step to improve detection accuracy for the symbols. However, with the emergence of deep learning, this step started to seem somewhat unnecessary. In this study, we present a comparative study of three pre-processing scenarios, namely, staff-line removal via Projection Profile Analysis (PPA) subject to morphological operations (morphological), RunLength Encoding (RLE), and no staff-lines removal. In order to assess its effects on recognition performance using Symbol Error Rate (SER), a CNN+LSTM model was trained on the PrIMuS dataset—a collection of 87,678 high-quality monophonic score images—under each of these circumstances. In this study we found that the scenario without removing staff lines leads to the lowest validation SER (0.62) compared with PPA+Morphological (SER: 0.75) and RLE scenarios (SER: 0.82). Although staff lines removal approaches such as PPA (0.01s/sample) and RLE (0.1s/sample) decreased computational costs, they were associated with artifacts resulting in more degraded models. The results make a case for prioritizing symbol integrity over pre-processing efficiency in deep learning-based OMR systems, and may help mitigate pre-processing issues in future work towards streamlined workflows and model optimization.

Item Type: Thesis (S1)
Subjects: 000 Computer Science, Information and General Works
000 Computer Science, Information and General Works > 005 Computer programming, programs & data
Depositing User: Mr Yosua Norman Rumondor
Date Deposited: 09 Jul 2025 04:09
Last Modified: 09 Jul 2025 04:09
URI: http://repository.unika.ac.id/id/eprint/37316
Keywords: UNSPECIFIED

Actions (login required)

View Item View Item