Detailed Information

Cited 2 times in Web of Science; cited 3 times in Scopus

A Novel Query-by-Singing/Humming Method by Estimating Matching Positions Based on Multi-layered Perceptron

Full metadata record
DC Field: Value
dc.contributor.author: Pham, Tuyen Danh
dc.contributor.author: Nam, Gi Pyo
dc.contributor.author: Shin, Kwang Yong
dc.contributor.author: Park, Kang Ryoung
dc.date.accessioned: 2024-08-08T05:01:14Z
dc.date.available: 2024-08-08T05:01:14Z
dc.date.issued: 2013-07-30
dc.identifier.issn: 1976-7277
dc.identifier.uri: https://scholarworks.dongguk.edu/handle/sw.dongguk/18347
dc.description.abstract: The growing number of music files on smartphones and MP3 players makes it difficult for people to find the files they want. Query-by-Singing/Humming (QbSH) systems have therefore been developed to retrieve music from a user's humming or singing, without requiring the title or singer of the song. Most previous research on QbSH has used musical instrument digital interface (MIDI) files as reference songs. However, producing MIDI files is time-consuming, and new music files are published continually as the music market develops. Consequently, using the more common MPEG-1 audio layer 3 (MP3) files as reference songs is considered an alternative. There is little previous research on QbSH with MP3 files, however, because the waveform of an MP3 file differs from that of a humming/singing query owing to background music and multiple (polyphonic) melodies. To overcome these problems, we propose a new QbSH method that uses MP3 files on a mobile device. This research is novel in four ways. First, it is the first research on QbSH using MP3 files as reference songs. Second, the start and end positions on the MP3 file to be matched are estimated by a multi-layered perceptron (MLP) before matching with the humming/singing query file. Third, for more accurate results, four MLPs are used, which produce the start and end positions for the dynamic time warping (DTW) matching algorithm and for the chroma-based DTW algorithm, respectively. Fourth, the two matching scores from the DTW and chroma-based DTW algorithms are combined using the PRODUCT rule, which yields a higher matching accuracy. Experimental results with the AFA MP3 database show that the accuracy of the proposed method (Top 1 accuracy of 98%, with an MRR of 0.989) is much higher than that of other methods. We also show the effectiveness of the proposed system on a consumer mobile device.
dc.format.extent: 14
dc.language: English
dc.language.iso: ENG
dc.publisher: KSII-KOR SOC INTERNET INFORMATION
dc.title: A Novel Query-by-Singing/Humming Method by Estimating Matching Positions Based on Multi-layered Perceptron
dc.type: Article
dc.publisher.location: Republic of Korea
dc.identifier.doi: 10.3837/tiis.2013.07.008
dc.identifier.scopusid: 2-s2.0-84880965639
dc.identifier.wosid: 000322528700008
dc.identifier.bibliographicCitation: KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, v.7, no.7, pp. 1657-1670
dc.citation.title: KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS
dc.citation.volume: 7
dc.citation.number: 7
dc.citation.startPage: 1657
dc.citation.endPage: 1670
dc.type.docType: Article
dc.identifier.kciid: ART002048088
dc.description.isOpenAccess: Y
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.description.journalRegisteredClass: kci
dc.description.journalRegisteredClass: kciCandi
dc.relation.journalResearchArea: Computer Science
dc.relation.journalResearchArea: Telecommunications
dc.relation.journalWebOfScienceCategory: Computer Science, Information Systems
dc.relation.journalWebOfScienceCategory: Telecommunications
dc.subject.keywordAuthor: QbSH
dc.subject.keywordAuthor: MP3 Files
dc.subject.keywordAuthor: Multi-layered Perceptron
dc.subject.keywordAuthor: Dynamic Time Warping
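
Illustrative sketch (not part of the record)

The abstract in this record describes matching a humming/singing query against an estimated segment of a reference song with DTW and with a chroma-based DTW, fusing the two matching scores with the PRODUCT rule, and reporting Top 1 accuracy and MRR. The Python sketch below illustrates those ideas only; it is not the authors' implementation. Several things are assumptions made for the example: sequences are pitch contours given as MIDI note numbers, the "chroma" cost is a simple circular distance modulo 12, the start and end indices are hard-coded stand-ins for what the paper's MLPs would estimate, and lower fused scores are treated as better matches. All function names are hypothetical.

```python
import math


def abs_cost(a, b):
    """Local cost for plain DTW: absolute pitch difference."""
    return abs(a - b)


def chroma_cost(a, b):
    """Assumed chroma-style cost: circular distance between pitch classes."""
    d = abs(a - b) % 12
    return min(d, 12 - d)


def dtw_distance(query, reference, cost=abs_cost):
    """Classic O(n*m) dynamic time warping distance between two sequences."""
    n, m = len(query), len(reference)
    table = [[math.inf] * (m + 1) for _ in range(n + 1)]
    table[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = cost(query[i - 1], reference[j - 1])
            table[i][j] = step + min(
                table[i - 1][j],      # query advances, reference repeats
                table[i][j - 1],      # reference advances, query repeats
                table[i - 1][j - 1],  # both advance
            )
    return table[n][m]


def product_rule(scores_a, scores_b):
    """Fuse two per-candidate score lists by element-wise multiplication."""
    return [a * b for a, b in zip(scores_a, scores_b)]


def reciprocal_rank(distances, correct_index):
    """1 / rank of the correct candidate, where rank 1 is the smallest distance."""
    rank = 1 + sum(1 for d in distances if d < distances[correct_index])
    return 1.0 / rank


# Toy data: one query and two reference songs (MIDI note numbers).
query = [60, 62, 64, 62]
references = [
    [50, 60, 60, 62, 64, 64, 62, 50],  # contains the query melody
    [55, 57, 59, 57, 55, 53, 52, 50],  # unrelated melody
]
# Hypothetical (start, end) positions; in the paper these come from MLPs.
positions = [(1, 7), (0, 5)]

segments = [ref[s:e] for ref, (s, e) in zip(references, positions)]
dtw_scores = [dtw_distance(query, seg) for seg in segments]
chroma_scores = [dtw_distance(query, seg, chroma_cost) for seg in segments]
fused = product_rule(dtw_scores, chroma_scores)
best = min(range(len(fused)), key=fused.__getitem__)
```

Averaging `reciprocal_rank` over many queries gives the MRR, and the fraction of queries whose reciprocal rank equals 1.0 gives the Top 1 accuracy. In practice the two score lists would likely need normalization to a common range before multiplication; that step is omitted here because the record does not say how the paper handles it.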
Files in This Item
There are no files associated with this item.
Appears in Collections
College of Engineering > Department of Electronics and Electrical Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Park, Gang Ryung
College of Engineering (Department of Electronics and Electrical Engineering)