Automatic extraction of tonal metadata from polyphonic audio recordings

Reviews
Shared by: Maine Skweeze
Stats
views:
6
rating:
not rated
reviews:
0
posted:
2/11/2009
language:
English
pages:
0
Automatic extraction of tonal metadata from polyphonic audio recordings Tonality is a relevant aspect of music perception, and then a main axis for music description. We need to represent this aspect of music using a set of features computed from an audio recording. These features can be used for contentbased retrieval and navigation through digital music collections. {emilia.gomez,perfecto.herrera}@iua.upf.es http://www.iua.upf.es/mtg Emilia Gómez and P. Herrera Description scheme: Tonal Metadata Block diagram for feature extraction: (1) Feature Extraction (3) Results Instantaneous HPCP for a 10 seconds excerpt of a song: (1) Temporal validity: - Instantaneous descriptors: valid for a time point. - Segment descriptors: defined within an audio segment. - Global: representative of the whole excerpt or piece. (2) Level of abstraction: - Low-level: computed directly from the audio signal or from other low-level descriptors. - High-level: it requires an inductive inference procedure. (2) (4) (1) Low-level Instantaneous feature computation: HPCP (Harmonic Pitch Class Profile). It represents the intensity of each pitch class mapped to an octave. (2) High-level Instantaneous feature computation: Chord, Chord Strength HPCP Global values and correlation for Major/Minor tonalities: (3) Low-level Segment/Global descriptors computation: Average HPCP (4) High-level Segment/Global descriptors computation: Key, Key Strength E Minor Key Estimation Evaluation: Small test database with different styles, key, mode. Labeled by hand (35 sounds). Correct key note Correct mode Correct key 65,5 % 83,24 % 64,2 % Name HPCP Chord Chord Strength Global HPCP Key Key Strength Temporal Validity Instantaneous Instantaneous Segment/Global Segment/Global Segment/Global Segment/Global Level of Abstraction Low High High Low High High Database of 525 classical pieces labeled by their title. Data Type Float vector Textual label Float value Float vector Textual label Float value Correct key Mode error Tuning error 70 % 3% 6% References: • Fujishima, T. 1999. “Realtime chord recognition of musical sound: a system using Common Lisp Music”. ICMC. • Krumhansl, C.L. 1990. “Cognitive Foundations of Musical Pitch”. Oxford University Press, New York. • Sheh, A. and Ellis, D. 2003. “Chord Segmentation and Recognition using EM-Trained Hidden Markov Models “. ISMIR. List of Descriptors

Related docs
Metadata Extraction Tool, Tutorial
Views: 6  |  Downloads: 0
Metadata
Views: 2  |  Downloads: 0
Metadata
Views: 0  |  Downloads: 0
Descriptive metadata
Views: 0  |  Downloads: 0
Metadata
Views: 0  |  Downloads: 0
premium docs
Other docs by Maine Skweeze
Leasehold Estates
Views: 314  |  Downloads: 3
Future Possessory Interests[0]
Views: 263  |  Downloads: 9
Behavioral Economics
Views: 1062  |  Downloads: 108
A Shield About Me
Views: 297  |  Downloads: 1
Greatest Commands
Views: 276  |  Downloads: 4
cp10
Views: 111  |  Downloads: 0
Step-Saver Data Systems v WySE Technology (TSL)
Views: 499  |  Downloads: 8
Teleportation Physics Study
Views: 604  |  Downloads: 21
dv500info
Views: 87  |  Downloads: 0
dv100
Views: 239  |  Downloads: 1
Hannan v Dusch
Views: 445  |  Downloads: 9
Wine Tasting Glossary: Italian-English
Views: 715  |  Downloads: 17
Marshall Lefkowitz Briefs
Views: 269  |  Downloads: 0
Civil Procedure notes
Views: 299  |  Downloads: 5