03. Gefror'ne Tränen. (Frozen Tears)
Through-composed song in F minor containing a prelude, three stanzas, and a postlude.
Path-enhanced, transposition-invariant chroma features (3 sec), which correlate with harmonic and melodic progression; deviations in key are indicated by different colors.
Singer and Pianist: Anonymous. Recorded digitally on July 11, 2010.
Source, License: CC BY-NC-SA 3.0
Information about our segmentation of »03. Gefror'ne Tränen«
The first and last stanzas are mostly in F minor and its relative major key, A♭ major. The second stanza is harmonically ambiguous: its first half may be heard in the dominant key, C major, and its second half in E♭ major.
The first and second stanzas each appear once. In contrast, the last stanza is repeated.
Since the first and last stanzas (labelled A and C in our structure annotation) share tonal and rhythmic aspects, one may also annotate this song as a ternary form.
The corresponding segmentation would appear as I A B A' A'' I.
Lyrics: Project Gutenberg
CENS (Chroma energy normalized statistics)
This feature corresponds to harmonic and melodic properties of a musical piece. Chroma features like CENS are computed by a window-wise subband decomposition of the audio signal into bands corresponding to semitones (pitches). For each of the twelve pitch classes (C, C♯, D, ..., B), the corresponding pitch energies are summed over all octaves, which reduces the influence of overtones. Subsequently, the resulting chroma features are quantized, smoothed (in the temporal direction), and normalized with respect to the ℓ2-norm.
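The steps described above can be sketched in a few lines of plain Python. This is only an illustrative sketch under stated assumptions, not the implementation used for this page: the input is assumed to be a list of frames with one energy value per MIDI pitch (0 to 127), the quantization thresholds are assumed values, and a simple moving average stands in for the smoothing window. All function names are hypothetical.

```python
import math

def fold_to_chroma(pitch_energy):
    """Sum the energies of all pitches into 12 pitch classes (index 0 = C)."""
    chroma = [0.0] * 12
    for midi_pitch, energy in enumerate(pitch_energy):
        chroma[midi_pitch % 12] += energy
    return chroma

def quantize(value, thresholds=(0.05, 0.1, 0.2, 0.4)):
    """Map a relative energy to 0..4 by counting the thresholds it reaches.
    The threshold values are an assumption made for this sketch."""
    return sum(1 for t in thresholds if value >= t)

def l2_normalize(vec):
    """Scale a vector to unit Euclidean length; an all-zero vector stays zero."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)

def cens_like(pitch_frames, smooth_len=3):
    """Fold to chroma, quantize, smooth over time, and l2-normalize."""
    quantized = []
    for frame in pitch_frames:
        chroma = fold_to_chroma(frame)
        total = sum(chroma)
        relative = [c / total if total > 0 else 0.0 for c in chroma]
        quantized.append([quantize(r) for r in relative])

    features = []
    half = smooth_len // 2
    n = len(quantized)
    for i in range(n):
        # Moving average over neighboring frames (assumed smoothing window).
        lo, hi = max(0, i - half), min(n, i + half + 1)
        avg = [sum(quantized[j][k] for j in range(lo, hi)) / (hi - lo)
               for k in range(12)]
        features.append(l2_normalize(avg))
    return features
```

Because the energies of C4 and C5 both land in the pitch class C, a tone and its octave-related overtones contribute to the same chroma band.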
For transposition-invariant similarity, we compare the CENS feature sequence with a cyclically shifted version of itself instead of computing the usual self-similarity matrix of the feature sequence with itself. This yields 12 similarity matrices (one for each shift), of which we take the point-wise maximum. The brightness of the resulting matrix indicates this maximal similarity over all shifts, and the color indicates the index of the shift that attains it. In the following figure the colormap of these shifts is shown.
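A minimal sketch of the point-wise maximum over the 12 shifts is given below. It assumes the features are ℓ2-normalized 12-dimensional chroma vectors and uses the inner product as the similarity measure; the direction of the shift (upward by semitones) and the function names are assumptions of this sketch, not taken from the page.

```python
def cyclic_shift(vec, shift):
    """Transpose a 12-dimensional chroma vector upward by `shift` semitones."""
    return [vec[(k - shift) % 12] for k in range(12)]

def transposition_invariant_ssm(features):
    """Return (best_sim, best_shift) for all pairs of frames.

    best_sim[i][j]   is the maximal inner product over the 12 shifts,
    best_shift[i][j] is the shift index attaining it (ties keep the
                     smallest shift, so an unshifted match yields 0).
    """
    n = len(features)
    best_sim = [[float("-inf")] * n for _ in range(n)]
    best_shift = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for shift in range(12):
                shifted = cyclic_shift(features[j], shift)
                sim = sum(a * b for a, b in zip(features[i], shifted))
                if sim > best_sim[i][j]:
                    best_sim[i][j] = sim
                    best_shift[i][j] = shift
    return best_sim, best_shift
```

In a visualization along the lines described above, `best_sim` would drive the brightness and `best_shift` would select the color of each matrix cell.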
One can see that our colormap is inspired by the circle of fifths, where neighboring keys share similar colors. Black indicates no shift, red a shift towards the dominant key, and blue a shift towards the subdominant key. The green colors are used for more distant keys. Note that cyan and yellow also correspond to the relative major/minor key.
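To illustrate why a colormap based on the circle of fifths groups related keys, the following sketch converts a shift in semitones into a signed number of steps on the circle of fifths. It does not reproduce the actual colors of the colormap; the function name and the sign convention (positive towards the dominant, assuming upward shifts) are assumptions of this sketch.

```python
def fifths_steps(shift):
    """Signed position of a semitone shift on the circle of fifths (-5..6).

    A fifth spans 7 semitones, and 7 * 7 = 49 = 1 (mod 12), so multiplying
    a semitone shift by 7 modulo 12 yields its number of fifth steps.
    """
    steps = (shift * 7) % 12
    return steps - 12 if steps > 6 else steps

# Distance from "no shift" for each of the 12 shift indices.
distances = {shift: abs(fifths_steps(shift)) for shift in range(12)}
```

Under this convention, a shift of 7 semitones (dominant) and a shift of 5 semitones (subdominant) are each one step away from the unshifted case, while a shift of 6 semitones (tritone) is the most distant at six steps.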
- Emilia Gómez: Tonal Description of Music Audio Signals, PhD thesis, UPF Barcelona, 2006.
- Anssi Klapuri: Multipitch Analysis of Polyphonic Music and Speech Signals using an Auditory Model, IEEE TASLP 2008, pp. 255–266.
- Meinard Müller: Information Retrieval for Music and Motion, Springer 2007, Section 3.3.
- Meinard Müller, Michael Clausen: Transposition-Invariant Self-Similarity Matrices, ISMIR 2007, pp. 47–50.
- Gregory H Wakefield: Mathematical representation of joint time-chroma distributions, ISOP 1999, pp. 637–645.