Paper Abstract and Keywords |
Presentation |
2005-02-03 12:00
Audio Signal Segmentation and Classification using Fuzzy C-Means Clustering
-- A Study on Definition of Distance between Speech and Music Class -- Naoki Nitanda, Miki Haseyama, Hideo Kitajima (Hokkaido Univ.) |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Automatic segmentation and classification technique of audio signal is required for audiovisual indexing, and we have been proposed an audio signal segmentation and classification method. This method segments the audio signal into different audio signals at their boundaries, and classifies them into five audio classes, which are silence, speech, music, speech with music, and speech with noise. This paper defines a distance between speech and music class in order to judge that a speech with music class is similar to which speech or music class. The proposed method consists of three steps: (1) audio features, which represent the characteristic of speech, music, and speech with music signal, are extracted; (2) principal component analysis is applied to the extracted audio features; (3) fuzzy c-means clustering is applied to the principal components, and distance can be computed by using membership values, which are obtained from fuzzy clustering. Experimental results performed by applying the proposed method to real audio signal are shown to verify its high performance. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
audio signal / segmentation / classification / indexing / fuzzy c-means / / / |
Reference Info. |
IEICE Tech. Rep., vol. 104, no. 648, IE2004-183, pp. 51-56, Feb. 2005. |
Paper # |
IE2004-183 |
Date of Issue |
2005-01-27 (ITS, IE) |
ISSN |
Print edition: ISSN 0913-5685 |
Download PDF |
|
Conference Information |
Committee |
IE ITS ITE-AIT ITE-ME |
Conference Date |
2005-02-03 - 2005-02-04 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
|
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
|
Paper Information |
Registration To |
IE |
Conference Code |
2005-02-IE-ITS |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Audio Signal Segmentation and Classification using Fuzzy C-Means Clustering |
Sub Title (in English) |
A Study on Definition of Distance between Speech and Music Class |
Keyword(1) |
audio signal |
Keyword(2) |
segmentation |
Keyword(3) |
classification |
Keyword(4) |
indexing |
Keyword(5) |
fuzzy c-means |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Naoki Nitanda |
1st Author's Affiliation |
Hokkaido University (Hokkaido Univ.) |
2nd Author's Name |
Miki Haseyama |
2nd Author's Affiliation |
Hokkaido University (Hokkaido Univ.) |
3rd Author's Name |
Hideo Kitajima |
3rd Author's Affiliation |
Hokkaido University (Hokkaido Univ.) |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2005-02-03 12:00:00 |
Presentation Time |
30 minutes |
Registration for |
IE |
Paper # |
ITS2004-49, IE2004-183 |
Volume (vol) |
vol.104 |
Number (no) |
no.646(ITS), no.648(IE) |
Page |
pp.51-56 |
#Pages |
6 |
Date of Issue |
2005-01-27 (ITS, IE) |
|