Paper Abstract and Keywords |
Presentation |
2023-10-14 16:15
Comparative study on different speaker embedding spaces focusing on the relation to perceptual inter-speaker similarity Wakuto Morita, Daisuke Saito, Nobuaki Minematsu (Univ. of Tokyo) SP2023-31 WIT2023-22 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
This study examines the correspondence between inter-speaker similarity based on speaker embeddings and perceptual speaker similarity based on human listening tests. In our previous study, we have shown that the tendency of correspondence mentioned above depends on the dimension of embedding space. This paper introduces a speaker embedding method which can encode discriminative information on speaker individuality even in low dimensions, and discusses the effect of differences in embedding methods on the correspondence with human perception. The experimental results have shown that 1) a general tendency independent of the embedding methods was confirmed and 2) the degree of change in the tendency depended on the embedding methods. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Speaker Embeddings / Human Perception / Triplet Loss / Poincaré Embeddings / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 123, no. 212, SP2023-31, pp. 21-26, Oct. 2023. |
Paper # |
SP2023-31 |
Date of Issue |
2023-10-07 (SP, WIT) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2023-31 WIT2023-22 |
Conference Information |
Committee |
WIT SP IPSJ-SLP |
Conference Date |
2023-10-14 - 2023-10-14 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Kyushu Institute of Technology |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Speech and Well-being Information Technology, etc. |
Paper Information |
Registration To |
SP |
Conference Code |
2023-10-WIT-SP-SLP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Comparative study on different speaker embedding spaces focusing on the relation to perceptual inter-speaker similarity |
Sub Title (in English) |
|
Keyword(1) |
Speaker Embeddings |
Keyword(2) |
Human Perception |
Keyword(3) |
Triplet Loss |
Keyword(4) |
Poincaré Embeddings |
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Wakuto Morita |
1st Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
2nd Author's Name |
Daisuke Saito |
2nd Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
3rd Author's Name |
Nobuaki Minematsu |
3rd Author's Affiliation |
The University of Tokyo (Univ. of Tokyo) |
4th Author's Name |
|
4th Author's Affiliation |
() |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2023-10-14 16:15:00 |
Presentation Time |
25 minutes |
Registration for |
SP |
Paper # |
SP2023-31, WIT2023-22 |
Volume (vol) |
vol.123 |
Number (no) |
no.212(SP), no.213(WIT) |
Page |
pp.21-26 |
#Pages |
6 |
Date of Issue |
2023-10-07 (SP, WIT) |
|