Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
EA |
2022-05-13 15:00 |
Online |
Online |
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino (NTT) EA2022-9 |
Many application studies rely on audio DNN models pre-trained on a large-scale dataset as essential feature extractors, ... [more] |
EA2022-9 pp.41-45 |
PRMU |
2021-12-17 11:00 |
Online |
Online |
PRMU2021-46 |
(To be available after the conference date) [more] |
PRMU2021-46 pp.118-123 |
SP, EA, SIP |
2020-03-02 16:45 |
Okinawa |
Okinawa Industry Support Center (Cancelled but technical report was issued) |
[Fellow Memorial Lecture]
Building Dictionary for Media Information Kunio Kashino (NTT) EA2019-133 SIP2019-135 SP2019-82 |
[more] |
EA2019-133 SIP2019-135 SP2019-82 p.187 |
RISING (2nd) |
2019-11-27 13:55 |
Tokyo |
Fukutake Learning Theater, Hongo Campus, Univ. Tokyo |
|
(To be available after the conference date) [more] |
|
PRMU |
2019-10-18 13:55 |
Tokyo |
|
[Fellow Memorial Lecture]
Knowledge Acquisition and Media Search Based on Crossmodal Information Processing Kunio Kashino (NTT) PRMU2019-36 |
Media search can be roughly divided into two methods: a method in which fragments of media content are used as queries a... [more] |
PRMU2019-36 p.27 |
SIP, EA, SP, MI (Joint) [detail] |
2018-03-20 10:40 |
Okinawa |
|
[Invited Talk]
Progress of Research on Cross-modal Scene Analysis Kunio Kashino (NTT) EA2017-166 SIP2017-175 SP2017-149 |
The problem of scene analysis means to understand, or describe, various external objects or events associated with spati... [more] |
EA2017-166 SIP2017-175 SP2017-149 p.353 |
IN |
2018-01-23 13:00 |
Aichi |
WINC AICHI |
[Invited Talk]
Media Fingerprinting Technologies Supporting Digital Media Contents Distribution and those Applications Takahito Kawanishi, Hidehisa Nagano, Minoru Mori, Xiaomeng Wu, Yasunori Oishi, Kaoru Hiramatsu, Kunio Kashino (NTT) IN2017-85 |
Media fingerprinting is a technique which identify a media sample or quickly locate similar parts in large media databas... [more] |
IN2017-85 p.83 |
NLC, IPSJ-NL, SP, IPSJ-SLP (Joint) [detail] |
2017-12-21 12:50 |
Tokyo |
Waseda Univ. Green Computing Systems Research Organization |
[Poster Presentation]
Shota Ikawa (Univ. Tokyo), Kunio Kashino (Univ. Tokyo/NTT) SP2017-58 |
Representing various acoustic events by natural language seems to play an important role in natural man-machine communic... [more] |
SP2017-58 pp.17-20 |
IBISML |
2017-11-09 13:00 |
Tokyo |
Univ. of Tokyo |
Bayesian nonparametric model for uncountable factor analysis Masahiro Nakano (NTT), Daichi Mochihashi, Tomoko Matsui (ISM), Kunio Kashino (NTT) IBISML2017-59 |
[more] |
IBISML2017-59 pp.185-192 |
VLD, DC, CPSY, RECONF, CPM, ICD, IE, IPSJ-SLDM, IPSJ-EMB, IPSJ-ARC (Joint) [detail] |
2017-11-08 12:45 |
Kumamoto |
Kumamoto-Kenminkouryukan Parea |
[Invited Talk]
Accurate Color Reproduction using Multiband Image and Its Applications Masaru Tsuchida, Kaoru Hiramatsu, Kunio Kashino (NTT) CPM2017-88 ICD2017-47 IE2017-73 |
Estimation of spectral reflectance of an object’s surface is required for accurate color reproduction as if the object i... [more] |
CPM2017-88 ICD2017-47 IE2017-73 pp.47-48 |
PRMU |
2017-10-13 14:50 |
Kumamoto |
|
PRMU2017-98 |
It is important to estimate what attracts many people. However, when we observe many people simultaneously, gaze estimat... [more] |
PRMU2017-98 pp.199-204 |
SP, SIP, EA |
2017-03-02 12:45 |
Okinawa |
Okinawa Industry Support Center |
Non-native speech conversion with consistency-aware recursive network and generative adversarial network Keisuke Oyamada (Univ. of Tsukuba), Hirokazu Kameoka, Takuhiro Kaneko (NTT), Hiroyasu Ando (Univ. of Tsukuba), Kaoru Hiramatsu, Kunio Kashino (NTT) EA2016-139 SIP2016-194 SP2016-134 |
This paper deals with the problem of automatically modifying the pronunciation of non-native speech.
Since the pronunci... [more] |
EA2016-139 SIP2016-194 SP2016-134 pp.315-320 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 15:10 |
Tokyo |
NTT Musashino R&D |
[Poster Presentation]
Fast algorithm for statistical phrase/accent command estimation based on generative model incorporating spectral features Ryotaro Sato (The Univ. of Tokyo), Hirokazu Kameoka, Kunio Kashino (NTT) SP2016-56 |
On the basis of the Fujisaki model, we propose a fast algorithm for estimating the model parameters, namely, the timings... [more] |
SP2016-56 pp.43-48 |
SP, IPSJ-SLP, NLC, IPSJ-NL (Joint) [detail] |
2016-12-20 16:40 |
Tokyo |
NTT Musashino R&D |
Generative Adversarial Network-based Postfiltering for Statistical Parametric Speech Synthesis Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino (NTT) SP2016-61 |
In the field of speech synthesis, statistical parametric speech synthesis has been widely used due to the flexibility an... [more] |
SP2016-61 pp.89-94 |
IBISML |
2016-11-17 14:00 |
Kyoto |
Kyoto Univ. |
Infinite graphs with finite diameters Chihiro Watanabe, Masahiro Nakano, Xiaomeng Wu, Takahito Kawanishi, Kaoru Hiramatsu, Kunio Kashino (NTT) IBISML2016-77 |
Recently, new stochastic processes have been proposed for analyzing structured data with combinatorial constraints. Part... [more] |
IBISML2016-77 pp.221-228 |
PRMU, IPSJ-CVIM, MVE [detail] |
2016-01-21 11:00 |
Osaka |
|
A method for gaze correction between video conference participants that synthesizes eye areas image within the perceptual range of eye contact Takuya Inoue, Tomokazu Takahashi, Takatsugu Hirayama, Yasutomo Kawanishi, Daisuke Deguchi, Ichiro Ide, Hiroshi Murase (Nagoya Univ.), Takayuki Kurozumi, Kunio Kashino (NTT) PRMU2015-119 MVE2015-41 |
Recently, the spread of Web cameras has facilitated video-conferencing. Since a Web camera is usually located outside th... [more] |
PRMU2015-119 MVE2015-41 pp.53-58 |
IBISML |
2015-11-27 14:00 |
Ibaraki |
Epochal Tsukuba |
[Poster Presentation]
Stochastic process representation of R-tree Masahiro Nakano, Xiaomeng Wu, Minoru Mori, Akisato Kimura, Kunio Kashino (NTT) IBISML2015-76 |
This paper discusses a new stochastic process that represents the innitely exchangeable rectangular partitioning of a m... [more] |
IBISML2015-76 pp.175-182 |
PRMU, IBISML, IPSJ-CVIM [detail] |
2015-09-14 13:00 |
Ehime |
|
Towards Automatic 3D Reconstruction of Categories Using Manifolds Kent Fujiwara, Minoru Mori, Kunio Kashino (NTT) PRMU2015-67 IBISML2015-27 |
[more] |
PRMU2015-67 IBISML2015-27 pp.1-6 |
PRMU, BioX |
2015-03-20 17:10 |
Kanagawa |
|
Specific object search based on result re-ranking using region-of-interest information Masaya Murata, Hidehisa Nagano, Takahito Kawanishi, Kaoru Hiramatsu, Kunio Kashino (NTT) BioX2014-80 PRMU2014-200 |
In this paper we first explain our retrieval approach for the specific object search task based on image queries. We the... [more] |
BioX2014-80 PRMU2014-200 pp.245-249 |
PRMU |
2014-10-10 13:30 |
Chiba |
|
A Study on Image Transformation of Eye Areas for Synthesizing Eye-Contacts in Video Conferencing Takuya Inoue, Tomokazu Takahashi, Takatsugu Hirayama, Daisuke Deguchi, Ichiro Ide, Hiroshi Murase (Nagoya Univ.), Takayuki Kurozumi, Kunio Kashino (NTT) PRMU2014-60 |
In recent years, the spread of Web cameras have facilitated video conferencing. Although a Web camera is usually located... [more] |
PRMU2014-60 pp.33-38 |