27. [Toda, 2014] T. Toda. Augmented speech production based on real-time statistical voice conversion. Proc.
GlobalSIP, pp. 755-759, 2014.
[Banno et al., 2007] H. Banno1, H. Hata, M. Morise, T. Takahashi, T. Irino, H. Kawahara. Implementation of
realtime STRAIGHT speech manipulation system: Report on its first implementation. Acoustical Science and
Technology. Vol. 28, No. 3, pp. 140-146, 2007.
[Kobayashi et al., 2016a] K. Kobayashi, T. Toda, S. Nakamura. F0 transformation techniques for statistical
voice conversion with direct waveform modification with spectral differential. Proc. IEEE SLT, pp. 693-700,
2016.
[Abe et al., 1990] M. Abe, S. Nakamura, K. Shikano, H. Kuwabara. Voice conversion through vector
quantization. J. Acoust. Soc. Jpn (E), Vol. 11, No. 2, pp. 71-76, 1990.
[Stylianou et al., 1998] Y. Stylianou, O. Capp´e, E. Moulines. Continuous probabilistic transform for voice
conversion. IEEE Trans. Speech & Audio Process., Vol. 6, No. 2, pp. 131-142, 1998.
[Toda et al., 2007a] T. Toda, A.W. Black, K. Tokuda. Voice conversion based on maximum likelihood
estimation of spectral parameter trajectory. IEEE Transactions on Audio, Speech and Language Processing,
Vol. 15, No. 8, pp. 2222-2235, 2007.
[Tobing et al., 2016] P.L. Tobing, T. Toda, H. Kameoka, S. Nakamura. Acoustic-to-articulatory inversion
mapping based on latent trajectory Gaussian mixture model. Proc. INTERSPEECH, pp. 953-957, 2016.
[徳田 他, 1997] 徳田恵一, 益子貴史, 小林隆夫, 今井 聖. 動的特徴を用いた HMMからの音声パラメータ
生成アルゴリズム. 日本音響学会誌, Vol. 53, No. 3, pp. 192–200, 1997.
[Furui, 1981] S Furui. Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoustics,
Speech, and Signal Process. Vol. 29, No. 2, pp. 254-272, 1981.
参考文献(1)
参考文献:1
28. [Takamichi et al., 2016] Takamichi, T. Toda, A.W. Black, G. Neubig, S. Sakti, S. Nakamura. Post-filters to
modify the modulation spectrum for statistical parametric speech synthesis. IEEE/ACM Transactions on
Audio, Speech and Language Processing, Vol. 24, No. 4, pp. 755-767, Apr. 2016.
[Toda et al., 2007b] T. Toda, Y. Ohtani, K. Shikano. One-to-many and many-to-one voice conversion based
on eigenvoices. Proc. IEEE ICASSP, pp. 1249-1252, 2007
[Toda et al., 2006] T. Toda, Y. Ohtani, K. Shikano. Eigenvoice conversion based on Gaussian mixture model.
Proc. INTERSPEECH, pp. 2446-2449, 2006.
[Kuhn et al., 2000] R. Kuhn, J.-C. Junqua, P. Nguyen, N. Niedzielski. Rapid speaker adaptation in eigenvoice
space. IEEE Trans. Speech & Audio Process. Vol. 8, No. 6, pp. 695-707, 2000.
[Ohtani et al., 2009] Y. Ohtani, T. Toda, H. Saruwatari, K. Shikano. Non-parallel training for many-to-many
eigenvoice conversion. Proc. IEEE ICASSP, pp. 4822-4825, Dallas, USA, Mar. 2010.
[Ohta et al., 2010] K. Ohta, T. Toda, Y. Ohtani, H. Saruwatari, K. Shikano. Adaptive voice-quality control
based on one-to-many eigenvoice conversion. Proc. INTERSPEECH, pp. 2158-2161, 2010.
[Kobayashi et al., 2014] K. Kobayashi, T. Toda, H. Doi, T. Nakano, M. Goto, G. Neubig, S. Sakti, S. Nakamura.
Voice timbre control based on perceived age in singing voice conversion. IEICE Transactions on Information
and Systems, Vol. E97-D, No. 6, pp. 1419-1428, 2014.
[Kobayashi et al., 2016b] K. Kobayashi, T. Toda, T. Nakano, M. Goto, S. Nakamura. Improvements of voice
timbre control based on perceived age in singing voice conversion. IEICE Transactions on Information and
Systems, Vol. E99-D, No. 11, pp. 2767-2777, 2016.
参考文献(2)
参考文献:2