Takashi Fukuda photo

Contact Information

Takashi Fukuda
Speech and Language Processing
Tokyo Research Laboratory, Yamato, Japan
FUKUDA1atjp.ibm.com      +81dash3dash5144dash2867


Tab navigation

Awards:

  • IPSJ Yamashita SIG Research Award, Information Processing Society of Japan, 2013
  • IEICE ISS Young Researcher's Award in Speech Field, The Institute of Electronics, Information and Communication Engineers(IEICE), 2012
  • The 28th Awaya Kiyoshi Academic Encouraging Award, The Acoustical Society of Japan(ASJ), 2010.

Journal Papers:

  • Takashi Fukuda, Osamu Ichikawa, and Masafumi Nishimura, "Long-term Spectro-temporal and Static Harmonic Features for Voice Activity Detection," IEEE Journal of Selected Topics in Signal Processing,Vol.4, No.5, pp.834-844, 2010.
  • Osamu Ichikawa, Takashi Fukuda, and Masafumi Nishimura, "Dynamic Features in the Linear-Logarithmic Hybrid Domain for Automatic Speech Recognition in a Reverberant Environment," IEEE Journal of Selected Topics in Signal Processing,Vol.4, No.5, pp.816-823, 2010.
  • Osamu Ichikawa, Takashi Fukuda, and Masafumi Nishimura, "DOA Estimation with Local-Peak-Weighted CSP," EURASIP Journal on Advances in Signal Processing,Volume 2010, Article ID 358729, 9 pages, 2010.
  • Osamu Ichikawa, Takashi Fukuda, and Masafumi Nishimura, "Local Peak Enhancement for In-Car Speech Recognition in Noisy Environment," The Institute of Electronics, Information and Communication Engineers (IEICE) Transactions on Information and Systems, Vol. E91-D, No.3, pp.635-639, March 2008.
  • Mohammad Nurul Huda, Muhammad Ghulam, Takashi Fukuda,Kouichi Katsurada, and Tsuneo Nitta, "Canonicalization of Feature Parameters for Robust Speech Recognition Based on Distinctive Phonetic Feature (DPF) Vectors," The Institute of Electronics, Information and Communication Engineers (IEICE) Transactions on Information and Systems,Vol. E91-D, No.3, pp.488-498, March 2008.
  • Muhammad GHULAM, Takashi Fukuda, Kohichi Katsurada, Junsei Horikawa, and Tsuneo Nitta, "PS-ZCPA based features extraction with auditory masking, modulation enhancement and noise reduction for robust ASR," The Institute of Electronics, Information and Communication Engineers (IEICE) Transactions on Information and Systems, Vol.E89-D, No.3, pp.1015-1023, March 2005.
  • Takashi Fukuda and Tsuneo Nitta, "Orthogonalized Distinctive Phonetic Feature Extraction for Noise-robust Automatic Speech Recognition," The Institute of Electronics, Information and Communication Engineers (IEICE) Transactions on Information and Systems, Vol.E87-D, No.5, pp.1110-1118, May 2004.
  • Muhammad Ghulam, Takaharu Sato, Takashi Fukuda, and Tsuneo Nitta, "Confidence Scoring for Accurate HMM-based Speech Recognition by Using Monophone-Level Normalization Based on Subspace Method," The Institute of Electronics, Information and Communication Engineers (IEICE) Transactions on Information and Systems, Vol.E86-D, No.3, pp.430-437, March 2003.
  • Takashi Fukuda and Tsuneo Nitta, "Improvement in both Tasks of LVCSR and ISWR by using Peripheral Feature Extraction and CMN Control," Journal of Information Processing Society of Japan (IPSJ), Vol.43,No.7,pp.2022-2029,July 2002.

International Conference Papers:

  • Osamu Ichikawa, Steven J. Rennie, Takashi Fukuda, and Masafumi Nishimura, "Channel-mapping for speech corpus recycling," Proc. of 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013), pp.7160-7164, May 2013, Vancouver, Canada.
  • Takashi Fukuda, Ryuki Tachibana, Upendra Chaudhari, Bhuvana Ramabhadran, and Puming Zhan, "Constructing Ensembles of Dissimilar Acoustic Models using Hidden Attributes of Training Data," Proc. of 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), pp.4141-4144, March 2012, Kyoto, Japan.
  • Osamu Ichikawa, Steven Rennie, Takashi Fukuda, and Masafumi Nishimura, "Model-based Noise Reduction Reveraging Frequency-wise Confidence Metric for In-car Speech Recognition," Proc. of 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), pp.4921-4924, March 2012, Kyoto, Japan.
  • Ryuki Tachibana, Takashi Fukuda, Upendra Chaudhari, Bhuvana Ramabhadran, and Puming Zhan, "Frame-level AnyBoost for LVCSR with the MMI Criterion," Proc. of IEEE Workshop on Automatic Speech Recognition and Unterstanding (ASRU 2011), pp.12-17, December 2011, Hawaii, USA.
  • Takashi Fukuda, Osamu Ichikawa, and Masafumi Nishimura, "Combining Feature Space Discriminative Training with Long-term Spectro-temporal Features for Noise-robust Speech Recognition," Proc. of 12th Annual Conference on the International Speech Communication Association (Interspeech 2011), pp.229-232, August 2011, Florence, Italy.
  • Takashi Fukuda, Osamu Ichikawa, and Masafumi Nishimura, "Breath-detection-based Telephony Speech Phrasing," Proc. of 12th Annual Conference on the International Speech Communication Association (Interspeech 2011), pp.2625-2628, August 2011, Florence, Italy.
  • Takashi Fukuda, Osamu Ichikawa, and Masafumi Nishimura, "Improved Voice Activity Detection Using Static Harmonic Features," Proc. of 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.4482-4485, March 2010, Dallas, Texas, USA.
  • Osamu Ichikawa, Takashi Fukuda, and Masafumi Nishimura, "Dynamic Features in the Linear Domain for Robust Automatic Speech Recognition in a Reverberant Environment," Proc. of 11th European Conference on Speech Communication and Technology (Eurospeech 2009 / Interspeech 2009), pp.44-47, September 2009, Brighton, U.K.
  • Takashi Fukuda, Osamu Ichikawa, and Masafumi Nishimura, "Short- and Long-term Dynamic Features for Robust Speech Recognition," Proc of 10th International Conference on Spoken Language Processing (ICSLP 2008 / Interspeech 2008), pp.2262-2265, September 2008, Brisbane, Australia.
  • Takashi Fukuda, Osamu Ichikawa, and Masafumi Nishimura, "Phone-duration-dependent Long-term Dynamic Features for Stochastic Model-based Voice Activity Detection," Proc of 10th International Conference on Spoken Language Processing (ICSLP 2008 / Interspeech 2008), pp.1293-1296, September 2008, Brisbane, Australia.
  • Osamu Ichikawa, Takashi Fukuda, and Masafumi Nishimura, "Local Peak Enhancement Combined with Noise Reduction Algorithms for Robust Automatic Speech Recognition in Automobiles," Proc. of 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), pp.4865-4868, April 2008, Las Vegas, Nevada, USA.
  • Takashi Fukuda and Tsuneo Nitta, "Designing Multiple Distinctive Phonetic Feature Extractors for Canonicalization by Using Clustering Technique," Proc. of 9th European Conference on Speech Communication and Technology (Eurospeech 2005 / Interspeech 2005), pp.3141-3144,September 2005, Lisbon, Portugal.
  • Muhammad Ghulam, Takashi Fukuda, Junsei Horikawa, and Tsuneo Nitta, "Pitch-Synchronous ZCPA (PS-ZCPA)-Based Feature Extraction with Auditory Masking," Proc. 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Vol. I, pp.517-520, March 2005, Philadelphia, Pennsylvania, USA.
  • Takashi Fukuda and Tsuneo Nitta, "Canonicalization of Feature Parameters for Automatic Speech Recognition," Proc. of 8th International Conference on Spoken Language Processing (ICSLP 2004 / Interspeech 2004), Vol.IV, pp.2537-2540, October 2004, Korea.
  • Muhammad Ghulam, Takashi Fukuda, Junsei Horikawa, and T. Nitta, "A Noise-Robust Feature Extraction Method Based on Pitch-Synchronous ZCPA for ASR," Proc. 8th International Conference on Spoken Language Processing (ICSLP 2004 / Interspeech 2004), Vol.I, pp.133-136, October 2004, Jeju, Korea.
  • Takashi Fukuda and Tsuneo Nitta, "Noise-robust Automatic Speech Recognition Using Orthogonalized Distinctive Phonetic Feature Vectors," Proc. of 8th European Conference on Speech Communication and Technology (Eurospeech 2003 / Interspeech 2003), Vol.III, pp.2189-2192, September 2003, Geneva, Switzerland.
  • Takashi Fukuda and Tsuneo Nitta, "Noise-robust ASR by Using Distinctive Phonetic Features Approximated with Logarithmic Normal Distribution of HMM," Proc. of 8th European Conference on Speech Communication and Technology (Eurospeech 2003 / Interspeech 2003), Vol.III, pp.2185-2188,September 2003, Geneva, Switzerland.
  • Muhammad Ghulam, Takashi Fukuda, and Tsuneo Nitta, "Voice Quality Normalization in an Utterance for Robust ASR," Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003 / Interspeech 2003), Vol.III, pp.2173-2176, September 2003, Geneva, Switzerland.
  • Tsuneo Nitta, Shingo Iseji, Takashi Fukuda, Hirobumi Yamada, and Katsurada Katsurada, "Key-word Spotting Using Phonetic Distinctive Features Extracted from Output of an LVCSR Engine," Proc. ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition (SSPR 2003), pp.99-102, April 2003, Tokyo, Japan.
  • Takashi Fukuda, Wataru Yamamoto and Tsuneo Nitta, "Distinctive Phonetic Feature Extraction for Robust Speech Recognition," Proc. of 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Vol.Ⅱ, pp.25-28,April 2003, Hong Kong, China.
  • Muhammad Ghulam, Takaharu Sato, Takashi Fukuda, and Tsuneo Nitta, "Improving Performance of an HMM-based ASR System By Using Monophone-Level Normalized Confidence Measure," Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002 / Interspeech 2002), Vol.IV, pp.2453-2456, September 2002, Denver, Colorado, USA.
  • Takaharu Sato, Muhammad Ghulam, Takashi Fukuda, and Tsuneo Nitta, "Confidence Scoring for Accurate HMM-based Word Recognition By Using SM-based Monophone Score Normalization," Proc. 2002 IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP 2002), Vol.I, pp.217-220, May 2002, Orlando, Florida, USA.
  • Takashi Fukuda, Masashi Takigawa and Tsuneo Nitta, "Peripheral Features for HMM-based Speech Recognition," Proc. of 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Vol.I, pp.129-132, May 2001, Salt Lake City, Utah, USA.
  • Tsuneo Nitta, Masashi Takigawa, and Takashi Fukuda, "A Novel Feature Extraction Using Multiple Acoustic Feature Planes for HMM-based Speech Recognition," Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000 / Interspeech 2000), Vol.I, pp.385-388, October 2000, Beijing, China.