Researchers

Parham Mokhtari

Main Lab Location:

NICT (Keihanna)

Specific Research Topic:

Human Vocal Communication of Affect

Phone: 

+81-774-98-6407

Mailing Address:

3-5 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0289

Email: 

Homepage:

Broadly, my research seeks to clarify and discover links between the physical, psychophysical, physiological and neural mechanisms involved in the acoustic modality of human perception and communication.

Currently I investigate the vocal expression of human emotions. Different affective states in the brain are accompanied by distinct patterns of physiological changes, which influence our vocal apparatus and hence the resulting acoustic signal. To detect the underlying states from acoustic-auditory information, it is therefore important to reliably estimate the shape of the normally invisible vocal tract (speech inverse problem) and especially the laryngeal mode of vibration (glottal source waveform) which determines voice quality. To achieve this, I use a combination of signal processing, machine learning, acoustic simulation, and non-invasive measurements such as electroglottography (EGG) and structural MRI.

This research aims to enable sympathetic interactions between humans and all kinds of robots or machines that are expected to be increasingly pervasive in society.

Selected Publications:

Mokhtari, P., Takemoto, H., Nishimura, R. and Kato, H. (2016). Vertical normal modes of human ears: individual variation and frequency estimation from pinna anthropometry. Journal of the Acoustical Society of America, 140(2), 814-831. http://dx.doi.org/10.1121/1.4960481

Mokhtari, P., Takemoto, H., Nishimura, R. and Kato, H. (2015). Frequency and amplitude estimation of the first peak of head-related transfer functions from pinna anthropometry. Journal of the Acoustical Society of America, 137(2), 690-701. http://dx.doi.org/10.1121/1.4906160

Mokhtari, P., Takemoto, H., Nishimura, R. and Kato, H. (2010). Optimum loss factor for a perfectly matched layer in finite-difference time-domain acoustic simulation. IEEE Transactions on Audio, Speech, and Language Processing, 18(5), 1068-1071. http://dx.doi.org/10.1109/TASL.2009.2035036

Mokhtari, P., Takemoto, H. and Kitamura, T. (2008). Single-matrix formulation of a time domain acoustic model of the vocal tract with side branches. Speech Communication, 50(3), 179-190. http://dx.doi.org/10.1016/j.specom.2007.08.001

Mokhtari, P., Kitamura, T., Takemoto, H. and Honda, K. (2007). Principal components of vocal tract area functions and inversion of vowels by linear regression of cepstrum coefficients. Journal of Phonetics, 35(1), 20-39. http://dx.doi.org/10.1016/j.wocn.2006.01.001