Anat Lerner, Vered Silber-Varod, Fernando Batista and Helena Moniz
In Proceedings of Speech Prosody (2016), 31 May-3 June 2016, Boston, MA, USA
The goal of this research is to identify speaker’s role via machine learning of broad acoustic parameters, in order to understand how an occupation, or a role, affects voice characteristics. The examined corpus consists of recordings taken under the same psychological paradigm (Process Work). Four interns were involved in four genuine client-therapist treatment sessions, where each individual had to train her therapeutic skills on her colleague that, in her turn, participated as a client. This uniform setting provided a unique opportunity to examine how role affects speaker’s prosody. By a collection of machine learning algorithms, we tested automatic classification of the role across sessions. Results based on the acoustic properties show high classification rates, suggesting that there are discriminative acoustic features of speaker’s role, as either a therapist or a client.