The Community for Technology Leaders
RSS Icon
Issue No.03 - July-September (2012 vol.3)
pp: 273-284
Gelareh Mohammadi , Idiap Research Institute, Martigny
Alessandro Vinciarelli , University of Glasgow, Glasgow and Idiap Research Institute, Martigny
Whenever we listen to a voice for the first time, we attribute personality traits to the speaker. The process takes place in a few seconds and it is spontaneous and unaware. While the process is not necessarily accurate (attributed traits do not necessarily correspond to the actual traits of the speaker), still it significantly influences our behavior toward others, especially when it comes to social interaction. This paper proposes an approach for the automatic prediction of the traits the listeners attribute to a speaker they never heard before. The experiments are performed over a corpus of 640 speech clips (322 identities in total) annotated in terms of personality traits by 11 assessors. The results show that it is possible to predict with high accuracy (more than 70 percent depending on the particular trait) whether a person is perceived to be in the upper or lower part of the scales corresponding to each of the Big -Five, the personality dimensions known to capture most of the individual differences.
Correlation, Speech, Psychology, Humans, Accuracy, Robots, Support vector machines, automatic personality perception, Personality traits, prosody, Big Five, social signal processing
Gelareh Mohammadi, Alessandro Vinciarelli, "Automatic Personality Perception: Prediction of Trait Attribution Based on Prosodic Features", IEEE Transactions on Affective Computing, vol.3, no. 3, pp. 273-284, July-September 2012, doi:10.1109/T-AFFC.2012.5
[1] J.S. Uleman, L.S. Newman, and G.B. Moskowitz, “People as Flexible Interpreters: Evidence and Issues from Spontaneous Trait Inference,” Advances in Experimental Social Psychology, M.P. Zanna, ed., vol. 28, pp. 211-279, Academic Press, 1996.
[2] J.S. Uleman, S.A. Saribay, and C.M. Gonzalez, “Spontaneous Inferences, Implicit Impressions, and Implicit Theories,” Ann. Rev. of Psychology, vol. 59, pp. 329-360, 2008.
[3] C. Olivola and A. Todorov, “Elected in 100 Milliseconds: Appearance-Based Trait Inferences and Voting,” J. Nonverbal Behavior, vol. 34, no. 2, pp. 83-110, 2010.
[4] B. Reeves and C. Nass, The Media Equation: How People Treat Computers, Television, and New Media like Real People and Places. Cambridge Univ. Press, 1996.
[5] C. Nass and S. Brave, Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship. The MIT Press, 2005.
[6] A. Tapus and M. Mataric, “Socially Assistive Robots: The Link between Personality, Empathy, Physiological Signals, and Task Performance,” Proc. Assoc. for Advancement of Artificial Intelligence Spring Symp., 2008.
[7] D. Kolar, D. Funder, and C. Colvin, “Comparing the Accuracy of Personality Judgments by the Self and Knowledgeable Others,” J. Personality, vol. 64, no. 2, pp. 311-337, 1996.
[8] D. Ozer and V. Benet-Martinez, “Personality and the Prediction of Consequential Outcomes,” Ann. Rev. of Psychology, vol. 57, pp. 401-421, 2006.
[9] K.R. Scherer, “Personality Inference from Voice Quality: The Loud Voice of Extroversion,” European J. Social Psychology, vol. 8, pp. 467-487, 1978.
[10] R. Picard, Affective Computing. The MIT Press, 2000.
[11] A. Vinciarelli, M. Pantic, and H. Bourlard, “Social Signal Processing: Survey of an Emerging Domain,” Image and Vision Computing J., vol. 27, no. 12, pp. 1743-1759, 2009.
[12] B. Rammstedt and O. John, “Measuring Personality in One Minute or Less: A 10-Item Short Version of the Big Five Inventory in English and German,” J. Research in Personality, vol. 41, no. 1, pp. 203-212, 2007.
[13] C. Nass and K.M. Lee, “Does Computer-Synthesized Speech Manifest Personality? Experimental Tests of Recognition, Similarity-Attraction and Consistency-Attraction,” J. Experimental Psychology: Applied, vol. 7, no. 3, pp. 171-181, 2001.
[14] M. Tkalcic, T. Tasic, and J. Kosir, “Emotive and Personality Parameters in Multimedia Recommender Systems,” Proc. IEEE Int'l Conf. Affective Computing and Intelligent Interaction, 2009.
[15] M. Pantic and A. Vinciarelli, “Implicit Human-Centered Tagging,” IEEE Signal Processing Magazine, vol. 26, no. 6, pp. 173-180, Nov. 2009.
[16] D. Funder, “Personality,” Ann. Rev. of Psychology, vol. 52, pp. 197-221, 2001.
[17] G. Matthews, I. Deary, and M. Whiteman, Personality Traits. Cambridge Univ. Press, 2003.
[18] G. Saucier and L. Goldberg, “The Language of Personality: Lexical Perspectives on the Five-Factor Model,” The Five-Factor Model of Personality, J. Wiggins, ed., Guilford Press, 1996.
[19] E. Sapir, “Speech as a Personality Trait,” The Am. J. Sociology, vol. 32, no. 6, pp. 892-905, 1927.
[20] D.W. Addington, “The Relationship of Selected Vocal Characteristics to Personality Perception,” J. Speech Monographs, vol. 35, no. 4, pp. 492-503, 1968.
[21] G.B. Ray, “Vocally Cued Personality Prototypes: An Implicit Personality Theory Approach,” J. Comm. Monographs, vol. 53, no. 3, pp. 266-276, 1986.
[22] K.R. Scherer, “Effect of Stress on Fundamental Frequency of the Voice,” J. Acoustical Soc. of Am., vol. 62, no. S1, pp. 25-26, 1977.
[23] K.R. Scherer and U. Scherer, “Speech Behavior and Personality,” Speech Evaluation in Psychiatry, pp. 115-135, Grune & Stratton, 1981.
[24] B.L. Smith, B.L. Brown, W.J. Strong, and A.C. Rencher, “Effect of Speech Rate on Personality Perception,” J. Language and Speech, vol. 18, pp. 146-152, 1975.
[25] B.L. Brown, H. Giles, and J.N. Thakerar, “Speaker Evaluation as a Function of Speech Rate, Accent and Context,” J. Language and Comm., vol. 5, no. 3, pp. 207-220, 1985.
[26] R.L. Street and B.L. Brady, “Speech Rate Acceptance Ranges as a Function of Evaluative Domain, Listener Speech Rate and Communication Context,” J. Comm. Monographs, vol. 49, pp. 290-308, 1982.
[27] M.A. Stewart, B.L. Brown, and S. Stewart, “A Comparison of Computer Manipulated Speech Rate with Subjectively Manipulated Speech Rate in Effects upon Personality Attributions,” unpublished manuscript, 1984.
[28] G.E. Moore, “Personality Traits and Voice Quality Deficiencies,” J. Speech Disorders, vol. 4, pp. 33-36, 1939.
[29] C.F. Diehl, R. White, and K.W. Burk, “Voice Quality and Anxiety,” J. Speech and Hearing Research, vol. 2, pp. 282-285, 1959.
[30] A.W. Siegman and B. Pope, “Effects of Question Specificity and Anxiety Producing Messages on Verbal Fluency in the Initial Interview,” J. Personality and Social Psychology, vol. 2, pp. 522-530, 1965.
[31] K. Scherer, “Personality Markers in Speech,” Social Markers in Speech, pp. 147-209, Cambridge Univ. Press, 1979.
[32] P. Ekman, W. Friesen, M. O'Sullivan, and K. Scherer, “Relative Importance of Face, Body, and Speech in Judgments of Personality and Affect,” J. Personality and Social Psychology, vol. 38, no. 2, pp. 270-277, 1980.
[33] M. Schmitz, A. Krüger, and S. Schmidt, “Modelling Personality in Voices of Talking Products through Prosodic Parameters,” Proc. 12th Int'l Conf. Intelligent User Interfaces, pp. 313-316, 2007.
[34] J. Trouvain, S. Schmidt, M. Schroder, M. Schmitz, and W.J. Barry, “Modeling Personality Features by Changing Prosody in Synthetic Speech,” Proc. Third Int'l Conf. Speech Prosody, 2006.
[35] E. Krahmer, S. Van Buuren, and W. Wesselink, “Audio-Visual Personality Cues for Embodied Agents: An Experimental Evaluation,” Proc. Workshop Embodied Conversational Characters as Individuals, 2003.
[36] A. Tapus, C. Tapus, and M. Mataric, “User-Robot Personality Matching and Assistive Robot Behavior Adaptation for Post-Stroke Rehabilitation Therapy,” Intelligent Service Robotics, vol. 1, no. 2, pp. 169-183, 2008.
[37] F. Mairesse, M.A. Walker, M.R. Mehl, and R.K. Moore, “Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text,” J. Artificial Intelligence Research, vol. 30, pp. 457-500, 2007.
[38] F. Mairesse and M. Walker, “Words Mark the Nerds: Computational Models of Personality Recognition through Language,” Proc. 28th Ann. Conf. Cognitive Science Soc., pp. 543-548, 2006.
[39] G. Mohammadi, A. Vinciarelli, and M. Mortillaro, “The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions,” Proc. Second Int'l Workshop Social Signal Proc., pp. 17-20, 2010.
[40] T. Polzehl, S. Moller, and F. Metze, “Automatically Assessing Personality from Speech,” Proc. Fourth IEEE Int'l Conf. Semantic Computing, pp. 134-140, 2010.
[41] F. Pianesi, N. Mana, and A. Cappelletti, “Multimodal Recognition of Personality Traits in Social Interactions,” Proc. 10th Int'l Conf. Multimodal Interfaces, pp. 53-60, 2008.
[42] D.O. Olguin, P.A. Gloor, and A. Pentland, “Capturing Individual and Group Behavior with Wearable Sensors,” Proc. Assoc. for Advancement of Artificial Intelligence Spring Symp., 2009.
[43] G. Zen, B. Lepri, E. Ricci, and O. Lanz, “Space Speaks: Towards Socially and Personality Aware Visual Surveillance,” Proc. ACM Int'l Workshop Multimodal Pervasive Video Analysis, pp. 37-42, 2010.
[44] J. Biesanz and S. West, “Personality Coherence: Moderating Self-Other Profile Agreement and Profile Consensus,” J. Personality and Social Psychology, vol. 79, no. 3, pp. 425-437, 2000.
[45] R. Hogan and M. Harris-Bond, “Culture and Personality,” The Cambridge Handbook of Personality Psychology, P. Corr and G. Matthews, eds., pp. 577-588, Cambridge Univ. Press, 2009.
[46] P. Boersma, “Praat, a System for Doing Phonetics by Computer,” Glot Int'l, vol. 5, nos. 9/10, pp. 341-345, 2002.
[47] C. Song, Z. Qu, N. Blumm, and A. Barabási, “Limits of Predictability in Human Mobility,” Science, vol. 327, no. 5968, pp. 1018-1020, 2010.
[48] D. Liu and J. Nocedal, “On the Limited Memory BFGS Method for Large Scale Optimization,” Math. Programming, vol. 45, no. 1, pp. 503-528, 1989.
[49] R. Kohavi, “A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection,” Proc. Int'l Joint Conf. Artificial Intelligence, pp. 1137-1145, 1995.
[50] N. Ambady, F. Bernieri, and J. Richeson, “Towards a Histology of Social Behavior: Judgmental Accuracy from Thin Slices of Behavior,” Advances in Experimental Social Psychology, M. Zanna, ed., pp. 201-272, Academic Press, 2000.
[51] N. Ambady and R. Rosenthal, “Thin Slices of Expressive Behavior as Predictors of Interpersonal Consequences: A Meta-Analysis,” Psychological Bull., vol. 111, no. 2, pp. 256-274, 1992.
[52] C. Bishop, Pattern Recognition and Machine Learning. Springer, 2006.
[53] C. Judd, L. James-Hawkins, V. Yzerbyt, and Y. Kashima, “Fundamental Dimensions of Social Judgment: Understanding the Relations between Judgments of Competence and Warmth,” J. Personality and Social Psychology, vol. 89, no. 6, pp. 899-913, 2005.
[54] S. Ketrow, “Attributes of a Telemarketer's Voice Persuasiveness,” J. Direct Marketing, vol. 4, no. 3, pp. 8-21, 1990.
34 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool