{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T20:12:42Z","timestamp":1776888762356,"version":"3.51.2"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2019,12,11]],"date-time":"2019-12-11T00:00:00Z","timestamp":1576022400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Danish Innovation Fund"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Hum.-Robot Interact."],"published-print":{"date-parts":[[2020,3,31]]},"abstract":"<jats:p>In this article, we address to what extent the proverb \u201cthe sound makes the music\u201d also applies to human-robot interaction, and whether robots could profit from using speech characteristics similar to those used by charismatic speakers like Steve Jobs. In three empirical studies, we investigate the effects of using Steve Jobs\u2019 and Mark Zuckerberg's speech characteristics during the generation of robot speech on the robot's persuasiveness and its impressionistic evaluation. The three studies address different human-robot interaction situations, which range from online questionnaires to real-time interactions with a large service robot, yet all involve both behavioral measures and users\u2019 assessments. The results clearly show that robots can profit from using charismatic speech.<\/jats:p>","DOI":"10.1145\/3344274","type":"journal-article","created":{"date-parts":[[2019,12,11]],"date-time":"2019-12-11T13:27:18Z","timestamp":1576070838000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":25,"title":["Speech Melody Matters\u2014How Robots Profit from Using Charismatic Speech"],"prefix":"10.1145","volume":"9","author":[{"given":"Kerstin","family":"Fischer","sequence":"first","affiliation":[{"name":"University of Southern, Sonderborg, Denmark"}]},{"given":"Oliver","family":"Niebuhr","sequence":"additional","affiliation":[{"name":"University of Southern Denmark, Mads Clausens Institute"}]},{"given":"Lars C.","family":"Jensen","sequence":"additional","affiliation":[{"name":"University of Southern Denmark"}]},{"given":"Leon","family":"Bodenhagen","sequence":"additional","affiliation":[{"name":"University of Southern Denmark, Maersk-McKinney Moller Institute"}]}],"member":"320","published-online":{"date-parts":[[2019,12,11]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 8th ACM\/IEEE International Conference on Human Robot Interaction (HRI\u201913)","author":"Andrist S.","unstructured":"S. Andrist , E. Spannan , and B. Mutlu . 2013. Rhetorical robots: Making robots more effective speakers using linguistic cues of expertise . In Proceedings of the 8th ACM\/IEEE International Conference on Human Robot Interaction (HRI\u201913) . IEEE Press, Piscataway, NJ. 341--348. S. Andrist, E. Spannan, and B. Mutlu. 2013. Rhetorical robots: Making robots more effective speakers using linguistic cues of expertise. In Proceedings of the 8th ACM\/IEEE International Conference on Human Robot Interaction (HRI\u201913). IEEE Press, Piscataway, NJ. 341--348."},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the 10th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201915)","author":"Andrist S.","unstructured":"S. Andrist , M. Ziadee , H. Boukaram , B. Mutlu , and M. Sakr . 2015. Effects of culture on the credibility of robot speech: A comparison between English and Arabic . In Proceedings of the 10th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201915) . ACM. New York, NY. 157--164. S. Andrist, M. Ziadee, H. Boukaram, B. Mutlu, and M. Sakr. 2015. Effects of culture on the credibility of robot speech: A comparison between English and Arabic. In Proceedings of the 10th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201915). ACM. New York, NY. 157--164."},{"key":"e_1_2_1_3_1","article-title":"The benefit of interactions with physically present robots over video-displayed agents. Int","author":"Bainbridge W. A.","year":"2010","unstructured":"W. A. Bainbridge , J. W. Hart , E. S. Kim , and B. Scasselati . 2010 . The benefit of interactions with physically present robots over video-displayed agents. Int . J. Soc. Rob. 1--2. W. A. Bainbridge, J. W. Hart, E. S. Kim, and B. Scasselati. 2010. The benefit of interactions with physically present robots over video-displayed agents. Int. J. Soc. Rob. 1--2.","journal-title":"J. Soc. Rob. 1--2."},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the 43rd Meeting of the German Acoustical Society (DAGA\u201917)","author":"Berger S.","unstructured":"S. Berger , O. Niebuhr , and B. Peters . 2017. Winning over an audience\u2014A perception-based analysis of prosodic features of charismatic speech . In Proceedings of the 43rd Meeting of the German Acoustical Society (DAGA\u201917) . 1--4. S. Berger, O. Niebuhr, and B. Peters. 2017. Winning over an audience\u2014A perception-based analysis of prosodic features of charismatic speech. In Proceedings of the 43rd Meeting of the German Acoustical Society (DAGA\u201917). 1--4."},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the Conference of the International Speech Communication Association (INTERSPEECH\u201915)","author":"Betz Simon","year":"2015","unstructured":"Simon Betz , Petra Wagner , and David Schlangen . 2015 . Micro-structure of disfluencies: Basics for conversational speech synthesis . In Proceedings of the Conference of the International Speech Communication Association (INTERSPEECH\u201915) . Simon Betz, Petra Wagner, and David Schlangen. 2015. Micro-structure of disfluencies: Basics for conversational speech synthesis. In Proceedings of the Conference of the International Speech Communication Association (INTERSPEECH\u201915)."},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the 8th Conference of the International Speech Communication Association (INTERSPEECH\u201907)","author":"Biadsy F.","unstructured":"F. Biadsy , J. Hirschberg , A. Rosenberg , and W. Dakka . 2007. Comparing American and Palestinian perceptions of charisma using acoustic-prosodic and lexical analysis . In Proceedings of the 8th Conference of the International Speech Communication Association (INTERSPEECH\u201907) 2221--2224. F. Biadsy, J. Hirschberg, A. Rosenberg, and W. Dakka. 2007. Comparing American and Palestinian perceptions of charisma using acoustic-prosodic and lexical analysis. In Proceedings of the 8th Conference of the International Speech Communication Association (INTERSPEECH\u201907) 2221--2224."},{"key":"e_1_2_1_7_1","first-page":"341","article-title":"Praat: A system for doing phonetics by computer","volume":"4","author":"Boersma P.","year":"2001","unstructured":"P. Boersma . 2001 . Praat: A system for doing phonetics by computer . Glot Int. 4 , 341 -- 345 . P. Boersma. 2001. Praat: A system for doing phonetics by computer. Glot Int. 4, 341--345.","journal-title":"Glot Int."},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the 15th International Congress of Phonetic Sciences. 2417--2420","author":"Campbell N.","unstructured":"N. Campbell and P. Mokhtari . 2003. Voice quality\u2014The 4th prosodic dimension . In Proceedings of the 15th International Congress of Phonetic Sciences. 2417--2420 . N. Campbell and P. Mokhtari. 2003. Voice quality\u2014The 4th prosodic dimension. In Proceedings of the 15th International Congress of Phonetic Sciences. 2417--2420."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2157689.2157798"},{"key":"e_1_2_1_10_1","volume-title":"Wired for Speech. How Voice Activates and Advances the Human-Computer Relationship","author":"Clifford Nass","unstructured":"Nass Clifford and Brave Scott . 2015. Wired for Speech. How Voice Activates and Advances the Human-Computer Relationship . The MIT Press , Cambridge, MA . Nass Clifford and Brave Scott. 2015. Wired for Speech. How Voice Activates and Advances the Human-Computer Relationship. The MIT Press, Cambridge, MA."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-015-0329-4"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"V. Dellwo M. Huckvale and M. Ashby. 2007. How is individuality expressed in voice? An introduction to speech production and description for speaker classification. In Speaker Classification I C. M\u00fcller (Ed.). Springer New York 1--20.  V. Dellwo M. Huckvale and M. Ashby. 2007. How is individuality expressed in voice? An introduction to speech production and description for speaker classification. In Speaker Classification I C. M\u00fcller (Ed.). Springer New York 1--20.","DOI":"10.1007\/978-3-540-74200-5_1"},{"key":"e_1_2_1_13_1","volume-title":"An Introduction to Text-to-Speech Synthesis","author":"Dutoit T.","unstructured":"T. Dutoit . 2013. An Introduction to Text-to-Speech Synthesis . Kluwer , Dordrecht . T. Dutoit. 2013. An Introduction to Text-to-Speech Synthesis. Kluwer, Dordrecht."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.2466\/pms.1996.83.1.243"},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the 12th IEEE International Workshop on Robot and Human Interactive Communication (ROMAN\u201903)","author":"Goetz J.","unstructured":"J. Goetz , S. Kiesler , and A. Powers . 2003. Matching robot appearance and behavior to tasks to improve human-robot cooperation . In Proceedings of the 12th IEEE International Workshop on Robot and Human Interactive Communication (ROMAN\u201903) . IEEE, Los Alamitos, CA, 55--60. J. Goetz, S. Kiesler, and A. Powers. 2003. Matching robot appearance and behavior to tasks to improve human-robot cooperation. In Proceedings of the 12th IEEE International Workshop on Robot and Human Interactive Communication (ROMAN\u201903). IEEE, Los Alamitos, CA, 55--60."},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the IEEE Workshop on Advanced Robotics and Its Social Impacts (ARSO\u201909)","author":"Graf B.","unstructured":"B. Graf , U. Reiser , M. Hagele , J. Mauz , and P. Klein . 2009. Robotic home assistant Care-O-Bot 3-product vision and innovation platform . In Proceedings of the IEEE Workshop on Advanced Robotics and Its Social Impacts (ARSO\u201909) . 139--144. B. Graf, U. Reiser, M. Hagele, J. Mauz, and P. Klein. 2009. Robotic home assistant Care-O-Bot 3-product vision and innovation platform. In Proceedings of the IEEE Workshop on Advanced Robotics and Its Social Impacts (ARSO\u201909). 139--144."},{"key":"e_1_2_1_17_1","volume-title":"Eye and brain\u2014The Psychology of Seeing","author":"Gregory R. L.","unstructured":"R. L. Gregory . 1997. Eye and brain\u2014The Psychology of Seeing . Oxford University Press , Oxford, UK . R. L. Gregory. 1997. Eye and brain\u2014The Psychology of Seeing. Oxford University Press, Oxford, UK."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-47665-0_5"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1177\/1059601114525436"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ARSO.2012.6213397"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.17485\/ijst\/2015\/v8iS5\/61476"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2008.921566"},{"key":"e_1_2_1_23_1","volume-title":"Simultaneous Structure in Phonology","author":"Ladd D. R.","unstructured":"D. R. Ladd . 2014. Simultaneous Structure in Phonology . Oxford University Press , Oxford, UK . D. R. Ladd. 2014. Simultaneous Structure in Phonology. Oxford University Press, Oxford, UK."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1080\/02699931.2013.837378"},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 4th International Symposium on Tonal Aspects of Languages (TAL\u201914)","author":"Landgraf R.","year":"2014","unstructured":"R. Landgraf . 2014 . Are you serious? Irony and the perception of emphatic intensification . In Proceedings of the 4th International Symposium on Tonal Aspects of Languages (TAL\u201914) . 91--94. R. Landgraf. 2014. Are you serious? Irony and the perception of emphatic intensification. In Proceedings of the 4th International Symposium on Tonal Aspects of Languages (TAL\u201914). 91--94."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 6th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201911)","author":"Leyzberg D.","unstructured":"D. Leyzberg , E. Avrunin , J. Liu , and B. Scassellati . 2011. Robots that express emotion elicit better human teaching . In Proceedings of the 6th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201911) . 347--354. D. Leyzberg, E. Avrunin, J. Liu, and B. Scassellati. 2011. Robots that express emotion elicit better human teaching. In Proceedings of the 6th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201911). 347--354."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2015.03.008"},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the IEEE\/ACM International Conference on Human-Robot Interaction (HRI\u201917)","author":"Malte Jung","year":"2017","unstructured":"Jung Malte . 2017 . Affective grounding in human-robot interaction . In Proceedings of the IEEE\/ACM International Conference on Human-Robot Interaction (HRI\u201917) . Jung Malte. 2017. Affective grounding in human-robot interaction. In Proceedings of the IEEE\/ACM International Conference on Human-Robot Interaction (HRI\u201917)."},{"key":"e_1_2_1_29_1","unstructured":"R. Mannell. 2017. Phonetics and phonology. Introduction to Prosody: Theories and Models. Retrieved from http:\/\/clas.mq.edu.au\/speech\/phonetics\/phonology\/intonation\/index.html.  R. Mannell. 2017. Phonetics and phonology. Introduction to Prosody: Theories and Models. Retrieved from http:\/\/clas.mq.edu.au\/speech\/phonetics\/phonology\/intonation\/index.html."},{"key":"e_1_2_1_30_1","volume-title":"An Introduction to the Psychology of Hearing","author":"Moore B. J.","unstructured":"B. J. Moore . 2013. An Introduction to the Psychology of Hearing . Brill , Leiden . B. J. Moore. 2013. An Introduction to the Psychology of Hearing. Brill, Leiden."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1609\/aimag.v32i4.2376"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.20855\/jav.2019.24.21531"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 8th International Conference of Speech Prosody. 1--2.","author":"Niebuhr O.","unstructured":"O. Niebuhr , A. Brem , and E. Nowak-T\u00f3t . 2016b. Prosodic constructions of charisma in business speeches\u2014A contrastive acoustic analysis of Steve Jobs and Mark Zuckerberg . In Proceedings of the 8th International Conference of Speech Prosody. 1--2. O. Niebuhr, A. Brem, and E. Nowak-T\u00f3t. 2016b. Prosodic constructions of charisma in business speeches\u2014A contrastive acoustic analysis of Steve Jobs and Mark Zuckerberg. In Proceedings of the 8th International Conference of Speech Prosody. 1--2."},{"key":"e_1_2_1_34_1","doi-asserted-by":"crossref","unstructured":"O. Niebuhr H. Reetz J. Barnes and A. Yu. 2019. Fundamental aspects in the perception of f0. In The Handbook of Prosody. C. Gussenhoven A. Chen (Eds.) Oxford University Press Oxford UK.  O. Niebuhr H. Reetz J. Barnes and A. Yu. 2019. Fundamental aspects in the perception of f0. In The Handbook of Prosody. C. Gussenhoven A. Chen (Eds.) Oxford University Press Oxford UK.","DOI":"10.1093\/oxfordhb\/9780198832232.013.3"},{"key":"e_1_2_1_35_1","volume-title":"What makes a charismatic speaker? A computer-based acoustic prosodic analysis of Steve Jobs","author":"Niebuhr O.","unstructured":"O. Niebuhr , J. Vo\u00dfe , and Am Brem . 2016a. What makes a charismatic speaker? A computer-based acoustic prosodic analysis of Steve Jobs \u2019 tone of voice. Comput. Hum. Behav . 64 366--382. O. Niebuhr, J. Vo\u00dfe, and Am Brem. 2016a. What makes a charismatic speaker? A computer-based acoustic prosodic analysis of Steve Jobs\u2019 tone of voice. Comput. Hum. Behav. 64 366--382."},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of the 9th International Conference of Speech Prosody. 359--364","author":"Niebuhr O.","unstructured":"O. Niebuhr , R. Skarnitzl , and L. Tyle\u010dkov\u00e1 . 2018. The acoustic fingerprint of a charismatic voice\u2014Initial evidence from correlations between long-term spectral features and listener ratings . In Proceedings of the 9th International Conference of Speech Prosody. 359--364 . O. Niebuhr, R. Skarnitzl, and L. Tyle\u010dkov\u00e1. 2018. The acoustic fingerprint of a charismatic voice\u2014Initial evidence from correlations between long-term spectral features and listener ratings. In Proceedings of the 9th International Conference of Speech Prosody. 359--364."},{"key":"e_1_2_1_37_1","doi-asserted-by":"crossref","first-page":"3","DOI":"10.20396\/joss.v6i1.14983","article-title":"Advancing research and practice in entrepreneurship through speech analysis\u2014From descriptive rhetorical terms to phonetically informed acoustic charisma metrics","volume":"6","author":"Niebuhr O.","year":"2017","unstructured":"O. Niebuhr , S. Tegtmeier , and A. Brem . 2017 . Advancing research and practice in entrepreneurship through speech analysis\u2014From descriptive rhetorical terms to phonetically informed acoustic charisma metrics . J. Speech Sci. 6 , 3 -- 26 . O. Niebuhr, S. Tegtmeier, and A. Brem. 2017. Advancing research and practice in entrepreneurship through speech analysis\u2014From descriptive rhetorical terms to phonetically informed acoustic charisma metrics. J. Speech Sci. 6, 3--26.","journal-title":"J. Speech Sci."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-28"},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the 10th Twente Student Conference. 1--7.","author":"Nienhuis M.","year":"2009","unstructured":"M. Nienhuis . 2009 . Prosodic correlates of rhetorical appeal: Voice wave analysis of ethos, pathos, and logos . In Proceedings of the 10th Twente Student Conference. 1--7. M. Nienhuis. 2009. Prosodic correlates of rhetorical appeal: Voice wave analysis of ethos, pathos, and logos. In Proceedings of the 10th Twente Student Conference. 1--7."},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN\u201912)","author":"Nishio S.","unstructured":"S. Nishio , K. Ogawa , Y. Kanakogi , S. Itakura , and H. Ishiguro . 2012. Do robot appearance and speech affect people's attitude? Evaluation through the ultimatum game . In Proceedings of the IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN\u201912) . S. Nishio, K. Ogawa, Y. Kanakogi, S. Itakura, and H. Ishiguro. 2012. Do robot appearance and speech affect people's attitude? Evaluation through the ultimatum game. In Proceedings of the IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN\u201912)."},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the 18th Conference of the International Speech Communication Association (INTERSPEECH\u201917)","author":"Nov\u00e1k-T\u00f3t E.","unstructured":"E. Nov\u00e1k-T\u00f3t , O. Niebuhr , and A. Chen . 2017. A gender bias in the acoustic-melodic features of charismatic speech? In Proceedings of the 18th Conference of the International Speech Communication Association (INTERSPEECH\u201917) . 1--5. E. Nov\u00e1k-T\u00f3t, O. Niebuhr, and A. Chen. 2017. A gender bias in the acoustic-melodic features of charismatic speech? In Proceedings of the 18th Conference of the International Speech Communication Association (INTERSPEECH\u201917). 1--5."},{"key":"e_1_2_1_42_1","unstructured":"A. Pej\u010di\u0107. 2014. Intonational characteristics of persuasiveness in English and Serbian. Nov. Cahiers Linguis. Francaise 31 141--151.  A. Pej\u010di\u0107. 2014. Intonational characteristics of persuasiveness in English and Serbian. Nov. Cahiers Linguis. Francaise 31 141--151."},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the Human-Robot Interaction Conference (HRI\u201906)","author":"Powers A.","unstructured":"A. Powers and S. Kiesler . 2006. The advisor robot: Tracing people's mental model from a robot's physical attributes . In Proceedings of the Human-Robot Interaction Conference (HRI\u201906) . A. Powers and S. Kiesler. 2006. The advisor robot: Tracing people's mental model from a robot's physical attributes. In Proceedings of the Human-Robot Interaction Conference (HRI\u201906)."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/1877826.1877843"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2559636.2559680"},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the 6th Conference of the International Speech Communication Association (INTERSPEECH). 513--516","author":"Rosenberg A.","unstructured":"A. Rosenberg and J. Hirschberg . 2005. Acoustic\/prosodic and lexical correlates of charismatic speech . In Proceedings of the 6th Conference of the International Speech Communication Association (INTERSPEECH). 513--516 . A. Rosenberg and J. Hirschberg. 2005. Acoustic\/prosodic and lexical correlates of charismatic speech. In Proceedings of the 6th Conference of the International Speech Communication Association (INTERSPEECH). 513--516."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2008.11.001"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/2744206"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2014.08.003"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1016\/0378-2166(94)90116-3"},{"key":"e_1_2_1_51_1","volume-title":"Proceedings of the 14th Conference of the International Speech Communication Association (INTERSPEECH\u201913)","author":"Signorello R.","unstructured":"R. Signorello and D. Demolin . 2013. The physiological use of the charismatic voice in political speech . In Proceedings of the 14th Conference of the International Speech Communication Association (INTERSPEECH\u201913) . 987--991. R. Signorello and D. Demolin. 2013. The physiological use of the charismatic voice in political speech. In Proceedings of the 14th Conference of the International Speech Communication Association (INTERSPEECH\u201913). 987--991."},{"key":"e_1_2_1_52_1","first-page":"09047","article-title":"Towards end-to-end prosody transfer for expressive speech synthesis with Tacotron","volume":"1803","author":"Skerry-Ryan R. J.","year":"2018","unstructured":"R. J. Skerry-Ryan , E. Battenberg , Y. Xiao , Y. Wang , D. Stanton , J. Shor , and R. A. Saurous . 2018 . Towards end-to-end prosody transfer for expressive speech synthesis with Tacotron . ArXiv Preprint Arxiv : 1803 . 09047 . R. J. Skerry-Ryan, E. Battenberg, Y. Xiao, Y. Wang, D. Stanton, J. Shor, and R. A. Saurous. 2018. Towards end-to-end prosody transfer for expressive speech synthesis with Tacotron. ArXiv Preprint Arxiv:1803.09047.","journal-title":"ArXiv Preprint Arxiv"},{"key":"e_1_2_1_53_1","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1016\/j.ijhcs.2005.07.002","article-title":"Persuasion and social perception of human vs. synthetic voice across person as source and computer as source conditions","volume":"64","author":"Stern S. E.","year":"2006","unstructured":"S. E. Stern , J. W. Mullennix , and I. Yaroslavsky . 2006 . Persuasion and social perception of human vs. synthetic voice across person as source and computer as source conditions . Int. J. Hum.-Comput. Interact. 64 , 43 -- 52 . S. E. Stern, J. W. Mullennix, and I. Yaroslavsky. 2006. Persuasion and social perception of human vs. synthetic voice across person as source and computer as source conditions. Int. J. Hum.-Comput. Interact. 64, 43--52.","journal-title":"Int. J. Hum.-Comput. Interact."},{"key":"e_1_2_1_55_1","volume-title":"Proceedings of the IEEE\/ACM International Conference on Human-Robot Interaction (HRI\u201914)","author":"Strait M.","unstructured":"M. Strait , C. Canning , and M. Scheutz . 2014. Let me tell you! Investigating the effects of robot communication strategies in advice-giving situations based on robot appearance, interaction modality, and distance . In Proceedings of the IEEE\/ACM International Conference on Human-Robot Interaction (HRI\u201914) M. Strait, C. Canning, and M. Scheutz. 2014. Let me tell you! Investigating the effects of robot communication strategies in advice-giving situations based on robot appearance, interaction modality, and distance. In Proceedings of the IEEE\/ACM International Conference on Human-Robot Interaction (HRI\u201914)"},{"key":"e_1_2_1_56_1","volume-title":"Text-to-speech Synthesis","author":"Taylor P.","unstructured":"P. Taylor . 2009. Text-to-speech Synthesis . Cambridge University Press , Cambridge, UK . P. Taylor. 2009. Text-to-speech Synthesis. Cambridge University Press, Cambridge, UK."},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.5555\/973927.973930"},{"key":"e_1_2_1_58_1","volume-title":"Proceedings of the 9th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201914)","author":"Tielman M.","unstructured":"M. Tielman , M. Neerincx , J. J. Meyer , and R. Looije . 2014. Adaptive emotional expression in robot-child interaction . In Proceedings of the 9th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201914) . ACM, New York, 407--414. M. Tielman, M. Neerincx, J. J. Meyer, and R. Looije. 2014. Adaptive emotional expression in robot-child interaction. In Proceedings of the 9th ACM\/IEEE International Conference on Human-Robot Interaction (HRI\u201914). ACM, New York, 407--414."},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2008.4600750"}],"container-title":["ACM Transactions on Human-Robot Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3344274","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3344274","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:44:24Z","timestamp":1750203864000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3344274"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,11]]},"references-count":58,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,3,31]]}},"alternative-id":["10.1145\/3344274"],"URL":"https:\/\/doi.org\/10.1145\/3344274","relation":{},"ISSN":["2573-9522"],"issn-type":[{"value":"2573-9522","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12,11]]},"assertion":[{"value":"2018-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-12-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}