{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T17:09:12Z","timestamp":1776100152554,"version":"3.50.1"},"reference-count":27,"publisher":"Institution of Engineering and Technology (IET)","issue":"2","license":[{"start":{"date-parts":[[2021,3,10]],"date-time":"2021-03-10T00:00:00Z","timestamp":1615334400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":["ietresearch.onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["CAAI Trans on Intel Tech"],"published-print":{"date-parts":[[2021,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>In robot binaural sound source localization (SSL), locating the direction of the sound source accurately in the shortest time is important. It refers to the algorithm complexity, but even more to the shortest duration of the required signal. A novel binaural SSL method based on feature and frequency weighting is proposed. More specifically, in the training stage, the direction\u2010related interaural cross\u2010correlation function (CCF) and interaural intensity difference (IID) in each frequency band are calculated under noiseless conditions, which are considered the templates. In the testing stage, first, the cosine similarities between the CCF and IID of the test signal and templates are calculated in all features and frequency bands. Then, the direction likelihood can be obtained by weighting the similarities. Finally, the direction with maximum likelihood is specified as the direction of the sound source. 
Experiments were carried out on CIPIC dataset subject 003 with different noises in the noisex\u201092 dataset and demonstrated that the method can accurately locate the sound source with a short signal duration.<\/jats:p>","DOI":"10.1049\/cit2.12009","type":"journal-article","created":{"date-parts":[[2021,3,10]],"date-time":"2021-03-10T12:04:35Z","timestamp":1615377875000},"page":"214-223","update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Binaural sound source localization based on weighted template matching"],"prefix":"10.1049","volume":"6","author":[{"given":"Hong","family":"Liu","sequence":"first","affiliation":[{"name":"Key Laboratory of Machine Perception Shenzhen Graduate School Peking University  Shenzhen China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0103-0215","authenticated-orcid":false,"given":"Yongheng","family":"Sun","sequence":"additional","affiliation":[{"name":"Key Laboratory of Machine Perception Shenzhen Graduate School Peking University  Shenzhen China"}]},{"given":"Ge","family":"Yang","sequence":"additional","affiliation":[{"name":"College of Liangjiang Artificial Intelligence Chongqing University of Technology  Chongqing China"}]},{"given":"Yang","family":"Chen","sequence":"additional","affiliation":[{"name":"Yanka Kupala State University of Grodno  Grodno Belarus"}]}],"member":"265","published-online":{"date-parts":[[2021,3,10]]},"reference":[{"key":"e_1_2_6_2_1","doi-asserted-by":"publisher","DOI":"10.1006\/csla.1994.1016"},{"key":"e_1_2_6_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6853832"},{"key":"e_1_2_6_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2015.03.003"},{"key":"e_1_2_6_5_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2018-1269"},{"issue":"8","key":"e_1_2_6_6_1","first-page":"1241","article-title":"Multiple sound source counting and localization based on TF\u2010wise spatial spectrum 
clustering","volume":"27","author":"Yang B.","year":"2019","journal-title":"IEEE Trans. Acoust. Speech. Signal. Process."},{"key":"e_1_2_6_7_1","first-page":"986","article-title":"A probabilistic model for binaural sound localization","volume":"36","author":"Willert V.","year":"2006","journal-title":"IEEE Trans. Syst. Man Cybern. Syst."},{"key":"e_1_2_6_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1976.1162830"},{"key":"e_1_2_6_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2012.2183869"},{"key":"e_1_2_6_10_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-954"},{"key":"e_1_2_6_11_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.414456"},{"key":"e_1_2_6_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2007.366996"},{"key":"e_1_2_6_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2006.281758"},{"key":"e_1_2_6_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2015.2447496"},{"key":"e_1_2_6_15_1","doi-asserted-by":"crossref","unstructured":"Liu H. Fu Z. Li X. A two\u2010layer probabilistic model based on time\u2010delay compensation for binaural sound localization. In:IEEE International Conference on Robotics and Automation (ICRA) pp.2705\u20132712(2013)","DOI":"10.1109\/ICRA.2013.6630949"},{"key":"e_1_2_6_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2010.2052156"},{"key":"e_1_2_6_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2017.2703650"},{"key":"e_1_2_6_18_1","unstructured":"Pang C. Zhang J. Liu H.:Direction of arrival estimation based on reverberation weighting and noise error estimator. 
In:Proceedings of INTERSPEECH.25 1618\u20131632(2017)"},{"key":"e_1_2_6_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2019.2919378"},{"key":"e_1_2_6_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2010.2042128"},{"key":"e_1_2_6_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178457"},{"key":"e_1_2_6_22_1","first-page":"861","volume-title":"Proceedings of INTERSPEECH","author":"Karthik G.R.","year":"2018"},{"key":"e_1_2_6_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2018.2855960"},{"key":"e_1_2_6_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2017.2750760"},{"key":"e_1_2_6_25_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.381498"},{"key":"e_1_2_6_26_1","first-page":"99","volume-title":"\u2018The CIPIC HRTF database\u2019, Workshop on Applications of Signal Processing to Audio and Acoustics","author":"Algazi V.R.","year":"2001"},{"key":"e_1_2_6_27_1","volume-title":"TIMIT Acoustic\u2010Phonetic Continuous speech Corpus","author":"Zue V.","year":"1993"},{"key":"e_1_2_6_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-6393(93)90095-3"}],"container-title":["CAAI Transactions on Intelligence 
Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/cit2.12009","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/full-xml\/10.1049\/cit2.12009","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/cit2.12009","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T12:47:52Z","timestamp":1761655672000},"score":1,"resource":{"primary":{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/10.1049\/cit2.12009"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,10]]},"references-count":27,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2021,6]]}},"alternative-id":["10.1049\/cit2.12009"],"URL":"https:\/\/doi.org\/10.1049\/cit2.12009","archive":["Portico"],"relation":{},"ISSN":["2468-6557","2468-2322"],"issn-type":[{"value":"2468-6557","type":"print"},{"value":"2468-2322","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,10]]},"assertion":[{"value":"2020-09-30","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-12-21","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-10","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}