{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T16:21:11Z","timestamp":1761582071254,"version":"build-2065373602"},"reference-count":29,"publisher":"Institution of Engineering and Technology (IET)","issue":"1","license":[{"start":{"date-parts":[[2021,3,2]],"date-time":"2021-03-02T00:00:00Z","timestamp":1614643200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":["ietresearch.onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["CAAI Trans on Intel Tech"],"published-print":{"date-parts":[[2022,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Various time\u2010frequency (T\u2010F) masks are being applied to sound source localization tasks. Moreover, deep learning has dramatically advanced T\u2010F mask estimation. However, existing masks are usually designed for speech separation tasks and are suitable only for single\u2010channel signals. A novel complex\u2010valued T\u2010F mask is proposed that reserves the head\u2010related transfer function (HRTF), customized for binaural sound source localization. In addition, because the convolutional neural network that is exploited to estimate the proposed mask takes binaural spectral information as the input and output, accurate binaural cues can be preserved. Compared with conventional T\u2010F masks that emphasize single speech source\u2013dominated T\u2010F units, HRTF\u2010reserved masks eliminate the speech component while keeping the direct propagation path. Thus, the estimated HRTF is capable of extracting more reliable localization features for the final direction of arrival estimation. Hence, binaural sound source localization guided by the proposed T\u2010F mask is robust under noisy and reverberant acoustic environments. The experimental results demonstrate that the new T\u2010F mask is superior to conventional T\u2010F masks and lead to the better performance of sound source localization in adverse environments.<\/jats:p>","DOI":"10.1049\/cit2.12010","type":"journal-article","created":{"date-parts":[[2021,3,3]],"date-time":"2021-03-03T02:58:48Z","timestamp":1614740328000},"page":"26-33","update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Head\u2010related transfer function\u2013reserved time\u2010frequency masking for robust binaural sound source localization"],"prefix":"10.1049","volume":"7","author":[{"given":"Hong","family":"Liu","sequence":"first","affiliation":[{"name":"Key Laboratory of Machine Perception Shenzhen Graduate School Peking University  Shenzhen China"}]},{"given":"Peipei","family":"Yuan","sequence":"additional","affiliation":[{"name":"Key Laboratory of Machine Perception Shenzhen Graduate School Peking University  Shenzhen China"}]},{"given":"Bing","family":"Yang","sequence":"additional","affiliation":[{"name":"Key Laboratory of Machine Perception Shenzhen Graduate School Peking University  Shenzhen China"}]},{"given":"Ge","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence Chongqing University of Technology  Chongqing China"}]},{"given":"Yang","family":"Chen","sequence":"additional","affiliation":[{"name":"Yanka Kupala State University of Grodno  Grodno Belarus"}]}],"member":"265","published-online":{"date-parts":[[2021,3,2]]},"reference":[{"key":"e_1_2_7_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/9780470043387"},{"key":"e_1_2_7_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2009.4959563"},{"key":"e_1_2_7_4_1","first-page":"673","volume-title":"IEEE International Conference on Acoustics, Speech and Signal Processing","author":"Visser E.","year":"2007"},{"key":"e_1_2_7_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2017.2651373"},{"key":"e_1_2_7_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2360646"},{"key":"e_1_2_7_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2016.2625458"},{"key":"e_1_2_7_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2017.2703650"},{"key":"e_1_2_7_9_1","first-page":"861","volume-title":"Subband weighting for binaural speech source localization","author":"Girija Ramesan K.","year":"2018"},{"key":"e_1_2_7_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2012.2183869"},{"key":"e_1_2_7_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2017.2750760"},{"key":"e_1_2_7_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2010.2042128"},{"key":"e_1_2_7_13_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.1791872"},{"key":"e_1_2_7_14_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.2871597"},{"key":"e_1_2_7_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2018.2876169"},{"key":"e_1_2_7_16_1","first-page":"322","volume-title":"Robust tdoa estimation based on time\u2010frequency masking and deep neural networks","author":"Wang Z.Q.","year":"2018"},{"key":"e_1_2_7_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2019.2919378"},{"key":"e_1_2_7_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROBIO49542.2019.8961817"},{"key":"e_1_2_7_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROBIO49542.2019.8961527"},{"key":"e_1_2_7_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2018.2842159"},{"key":"e_1_2_7_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683589"},{"key":"e_1_2_7_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178061"},{"key":"e_1_2_7_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2015.2512042"},{"key":"e_1_2_7_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2018.2874708"},{"key":"e_1_2_7_25_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.1349185"},{"key":"e_1_2_7_26_1","first-page":"1","volume-title":"Adam: a method for stochastic optimization","author":"Kingma D.","year":"2014"},{"key":"e_1_2_7_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASPAA.2001.969552"},{"key":"e_1_2_7_28_1","doi-asserted-by":"publisher","DOI":"10.6028\/NIST.IR.4930"},{"key":"e_1_2_7_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-6393(93)90095-3"},{"key":"e_1_2_7_30_1","first-page":"48","article-title":"A MATLAB simulation of shoebox room acoustics for use in research and teaching","volume":"3","author":"Campbell D.","year":"2005","journal-title":"Comput. Inf. Syst. J."}],"container-title":["CAAI Transactions on Intelligence Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/cit2.12010","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/full-xml\/10.1049\/cit2.12010","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/pdf\/10.1049\/cit2.12010","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T09:23:46Z","timestamp":1761557026000},"score":1,"resource":{"primary":{"URL":"https:\/\/ietresearch.onlinelibrary.wiley.com\/doi\/10.1049\/cit2.12010"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,2]]},"references-count":29,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,3]]}},"alternative-id":["10.1049\/cit2.12010"],"URL":"https:\/\/doi.org\/10.1049\/cit2.12010","archive":["Portico"],"relation":{},"ISSN":["2468-6557","2468-2322"],"issn-type":[{"type":"print","value":"2468-6557"},{"type":"electronic","value":"2468-2322"}],"subject":[],"published":{"date-parts":[[2021,3,2]]},"assertion":[{"value":"2020-09-30","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-12-25","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-02","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}