{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T10:09:38Z","timestamp":1753870178651,"version":"3.41.2"},"reference-count":63,"publisher":"Wiley","issue":"3","license":[{"start":{"date-parts":[[2024,6,5]],"date-time":"2024-06-05T00:00:00Z","timestamp":1717545600000},"content-version":"vor","delay-in-days":4,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Statistical Analysis"],"published-print":{"date-parts":[[2024,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Nowadays data streams from different sources, like blockchain\u2010based and traditional financial transactions, social networks, and interconnected Internet of Things (IoT) devices, are becoming increasingly large in volume and the need to recognize patterns in real time from these streams, while adapting to their velocity and veracity, is emerging. Established machine learning algorithms used for pattern recognition methods have not been designed taking under account the volume, velocity, diversity, and accuracy of the data streams. This research contributes with an approach for assessing the pattern recognition capabilities of established machine learning algorithms when handling volatile data in real time and proposes a system that adapts the algorithms to the requirements of data streams, as well as assesses their pattern recognition capabilities based on established criteria. The system was applied for assessing five machine learning algorithms with input from a data stream from Bluetooth beacons tracking consumers in a retail store. This research can support future data scientists and analysts who need to reveal data patterns in big, volatile data streams in real time in order to support effective decision\u2010making in the respective application domain. Copyright \u00a9 2024 John Wiley &amp; Sons, Ltd.<\/jats:p>","DOI":"10.1002\/sam.11701","type":"journal-article","created":{"date-parts":[[2024,6,5]],"date-time":"2024-06-05T10:50:50Z","timestamp":1717584650000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Assessment of the real\u2010time pattern recognition capability of machine learning algorithms"],"prefix":"10.1002","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3365-6175","authenticated-orcid":false,"given":"Elias","family":"Polytarchos","sequence":"first","affiliation":[{"name":"Department of Management Science &amp; Technology Athens University of Economics &amp; Business  Athens Greece"}]},{"given":"Cleopatra","family":"Bardaki","sequence":"additional","affiliation":[{"name":"Department of Informatics &amp; Telematics Harokopio University  Tavros Greece"}]},{"given":"Katerina","family":"Pramatari","sequence":"additional","affiliation":[{"name":"Department of Management Science &amp; Technology Athens University of Economics &amp; Business  Athens Greece"}]}],"member":"311","published-online":{"date-parts":[[2024,6,5]]},"reference":[{"key":"e_1_2_8_2_1","unstructured":"M.Abadi P.Barham J.Chen Z.Chen A.Davis J.Dean M.Devin S.Ghemawat G.Irving M.Isard M.Kudlur J.Levenberg R.Monga S.Moore D. G.Murray B.Steiner P.Tucker V.Vasudevan P.Warden M.Wicke Y.Yu andX.Zheng.Tensorflow: A system for large\u2010scale machine learning.2016."},{"key":"e_1_2_8_3_1","doi-asserted-by":"crossref","unstructured":"C. C.Aggarwal J.Han J.Wang andP. S.Yu.A framework for clustering evolving data streams.2003.","DOI":"10.1145\/872757.872826"},{"key":"e_1_2_8_4_1","doi-asserted-by":"crossref","unstructured":"M.Alamuri B. R.Surampudi andA.Negi.A survey of distance\/similarity measures for categorical data.2014.","DOI":"10.1109\/IJCNN.2014.6889941"},{"volume-title":"Introduction to machine learning","year":"2020","author":"Alpaydin E.","key":"e_1_2_8_5_1"},{"key":"e_1_2_8_6_1","first-page":"1399","article-title":"Online learning of complex prediction problems using simultaneous projections","volume":"9","author":"Amit Y.","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_2_8_7_1","doi-asserted-by":"publisher","DOI":"10.1006\/jcss.2001.1795"},{"volume-title":"A survey of clustering data mining techniques, grouping multidimensional data","year":"1998","author":"Berkhin P.","key":"e_1_2_8_8_1"},{"volume-title":"Bayesian theory","year":"2000","author":"Bernardo J.","key":"e_1_2_8_9_1"},{"volume-title":"Pattern recognition and machine learning","year":"2011","author":"Bishop C. M.","key":"e_1_2_8_10_1"},{"key":"e_1_2_8_11_1","doi-asserted-by":"crossref","unstructured":"F.Cao M.Ester W.Qian andA.Zhou.Density\u2010based clustering over an evolving data stream with noise.2006.","DOI":"10.1137\/1.9781611972764.29"},{"key":"e_1_2_8_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-88-7642-642-1"},{"key":"e_1_2_8_13_1","first-page":"300","article-title":"Comprehensive survey on distance\/similarity measures between probability density functions","volume":"4","author":"Cha S.","year":"2007","journal-title":"Int. J. Math. Models Methods Appl. Sci."},{"key":"e_1_2_8_14_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-016-0040-9"},{"key":"e_1_2_8_15_1","doi-asserted-by":"publisher","DOI":"10.1214\/aoms\/1177729330"},{"key":"e_1_2_8_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2022.3206330"},{"key":"e_1_2_8_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11831-019-09344-w"},{"key":"e_1_2_8_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCI.2015.2471196"},{"key":"e_1_2_8_19_1","doi-asserted-by":"publisher","DOI":"10.1080\/01969727308546046"},{"key":"e_1_2_8_20_1","unstructured":"E.FixandJ. L.Hodges.Discriminatory analysis nonparametric discrimination: Consistency properties. technical report 41951."},{"key":"e_1_2_8_21_1","doi-asserted-by":"publisher","DOI":"10.1080\/08982112.2011.581039"},{"key":"e_1_2_8_22_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010844028087"},{"key":"e_1_2_8_23_1","doi-asserted-by":"publisher","DOI":"10.1002\/sam.11552"},{"key":"e_1_2_8_24_1","doi-asserted-by":"crossref","unstructured":"S.Guha R.Rastogi andK.Shim.Cure: An efficient clustering algorithm for large databases.1998.","DOI":"10.1145\/276304.276312"},{"key":"e_1_2_8_25_1","doi-asserted-by":"crossref","unstructured":"A.Haque L.Khan M.Baron B.Thuraisingham andC.Aggarwal.Efficient handling of concept drift and concept evolution over stream data.2016IEEE 32nd International Conference on Data Engineering.","DOI":"10.1109\/ICDE.2016.7498264"},{"key":"e_1_2_8_26_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.2006.18.7.1527"},{"key":"e_1_2_8_27_1","doi-asserted-by":"crossref","unstructured":"P.Hore L. O.Hall andD. B.Goldgof.Single pass fuzzy c means.2007.","DOI":"10.1109\/FUZZY.2007.4295372"},{"key":"e_1_2_8_28_1","first-page":"547","article-title":"Etude comparative de la distribution florale dans une portion des alpes et des jura","volume":"37","author":"Jaccard P.","year":"1901","journal-title":"Bull. Soc. Vaudoise Des Sci. Naturell."},{"key":"e_1_2_8_29_1","doi-asserted-by":"publisher","DOI":"10.1006\/inco.1996.2612"},{"key":"e_1_2_8_30_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1017938623079"},{"key":"e_1_2_8_31_1","doi-asserted-by":"crossref","unstructured":"A. F.Krause K.Essig M.Piefke andT.Schack.No\u2010prop\u2010fast\u00a0\u2013\u00a0a high\u2010speed multilayer neural network learning algorithm: Mnist benchmark and eye\u2010tracking data classification.2013.","DOI":"10.1007\/978-3-642-41013-0_46"},{"key":"e_1_2_8_32_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2017.02.004"},{"key":"e_1_2_8_33_1","doi-asserted-by":"crossref","unstructured":"Y.LawandC.Zaniolo.An adaptive nearest neighbor classification algorithm for data streams.2005.","DOI":"10.1007\/11564126_15"},{"volume-title":"Theory of point estimation","year":"1998","author":"Lehmann E. L.","key":"e_1_2_8_34_1"},{"key":"e_1_2_8_35_1","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9781139924801"},{"key":"e_1_2_8_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2017.2696365"},{"key":"e_1_2_8_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2020.2990196"},{"key":"e_1_2_8_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2017.06.084"},{"key":"e_1_2_8_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-018-3353-0"},{"key":"e_1_2_8_40_1","doi-asserted-by":"crossref","unstructured":"M. M.Masud Q.Chen L.Khan C.Aggarwal J.Gao J.Han andB.Thuraisingham.Addressing concept\u2010evolution in concept\u2010drifting data streams.2010IEEE International Conference on Data Mining.","DOI":"10.1109\/ICDM.2010.160"},{"key":"e_1_2_8_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2012.109"},{"key":"e_1_2_8_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2010.61"},{"key":"e_1_2_8_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/272991.272995"},{"key":"e_1_2_8_44_1","doi-asserted-by":"publisher","DOI":"10.1002\/spe.2341"},{"key":"e_1_2_8_45_1","doi-asserted-by":"publisher","DOI":"10.1002\/sam.11451"},{"volume-title":"Online ensemble learning","year":"2001","author":"Oza N. C.","key":"e_1_2_8_46_1"},{"key":"e_1_2_8_47_1","doi-asserted-by":"publisher","DOI":"10.1002\/0471708607"},{"key":"e_1_2_8_48_1","unstructured":"E.Polytarchos C.Bardaki andK.Pramatari.Error prediction and predictive memory maintenance through real time machine learning methods.2023."},{"key":"e_1_2_8_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2016.2618909"},{"volume-title":"Artificial intelligence. A modern approach","year":"2022","author":"Russell S.","key":"e_1_2_8_50_1"},{"key":"e_1_2_8_51_1","doi-asserted-by":"crossref","unstructured":"A.Saffari C.Leistner J.Santner M.Godec andH.Bischof.On\u2010line random forests.2009IEEE ICCV Workshop on on\u2010Line Computer Vision.","DOI":"10.1109\/ICCVW.2009.5457447"},{"key":"e_1_2_8_52_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbusres.2015.08.017"},{"key":"e_1_2_8_53_1","unstructured":"J. A.SilvaandE. R.Hruschka.Extending k\u2010means\u2010based algorithms for evolving data streams with variable number of clusters.2011."},{"key":"e_1_2_8_54_1","doi-asserted-by":"crossref","unstructured":"E. J.Spinosa A.deCarvalho andJ.Gama.Cluster\u2010based novel concept detection in data streams applied to intrusion detection in computer networks.2008.","DOI":"10.1145\/1363686.1363912"},{"key":"e_1_2_8_55_1","doi-asserted-by":"crossref","unstructured":"A.StrehlandJ.Ghosh.Value\u2010based customer grouping from large retail data sets.2000.","DOI":"10.1117\/12.381756"},{"volume-title":"Introduction to data mining","year":"2005","author":"Tan P.","key":"e_1_2_8_56_1"},{"key":"e_1_2_8_57_1","unstructured":"P.Taylor.Volume of data\/information created captured copied and consumed worldwide from 2010 to 2020 with forecasts from 2021 to 2025.https:\/\/www.statista.com\/statistics\/871513\/worldwide\u2010data\u2010created\/2023."},{"key":"e_1_2_8_58_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2016.06.016"},{"volume-title":"Information retrieval","year":"2000","author":"Rijsbergen C. J.","key":"e_1_2_8_59_1"},{"key":"e_1_2_8_60_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2016.05.006"},{"key":"e_1_2_8_61_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2012.09.020"},{"key":"e_1_2_8_62_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2007.07.023"},{"key":"e_1_2_8_63_1","doi-asserted-by":"publisher","DOI":"10.1007\/s40745-015-0040-1"},{"key":"e_1_2_8_64_1","first-page":"29","article-title":"Online embedding and clustering of evolving data streams","volume":"2","author":"Zubaroglu A.","year":"2022","journal-title":"Stat. Anal. Data Min."}],"container-title":["Statistical Analysis and Data Mining: The ASA Data Science Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/sam.11701","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,25]],"date-time":"2024-06-25T05:23:32Z","timestamp":1719293012000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/sam.11701"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6]]},"references-count":63,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,6]]}},"alternative-id":["10.1002\/sam.11701"],"URL":"https:\/\/doi.org\/10.1002\/sam.11701","archive":["Portico"],"relation":{},"ISSN":["1932-1864","1932-1872"],"issn-type":[{"type":"print","value":"1932-1864"},{"type":"electronic","value":"1932-1872"}],"subject":[],"published":{"date-parts":[[2024,6]]},"assertion":[{"value":"2024-01-15","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-05-17","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-06-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"e11701"}}