, , vol.87
98 Conclusions and Future Work Contents 7.1 Conclusions ,
, Open Issues and Future Directions
, Nicolaus Henke, and Monica Trench. Artificial intelligence: the next digital frontier? McKinsey Global Institute Report, 2017.
Pattern recognition and machine learning, 2006. ,
Mining data streams: a review, SIGMOD 34, vol.2, pp.18-26, 2005. ,
, Data streams: models and algorithms, vol.31, 2007.
Principles of data mining, 2001. ,
A survey of synopsis construction in data streams, Data Streams, pp.169-207, 2007. ,
Learning from data streams: processing techniques in sensor networks, 2007. ,
Data mining: concepts and techniques, 2011. ,
A survey on concept drift adaptation, Computing Surveys (CSUR), vol.46, p.44, 2014. ,
Data stream mining in ubiquitous environments: state-of-theart and current directions, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.4, pp.116-138, 2014. ,
Estimating the effective dimension of large biological datasets using Fisher separability analysis, International Joint Conference on Neural Networks (IJCNN), pp.1-8, 2019. ,
Machine learning and complex biological data, 2019. ,
Spam filtering: how the dimensionality reduction affects the accuracy of Naive Bayes classifiers, Journal of Internet Services and Applications, vol.1, issue.3, pp.183-200, 2011. ,
Linear dimensionality reduction: Survey, insights, and generalizations, Journal of Machine Learning Research (JMLR), vol.16, issue.1, pp.2859-2900, 2015. ,
Dimensionality reduction: a comparative, Journal of Machine Learning Research, vol.10, p.13, 2009. ,
A survey of dimensionality reduction techniques, 2014. ,
Issues in evaluation of stream learning algorithms, pp.329-338, 2009. ,
Machine learning for streaming data: state of the art, challenges, and opportunities, ACM SIGKDD Explorations Newsletter, vol.21, pp.6-22, 2019. ,
An improved data stream summary: the count-min sketch and its applications, Journal of Algorithms, vol.55, pp.58-75, 2005. ,
Feature hashing for large scale multitask learning, International Conference on Machine Learning (ICML), pp.1113-1120, 2009. ,
Learning from time-changing data with adaptive windowing, International Conference on Data Mining (ICDM). SIAM, pp.443-448, 2007. ,
An introduction to kernel and nearest-neighbor nonparametric regression, The American Statistician, vol.46, pp.175-185, 1992. ,
Compressed sensing, IEEE Transactions on Information Theory, vol.52, issue.4, pp.1289-1306, 2006. ,
Leveraging bagging for evolving data streams, Joint European conference on machine learning and knowledge discovery in databases, pp.135-150, 2010. ,
Adaptive random forests for evolving data stream classification, Machine Learning, pp.1-27, 2017. ,
Umap: Uniform manifold approximation and projection for dimension reduction, 2018. ,
Data stream analysis: foundations, progress in classification and tools ,
Survey on feature transformation techniques for data streams, International Joint Conference on Artificial Intelligence (IJCAI), 2020. ,
A survey of stream data mining, National Conference with International participation ETAI, pp.19-21, 2007. ,
Discovery of frequent patterns in transactional data streams, Transactions on large-scale data-and knowledge-centered systems II, pp.1-30, 2010. ,
Models and issues in data stream systems, pp.1-16, 2002. ,
Querying and mining data streams: you only get one look a tutorial, ACM SIGMOD, pp.635-635, 2002. ,
Approximate frequency counts over data streams, Very Large Data Bases (VLDB), pp.346-357, 2002. ,
Space/time trade-offs in hash coding with allowable errors, Communications of the ACM, vol.13, pp.422-426, 1970. ,
New ensemble methods for evolving data streams, pp.139-148, 2009. ,
Data stream mining a practical approach, 2009. ,
Machine learning for data streams: with practical examples in MOA, 2018. ,
Bayesian network classifiers, Machine learning, vol.29, pp.131-163, 1997. ,
Efficient data stream classification via probabilistic adaptive windows, Symposium On Applied Computing (SIGAPP), pp.801-806, 2013. ,
KNN classifier with self adjusting memory for heterogeneous concept drift, International Conference on Data Mining (ICDM), pp.291-300, 2016. ,
Mining high-speed data streams, SIGKDD. ACM, pp.71-80, 2000. ,
Accurate decision trees for mining high-speed data streams, SIGKDD. ACM, pp.523-528, 2003. ,
Decision trees for mining data streams, Intelligent Data Analysis (IDA), vol.10, pp.23-45, 2006. ,
Adaptive learning from evolving data streams, Intelligent Data Analysis (IDA), pp.249-260, 2009. ,
Ensemble based systems in decision making, IEEE Circuits and Systems Magazine, vol.6, issue.3, pp.21-45, 2006. ,
Bagging predictors, ML 24, vol.2, pp.123-140, 1996. ,
Ensemble methods in machine learning, International Workshop on Multiple Classifier Systems, pp.1-15, 2000. ,
A survey on ensemble learning for data stream classification, Computing Surveys (CSUR), vol.50, p.23, 2017. ,
Online bagging and boosting, IEEE International Conference on Systems, Man and Cybernetics, vol.3, pp.2340-2345, 2005. ,
Streaming random patches for evolving data stream classification, International Conference on Data Mining (ICDM), 2019. ,
Computational methods of feature selection, 2007. ,
Feature extraction, construction and selection: A data mining perspective, vol.453, 1998. ,
A survey on feature drift adaptation: Definition, benchmark, challenges and future directions, Journal of Systems and Software, vol.127, pp.278-294, 2017. ,
A survey on data preprocessing for data stream mining: Current status and future directions, Neurocomputing, vol.239, pp.39-57, 2017. ,
A survey on online feature selection with streaming features, Frontiers of Computer Science, vol.12, pp.479-493, 2018. ,
Streaming feature selection algorithms for big data: A survey, Applied Computing and Informatics, 2019. ,
Incremental PCA for on-line visual learning and recognition, Object Recognition Supported by User interaction for Service Robots, vol.3, pp.781-784, 2002. ,
Candid covariance-free incremental principal component analysis, Transactions on Pattern Analysis and Machine Intelligence, vol.25, pp.1034-1040, 2003. ,
Incremental learning for robust visual tracking, International Journal of Computer Vision (IJCV), pp.125-141, 2008. ,
Memory limited, streaming PCA, Neural Information Processing Systems (NIPS), pp.2886-2894, 2013. ,
Single-pass PCA of large high-dimensional data, International Joint Conference on Artificial Intelligence (IJCAI, 2017. ,
Online classification algorithm for data streams based on fast iterative Kernel principal component analysis, International Conference on Natural Computation (ICNC), vol.1, pp.232-236, 2009. ,
Fast iterative kernel principal component analysis, Journal of Machine Learning Research (JMLR), vol.8, pp.1893-1918, 2007. ,
Online principal component analysis in high dimension: Which algorithm to choose?, In: International Statistical Review, vol.86, pp.29-50, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01700948
An introduction to MDS, In: Sound Quality Research Unit, vol.46, pp.1-26, 2003. ,
Incremental multidimensional scaling, The Learning Workshop. Citeseer, vol.173, p.227, 2010. ,
Streaming classical multidimensional scaling, New York Scientific Data Summit (NYSDS), pp.1-2, 2018. ,
Greedy layerwise training of deep networks, Neural Information Processing Systems (NIPS), pp.153-160, 2007. ,
Extracting and composing robust features with denoising autoencoders, International Conference on Machine Learning (ICML), pp.1096-1103, 2008. ,
Online incremental feature learning with denoising autoencoders, Artificial Intelligence and Statistics, pp.1453-1461, 2012. ,
Threaded ensembles of supervised and unsupervised neural networks for stream learning, pp.304-315, 2016. ,
Discriminant analysis and statistical pattern recognition, vol.544, 2004. ,
Efficient handling of concept drift and concept evolution over stream data, International Conference on Data Engineering (ICDE), pp.481-492, 2016. ,
Incremental linear discriminant analysis for classification of data streams, Transactions on Systems, Man, and Cybernetics, vol.35, pp.905-914, 2005. ,
IDR/QR: An incremental dimension reduction algorithm via QR decomposition, Transactions on Knowledge and Data Engineering, vol.17, pp.1208-1222, 2005. ,
Incremental linear discriminant analysis using sufficient spanning sets and its applications, International Journal of Computer Vision (IJCV), vol.91, pp.216-232, 2011. ,
Efficient and robust feature extraction by maximum margin criterion, Neural Information Processing Systems (NIPS), pp.97-104, 2004. ,
IMMC: incremental maximum margin criterion, SIGKDD. ACM, pp.725-730, 2004. ,
, The random projection method, vol.65, 2005.
An elementary proof of the Johnson-Lindenstrauss lemma, International Computer Science Institute, pp.1-5, 1999. ,
Extensions of Lipschitz maps into Banach spaces, Israel Journal of Mathematics, vol.54, pp.129-138, 1986. ,
Real-time dynamic MR image reconstruction using Kalman filtered compressed sensing, International Conference on Acoustics, Speech and Signal Processing, pp.393-396, 2009. ,
Real-time visual tracking using compressive sensing, Computer Vision and Pattern Recognition (CVPR), pp.1305-1312, 2011. ,
Vehicle classification using compressive sensing, International Conference on Recent Trends in Electronics, Information and Communication Technology (RTEICT), pp.692-696, 2017. ,
Database-friendly random projections: Johnson-Lindenstrauss with binary coins, Journal of computer and System Sciences, vol.66, issue.4, pp.671-687, 2003. ,
Localitysensitive hashing scheme based on p-stable distributions, Symposium on Computational Geometry (SOCG), pp.253-262, 2004. ,
A global geometric framework for nonlinear dimensionality reduction, science 290, vol.5500, pp.2319-2323, 2000. ,
Nonlinear manifold learning for data stream, International Conference on Data Mining (ICDM). SIAM, pp.33-44, 2004. ,
Error metrics for learning reliable manifolds from streaming data, International Conference on Data Mining (ICDM). SIAM. 2017, pp.750-758 ,
Visualizing data using t-SNE, Journal of Machine Learning Research (JMLR), vol.9, pp.2579-2605, 2008. ,
Present position and potential developments: Some personal views statistical theory the prequential approach, Journal of the Royal Statistical Society: Series A (General), vol.147, pp.278-290, 1984. ,
Extensions to the k-means algorithm for clustering large data sets with categorical values, Data Mining and Knowledge Discovery, vol.2, issue.3, pp.283-304, 1998. ,
Sketch-based naive Bayes algorithms for evolving data streams, International Conference on Big Data, pp.604-613, 2018. ,
Synopses for massive data: Samples, histograms, wavelets, sketches, pp.1-294 ,
Data Sketching, Queue 15, vol.2, p.60, 2017. ,
Graphical model sketch, Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp.81-97, 2016. ,
Moa: Massive online analysis, Journal of Machine Learning Research (JMLR), vol.11, pp.1601-1604, 2010. ,
A streaming ensemble algorithm (SEA) for largescale classification, SIGKDD. ACM, pp.377-382, 2001. ,
Classification and regression trees, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.1, issue.1, pp.14-23, 2011. ,
Database mining: A performance perspective, Transactions on Knowledge and Data Engineering (TKDE), vol.5, pp.914-925, 1993. ,
Mining time-changing data streams, pp.97-106, 2001. ,
Splice-2 comparative evaluation: electricity pricing, 1999. ,
The enron corpus: A new dataset for email classification research, European Conference on Machine Learning (ECML), pp.217-226, 2004. ,
Learning word vectors for sentiment analysis, ACL-HLT, pp.142-150, 2011. ,
Agglomeration and elimination of terms for dimensionality reduction, International Conference on Intelligent Systems Design and Applications, pp.547-552, 2009. ,
Discretization from data streams: applications to histograms and data mining, ACM symposium on Applied computing, pp.662-667, 2006. ,
Weka: A machine learning workbench, Proceedings of the second Australia and New Zealand Conference on Intelligent Information Systems, pp.357-361, 1994. ,
Compressed k-nearest neighbors ensembles for evolving data streams, European Conference on Artificial Intelligence (ECAI), 2020. ,
Learning compressed sensing, Snowbird Learning Workshop. Citeseer, 2007. ,
Sparse image and signal processing: Wavelets and related geometric multiscale analysis, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01658853
From sparse solutions of systems of equations to sparse modeling of signals and images, SIAM Review, vol.51, pp.34-81, 2009. ,
Compressed sensing and sparse filtering, 2014. ,
Batch-incremental versus instance-incremental learning in dynamic and evolving data, Intelligent Data Analysis (IDA), pp.313-323, 2012. ,
A performance comparison of measurement matrices in compressive sensing, International Journal of Communication Systems, vol.31, p.3576, 2018. ,
Deterministic sensing matrices in compressive sensing: a survey, The Scientific World Journal, 2013. ,
Compressed sensing of streaming data, 51st Annual Allerton Conference on Communication, Control, and Computing. IEEE. 2013, pp.1242-1249 ,
A simple proof of the restricted isometry property for random matrices, Constructive Approximation, vol.28, pp.253-263, 2008. ,
Persistent homology-a survey, Contemporary Mathematics, vol.453, pp.257-282, 2008. ,
The persistent homology of distance functions under random projection, Symposium on Computational Geometry (SOCG), p.328, 2014. ,
New and improved Johnson-Lindenstrauss embeddings via the restricted isometry property, Mathematical Analysis, vol.43, pp.1269-1281, 2011. ,
Evangelos Banos, Nick Bassiliades, and Ioannis Vlahavas, Journal of Intelligent Information Systems, vol.32, pp.191-212, 2009. ,
Compressed adaptive random forests for evolving data streams, International Joint Conference on Neural Networks (IJCNN), 2020. ,
Random forests, Machine learning, vol.45, pp.5-32, 2001. ,
Design and analysis of the Nomao challenge active learning in the real-world, ALRA, Workshop ECML-PKDD. sn, 2012. ,
Human activity recognition on smartphones using a multiclass hardware-friendly support vector machine, pp.216-223, 2012. ,
Efficient batchincremental classification for evolving data streams, Intelligent Data Analysis (IDA), 2020. ,
Evolving extended naive Bayes classifiers, International Conference on Data Mining Workshops. IEEE, pp.643-647, 2006. ,
Applied logistic regression, vol.398, 2013. ,
Support-vector networks, pp.273-297, 1995. ,
Batchincremental learning for mining data streams, 2004. ,
Analysis of a complex of statistical variables into principal components, In: Journal of Educational Psychology, vol.24, p.417, 1933. ,
Lazy learning, 2013. ,
Data mining: the textbook, 2015. ,
Scikit-learn: Machine learning in Python, Journal of Machine Learning Research (JMLR), vol.12, pp.2825-2830, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00650905
Scikit-multiflow: a multi-output streaming framework, Journal of Machine Learning Research (JMLR), vol.19, pp.2915-2914, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-02287993
Self hyper-parameter tuning for data streams, Data Streams, 2018. ,