Using Lasso Procedure for Variables Selection of Autoregressive Model for High Dimensional Time Series of Caenorhabditis Elegans Motion
DOI:
https://doi.org/10.62933/jxdyj297Keywords:
High dimensional time series, , Auto-regressive,, Variable selection, , LASSO, , Caenorhabditis elegans motionAbstract
The Lasso is a common model for selection and also is a common estimation procedure for linear models. In this study, Lasso estimator will be used for obtaining more fitted autoregressive time series models. Simulation procedure has been used to generate a time series of the motion caenorhabditis elegans (CE represented by the tan-angles of wave-motion). Each observation of this time series is a recorded frame (0.5 second) of 2.5 hours video of CE motion. in this study, the real and simulated univariate time series of CE motion (tan-angles) are modeled via Lasso and autoregressive models (hybrid Lasso-AR approch) after multi-processes of variable selection. The results of simulated and real univariate time series reflects more fitted models after performing variables selection procedure. In conclusion, hybrid Lasso-AR approach can be used for best high dimensional time series modelling.
References
[1] Yemini, E., Jucikas, T., Grundy, L.J., Brown, A.E.X., and Schafer, W.R. (2013). A database of Caenorhabditis elegans behavioral phenotypes. Nature Methods, 10(9): p. 877-879.
[2] Adamek, R., Smeekes, S., and Wilms, I. (2022). Lasso inference for high-dimensional time series. Journal of Econometrics.
[3] Zhang, Y., Li, R., and Tsai, C.-L. (2010). Regularization parameter selections via generalized information criterion. Journal of the American Statistical Association, 105(489): p. 312-323.
[4] Guyon, I. and Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of machine learning research, 3(Mar): p. 1157-1182.
[5] Nardi, Y. and Rinaldo, A. (2011). Autoregressive process modeling via the lasso procedure. Journal of Multivariate Analysis, 102(3): p. 528-549.
[6] Chan, N.H., Yau, C.Y., and Zhang, R.-M. (2015). LASSO estimation of threshold autoregressive models. Journal of Econometrics, 189(2): p. 285-296.
[7] Adamek, R., Smeekes, S., and Wilms, I. (2023). Lasso inference for high-dimensional time series. Journal of Econometrics, 235(2): p. 1114-1143.
[8] Iturbide, E., Cerda, J., and Graff, M. (2013). A comparison between LARS and LASSO for initialising the time-series forecasting auto-regressive equations. Procedia Technology, 7: p. 282-288.
[9] Brockwell, P.J. and Davis, R.A. (2009). Time series: theory and methods. Springer science & business media.
[10] Liu, L.-M. (2006). Time Series Analysis and Forecasting. 2nd ed. Illinois, USA: Scientific Computing Associates Corp.
[11] Shukur, O.B. (2015). Artifical Neural Network and Kalman Filter Approaches Based on ARIMA for Daily Wind Speed Forecasting, Universiti Teknologi Malaysia.
[12] Muhammadullah, S., Urooj, A., Khan, F., Alshahrani, M.N., Alqawba, M., and Al-Marzouki, S. (2022). Comparison of weighted lag adaptive LASSO with Autometrics for covariate selection and forecasting using time-series data. Complexity, 2022: p. 1-10.
[13] Müller, I.M. (2021). Feature selection for energy system modeling: Identification of relevant time series information. Energy and AI, 4: p. 100057.
[14] Fahrmeir, L., Kneib, T., Lang, S., Marx, B., Fahrmeir, L., Kneib, T., Lang, S., and Marx, B. (2013). Generalized linear models. Regression: Models, methods and applications: p. 269-324.
[15] Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1): p. 267-288.
[16] Friedman, J., Hastie, T., and Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of statistical software, 33(1): p. 1.
[17] Wood, S.N. (2006). Low‐rank scale‐invariant tensor product smooths for generalized additive mixed models. Biometrics, 62(4): p. 1025-1036.
[18] Freijeiro‐González, L., Febrero‐Bande, M., and González‐Manteiga, W. (2022). A critical review of LASSO and its derivatives for variable selection under dependence among covariates. International Statistical Review, 90(1): p. 118-145.
[19] Wang, T. (2021). A combined model for short-term wind speed forecasting based on empirical mode decomposition, feature selection, support vector regression and cross-validated lasso. PeerJ Computer Science, 7: p. e732.
[20] Androulakis, E., Koukouvinos, C., and Mylona, K. (2011). Tuning parameter estimation in penalized least squares methodology. Communications in Statistics-Simulation and Computation, 40(9): p. 1444-1457.
[21] Li, Y., Dicker, L., and Zhao, S.D. (2014). The Dantzig selector for censored linear regression models. Statistica Sinica, 24(1): p. 251.
[22] Hirotugu, A. (1973). Information theory and an extension of the maximum likelihood principle, in Selected Papers of Hirotugu Akaike, Springer. p. 199-213.
[23] Burnham, K.P. and Anderson, D.R. (2002). Model selection and multimodel inference: a practical information-theoretic approach. Springer Science & Business Media.
[24] Dawson, C. and Wilby, R. (2001). Hydrological modelling using artificial neural networks. Progress in physical Geography, 25(1): p. 80-108.
[25] Billah, B., Hyndman, R.J., and Koehler, A.B. (2005). Empirical information criteria for time series forecasting model selection. Journal of Statistical Computation and Simulation, 75(10): p. 831-840.
[26] Bozdogan, H. (2000). Akaike's Information Criterion and Recent Developments in Information Complexity. Journal of Mathematical Psychology, 44(1): p. 62-91.
[27] Zhang, Y. and Yang, Y. (2015). Cross-validation for selecting a model selection procedure. Journal of Econometrics, 187(1): p. 95-112.
[28] Hyndman, R.J. and Koehler, A.B. (2006). Another look at measures of forecast accuracy. International journal of forecasting, 22(4): p. 679-688.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Mohammed Khames Rasheed, Osamah Basheer Shukur (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Licensed under a CC-BY license: https://creativecommons.org/licenses/by-nc-sa/4.0/