Using Lasso Procedure for Variables Selection of Autoregressive Model for High Dimensional Time Series of Caenorhabditis Elegans Motion

Authors

  • Mohammed Khames Rasheed Department of Statistics and Informatics, College of Computer Science and Mathematics, University of Mosul, Mosul, Iraq Author
  • Osamah Basheer Shukur Department of Statistics and Informatics, College of Computer Science and Mathematics, University of Mosul, Mosul, Iraq Author

DOI:

https://doi.org/10.62933/jxdyj297

Keywords:

High dimensional time series, , Auto-regressive,, Variable selection, , LASSO, , Caenorhabditis elegans motion

Abstract

The Lasso is a common model for selection and also is a common estimation procedure for linear models. In this study, Lasso estimator will be used for obtaining more fitted autoregressive time series models. Simulation procedure has been used to generate a time series of the motion caenorhabditis elegans (CE represented by the tan-angles of wave-motion). Each observation of this time series is a recorded frame (0.5 second) of 2.5 hours video of CE motion. in this study, the real and simulated univariate time series of CE motion (tan-angles) are modeled via Lasso and autoregressive models (hybrid Lasso-AR approch) after multi-processes of variable selection. The results of simulated and real univariate time series reflects more fitted models after performing variables selection procedure. In conclusion, hybrid Lasso-AR approach can be used for best high dimensional time series modelling.

References

[1] Yemini, E., Jucikas, T., Grundy, L.J., Brown, A.E.X., and Schafer, W.R. (2013). A database of Caenorhabditis elegans behavioral phenotypes. Nature Methods, 10(9): p. 877-879.

[2] Adamek, R., Smeekes, S., and Wilms, I. (2022). Lasso inference for high-dimensional time series. Journal of Econometrics.

[3] Zhang, Y., Li, R., and Tsai, C.-L. (2010). Regularization parameter selections via generalized information criterion. Journal of the American Statistical Association, 105(489): p. 312-323.

[4] Guyon, I. and Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of machine learning research, 3(Mar): p. 1157-1182.

[5] Nardi, Y. and Rinaldo, A. (2011). Autoregressive process modeling via the lasso procedure. Journal of Multivariate Analysis, 102(3): p. 528-549.

[6] Chan, N.H., Yau, C.Y., and Zhang, R.-M. (2015). LASSO estimation of threshold autoregressive models. Journal of Econometrics, 189(2): p. 285-296.

[7] Adamek, R., Smeekes, S., and Wilms, I. (2023). Lasso inference for high-dimensional time series. Journal of Econometrics, 235(2): p. 1114-1143.

[8] Iturbide, E., Cerda, J., and Graff, M. (2013). A comparison between LARS and LASSO for initialising the time-series forecasting auto-regressive equations. Procedia Technology, 7: p. 282-288.

[9] Brockwell, P.J. and Davis, R.A. (2009). Time series: theory and methods. Springer science & business media.

[10] Liu, L.-M. (2006). Time Series Analysis and Forecasting. 2nd ed. Illinois, USA: Scientific Computing Associates Corp.

[11] Shukur, O.B. (2015). Artifical Neural Network and Kalman Filter Approaches Based on ARIMA for Daily Wind Speed Forecasting, Universiti Teknologi Malaysia.

[12] Muhammadullah, S., Urooj, A., Khan, F., Alshahrani, M.N., Alqawba, M., and Al-Marzouki, S. (2022). Comparison of weighted lag adaptive LASSO with Autometrics for covariate selection and forecasting using time-series data. Complexity, 2022: p. 1-10.

[13] Müller, I.M. (2021). Feature selection for energy system modeling: Identification of relevant time series information. Energy and AI, 4: p. 100057.

[14] Fahrmeir, L., Kneib, T., Lang, S., Marx, B., Fahrmeir, L., Kneib, T., Lang, S., and Marx, B. (2013). Generalized linear models. Regression: Models, methods and applications: p. 269-324.

[15] Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1): p. 267-288.

[16] Friedman, J., Hastie, T., and Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of statistical software, 33(1): p. 1.

[17] Wood, S.N. (2006). Low‐rank scale‐invariant tensor product smooths for generalized additive mixed models. Biometrics, 62(4): p. 1025-1036.

[18] Freijeiro‐González, L., Febrero‐Bande, M., and González‐Manteiga, W. (2022). A critical review of LASSO and its derivatives for variable selection under dependence among covariates. International Statistical Review, 90(1): p. 118-145.

[19] Wang, T. (2021). A combined model for short-term wind speed forecasting based on empirical mode decomposition, feature selection, support vector regression and cross-validated lasso. PeerJ Computer Science, 7: p. e732.

[20] Androulakis, E., Koukouvinos, C., and Mylona, K. (2011). Tuning parameter estimation in penalized least squares methodology. Communications in Statistics-Simulation and Computation, 40(9): p. 1444-1457.

[21] Li, Y., Dicker, L., and Zhao, S.D. (2014). The Dantzig selector for censored linear regression models. Statistica Sinica, 24(1): p. 251.

[22] Hirotugu, A. (1973). Information theory and an extension of the maximum likelihood principle, in Selected Papers of Hirotugu Akaike, Springer. p. 199-213.

[23] Burnham, K.P. and Anderson, D.R. (2002). Model selection and multimodel inference: a practical information-theoretic approach. Springer Science & Business Media.

[24] Dawson, C. and Wilby, R. (2001). Hydrological modelling using artificial neural networks. Progress in physical Geography, 25(1): p. 80-108.

[25] Billah, B., Hyndman, R.J., and Koehler, A.B. (2005). Empirical information criteria for time series forecasting model selection. Journal of Statistical Computation and Simulation, 75(10): p. 831-840.

[26] Bozdogan, H. (2000). Akaike's Information Criterion and Recent Developments in Information Complexity. Journal of Mathematical Psychology, 44(1): p. 62-91.

[27] Zhang, Y. and Yang, Y. (2015). Cross-validation for selecting a model selection procedure. Journal of Econometrics, 187(1): p. 95-112.

[28] Hyndman, R.J. and Koehler, A.B. (2006). Another look at measures of forecast accuracy. International journal of forecasting, 22(4): p. 679-688.

Downloads

Published

2025-05-11

Issue

Section

Original Articles

How to Cite

Using Lasso Procedure for Variables Selection of Autoregressive Model for High Dimensional Time Series of Caenorhabditis Elegans Motion. (2025). Iraqi Statisticians Journal, 2(special issue for ICSA2025), 277-284. https://doi.org/10.62933/jxdyj297