IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v167y2018icp319-330.html

A model selection approach for multiple sequence segmentation and dimensionality reduction

Author

Listed:
  • Castro, Bruno M.
  • Lemes, Renan B.
  • Cesar, Jonatas
  • Hünemeier, Tábita
  • Leonardi, Florencia

Abstract

In this paper we consider the problem of segmenting n aligned random sequences of equal length m into a finite number of independent blocks. We propose a penalized maximum likelihood criterion to infer simultaneously the number of points of independence as well as the position of each point. We show how to compute exactly the estimator by means of a dynamic programming algorithm with time complexity O(m^2 n). We also propose another method, called hierarchical algorithm, that provides an approximation to the estimator when the sample size increases and runs in time O(m ln(m) n). Our main theoretical results are the strong consistency of both estimators when the sample size n grows to infinity. We illustrate the convergence of these algorithms through some simulation examples and we apply the method to identify recombination hotspots in real SNPs data.
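
As a rough illustration of the dynamic programming idea mentioned in the abstract, the following is a minimal Python sketch, not the authors' implementation. The abstract does not state the form of the penalty, so the one used here, c * (alphabet_size^(block length) - 1) * log(n), is an assumption chosen only for illustration. The names block_loglik, segment, alphabet_size and c are hypothetical. The block likelihood is recomputed naively for every candidate block, so this sketch does not attempt to reach the O(m^2 n) running time stated above.

```python
import math
from collections import Counter


def block_loglik(rows, i, j):
    """Maximized log-likelihood of columns i..j-1 treated as one joint block."""
    n = len(rows)
    counts = Counter(tuple(r[i:j]) for r in rows)
    return sum(c * math.log(c / n) for c in counts.values())


def segment(rows, alphabet_size, c=0.5):
    """Return sorted cut positions maximizing a penalized log-likelihood.

    A cut at position k separates column k-1 from column k (0-indexed).
    The penalty form is an assumption, not taken from the paper.
    """
    n, m = len(rows), len(rows[0])

    def score(i, j):
        # Assumed penalty: free parameters of the block times log(n), scaled by c.
        penalty = c * (alphabet_size ** (j - i) - 1) * math.log(n)
        return block_loglik(rows, i, j) - penalty

    # best[j] is the best penalized score for the first j columns;
    # prev[j] is the start of the last block in that best segmentation.
    best = [0.0] + [-math.inf] * m
    prev = [0] * (m + 1)
    for j in range(1, m + 1):
        for i in range(j):
            s = best[i] + score(i, j)
            if s > best[j]:
                best[j], prev[j] = s, i

    # Walk back through prev to recover the interior block boundaries.
    cuts = []
    j = m
    while j > 0:
        j = prev[j]
        if j > 0:
            cuts.append(j)
    return sorted(cuts)


if __name__ == "__main__":
    # Toy data: columns 0 and 1 are identical, columns 2 and 3 are identical,
    # and the two pairs vary independently of each other.
    rows = [(a, a, b, b) for a in (0, 1) for b in (0, 1)] * 25
    print(segment(rows, alphabet_size=2))
```

On the toy data in the usage example, the two pairs of duplicated columns should be recovered as two blocks separated by a single cut. With real data the result depends on the choice of penalty, which is why it is left as an explicit parameter here.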

Suggested Citation

  • Castro, Bruno M. & Lemes, Renan B. & Cesar, Jonatas & Hünemeier, Tábita & Leonardi, Florencia, 2018. "A model selection approach for multiple sequence segmentation and dimensionality reduction," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 319-330.
  • Handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:319-330
    DOI: 10.1016/j.jmva.2018.05.006

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X18302331
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2018.05.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    References listed on IDEAS

    1. Richard J. Boys & Daniel A. Henderson, 2004. "A Bayesian Approach to DNA Sequence Segmentation," Biometrics, The International Biometric Society, vol. 60(3), pages 573-581, September.
    2. Douglas M. Hawkins, 1976. "Point Estimation of the Parameters of Piecewise Regression Models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 25(1), pages 51-57, March.
    3. Clive J Hoggart & John C Whittaker & Maria De Iorio & David J Balding, 2008. "Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies," PLOS Genetics, Public Library of Science, vol. 4(7), pages 1-8, July.
    4. Fridlyand, Jane & Snijders, Antoine M. & Pinkel, Dan & Albertson, Donna G. & Jain, Ajay N., 2004. "Hidden Markov models approach to the analysis of array CGH data," Journal of Multivariate Analysis, Elsevier, vol. 90(1), pages 132-153, July.
    5. Jushan Bai & Pierre Perron, 2003. "Computation and analysis of multiple structural change models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 18(1), pages 1-22.

    Citations


    Cited by:

    1. Florencia Leonardi & Matías Lopez‐Rosenfeld & Daniela Rodriguez & Magno T. F. Severino & Mariela Sued, 2021. "Independent block identification in multivariate time series," Journal of Time Series Analysis, Wiley Blackwell, vol. 42(1), pages 19-33, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Salvatore Fasola & Vito M. R. Muggeo & Helmut Küchenhoff, 2018. "A heuristic, iterative algorithm for change-point detection in abrupt change models," Computational Statistics, Springer, vol. 33(2), pages 997-1015, June.
    2. Alessandro Casini & Pierre Perron, 2018. "Structural Breaks in Time Series," Boston University - Department of Economics - Working Papers Series WP2019-02, Boston University - Department of Economics.
    3. Casini, Alessandro & Perron, Pierre, 2021. "Continuous record Laplace-based inference about the break date in structural change models," Journal of Econometrics, Elsevier, vol. 224(1), pages 3-21.
    4. Pierre Perron & Yohei Yamamoto, 2008. "Estimating and Testing Multiple Structural Changes in Models with Endogenous Regressors," Boston University - Department of Economics - Working Papers Series wp2008-017, Boston University - Department of Economics.
    5. Pierre Perron & Yohei Yamamoto, 2015. "Using OLS to Estimate and Test for Structural Changes in Models with Endogenous Regressors," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(1), pages 119-144, January.
    6. Lee, Chien-Chiang & Chen, Mei-Ping & Chang, Chi-Hung, 2013. "Dynamic relationships between industry returns and stock market returns," The North American Journal of Economics and Finance, Elsevier, vol. 26(C), pages 119-144.
    7. Moonsoo Park & Yanhong Jin & Alan Love, 2011. "Dynamic and contemporaneous causality in a supply chain: an application of the US beef industry," Applied Economics, Taylor & Francis Journals, vol. 43(30), pages 4785-4801.
    8. Bilal Mehmood & Syed Hassan Raza & Mahwish Rana & Huma Sohaib & Muhammad Azhar Khan, 2014. "Triangular Relationship between Energy Consumption, Price Index and National Income in Asian Countries: A Pooled Mean Group Approach in Presence of Structural Breaks," International Journal of Energy Economics and Policy, Econjournals, vol. 4(4), pages 610-620.
    9. Matteo Mogliani, 2010. "Residual-based tests for cointegration and multiple deterministic structural breaks: A Monte Carlo study," Working Papers halshs-00564897, HAL.
    10. Aye, Goodness & Gupta, Rangan & Hammoudeh, Shawkat & Kim, Won Joong, 2015. "Forecasting the price of gold using dynamic model averaging," International Review of Financial Analysis, Elsevier, vol. 41(C), pages 257-266.
    11. Hilary S. Booth & Conrad J. Burden & John H. Maindonald & Lucia Santoso & Matthew J. Wakefield & Susan R. Wilson, 2005. "Discussion of “A Bayesian Approach to DNA Sequence Segmentation”," Biometrics, The International Biometric Society, vol. 61(2), pages 635-637, June.
    12. Mariam Camarero & Juan Sapena & Cecilio Tamarit, 2020. "Modelling Time-Varying Parameters in Panel Data State-Space Frameworks: An Application to the Feldstein–Horioka Puzzle," Computational Economics, Springer;Society for Computational Economics, vol. 56(1), pages 87-114, June.
    13. Bernard, Jean-Thomas & Idoudi, Nadhem & Khalaf, Lynda & Yelou, Clement, 2007. "Finite sample multivariate structural change tests with application to energy demand models," Journal of Econometrics, Elsevier, vol. 141(2), pages 1219-1244, December.
    14. Nuruddeen Usman & Kodili Nwanneka & Nduka, 2023. "Announcement Effect of COVID-19 on Cryptocurrencies," Asian Economics Letters, Asia-Pacific Applied Economics Association, vol. 3(3), pages 1-4.
    15. Kevin S. Nell & Maria M. De Mello, 2019. "The interdependence between the saving rate and technology across regimes: evidence from South Africa," Empirical Economics, Springer, vol. 56(1), pages 269-300, January.
    16. Ngene, Geoffrey & Tah, Kenneth A. & Darrat, Ali F., 2017. "Long memory or structural breaks: Some evidence for African stock markets," Review of Financial Economics, Elsevier, vol. 34(C), pages 61-73.
    17. Parma Chakravartti & Sudipto Mundle, 2017. "An Automatic Leading Indicator Based Growth Forecast For 2016-17 and The Outlook Beyond," Working Papers id:11773, eSocialSciences.
    18. Mina Kim & Deokwoo Nam & Jian Wang & Jason J. Wu, 2013. "International trade price stickiness and exchange rate pass-through in micro data: a case study on U.S.–China trade," Globalization Institute Working Papers 135, Federal Reserve Bank of Dallas.
    19. Nikeel Kumar & Ronald Ravinesh Kumar & Radika Kumar & Peter Josef Stauvermann, 2020. "Is the tourism–growth relationship asymmetric in the Cook Islands? Evidence from NARDL cointegration and causality tests," Tourism Economics, vol. 26(4), pages 658-681, June.
    20. Meng Xu & Avishai Ceder & Ziyou Gao & Wei Guan, 2010. "Mass transit systems of Beijing: governance evolution and analysis," Transportation, Springer, vol. 37(5), pages 709-729, September.
