Direct maximization of the likelihood of a hidden Markov model

Direct maximization of the likelihood of a hidden Markov model

Author

Listed:

Turner, Rolf

Abstract

Ever since the introduction of hidden Markov models by Baum and his co-workers, the method of choice for fitting such models has been maximum likelihood via the EM algorithm. In recent years it has been noticed that the gradient and Hessian of the log likelihood of hidden Markov and related models may be calculated in parallel with a filtering process by which the likelihood may be calculated. Various authors have used, or suggested the use of, this idea in order to maximize the likelihood directly, without using the EM algorithm. In this paper we discuss an implementation of such an approach. We have found that a straightforward implementation of Newton's method sometimes works but is unreliable. A form of the Levenberg-Marquardt algorithm appears to provide excellent reliability. Two rather complex examples are given for applying this algorithm to the fitting of hidden Markov models. In the first a better than 6-fold increase in speed over the EM algorithm was achieved. The second example turned out to be problematic (somewhat interestingly) in that the maximum likelihood estimator appears to be inconsistent. Whatever its merit, this estimator is calculated much faster by Levenberg-Marquardt than by EM. We also compared the Levenberg-Marquardt algorithm, applied to the first example, with a generic numerical maximization procedure. The Levenberg-Marquardt algorithm appeared to perform almost three times better than the generic procedure, even when analytic derivatives were provided, and 19 times better when they were not provided.

Suggested Citation

Turner, Rolf, 2008. "Direct maximization of the likelihood of a hidden Markov model," Computational Statistics & Data Analysis, Elsevier, vol. 52(9), pages 4147-4160, May.

Handle: RePEc:eee:csdana:v:52:y:2008:i:9:p:4147-4160

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Leroux, Brian G., 1992. "Maximum-likelihood estimation for hidden Markov models," Stochastic Processes and their Applications, Elsevier, vol. 40(1), pages 127-143, February.
D. Oakes, 1999. "Direct calculation of the information matrix via the EM," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(2), pages 479-482, April.
Campillo, Fabien & Le Gland, François, 1989. "MLE for partially observed diffusions: direct maximization vs. the em algorithm," Stochastic Processes and their Applications, Elsevier, vol. 33(2), pages 245-274, December.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Forcina, Antonio, 2017. "A Fisher-scoring algorithm for fitting latent class models with individual covariates," Econometrics and Statistics, Elsevier, vol. 3(C), pages 132-140.
Gordon Anderson & Alessio Farcomeni & Maria Grazia Pittau & Roberto Zelli, 2019. "Rectangular latent Markov models for time‐specific clustering, with an analysis of the wellbeing of nations," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 68(3), pages 603-621, April.
Bartolucci, Francesco & Montanari, Giorgio E. & Pandolfi, Silvia, 2015. "Three-step estimation of latent Markov models with covariates," Computational Statistics & Data Analysis, Elsevier, vol. 83(C), pages 287-301.
Ari Hyytinen & Frode Steen & Otto Toivanen, 2018. "Cartels Uncovered," American Economic Journal: Microeconomics, American Economic Association, vol. 10(4), pages 190-222, November.
- Steen, Frode & Toivanen, Otto & Hyytinen, Ari, 2010. "Cartels Uncovered," CEPR Discussion Papers 7761, C.E.P.R. Discussion Papers.
- Hyytinen, Ari & Steen, Frode & Toivanen, Otto, 2010. "Cartels Uncovered," Discussion Paper Series in Economics 10/2010, Norwegian School of Economics, Department of Economics.
F. Bartolucci & A. Farcomeni & F. Pennoni, 2014. "Latent Markov models: a review of a general framework for the analysis of longitudinal data with covariates," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(3), pages 433-465, September.
- Bartolucci, Francesco & Farcomeni, Alessio & Pennoni, Fulvia, 2012. "Latent Markov models: a review of a general framework for the analysis of longitudinal data with covariates," MPRA Paper 39023, University Library of Munich, Germany.
Simon DeDeo & David C Krakauer & Jessica C Flack, 2010. "Inductive Game Theory and the Dynamics of Animal Conflict," PLOS Computational Biology, Public Library of Science, vol. 6(5), pages 1-16, May.
Pereira, Diogo & Nunes, Cláudia & Rodrigues, Rui, 2024. "A new algorithm for inference in HMM's with lower span complexity," Computational Statistics & Data Analysis, Elsevier, vol. 195(C).
Iain L. MacDonald, 2014. "Numerical Maximisation of Likelihood: A Neglected Alternative to EM?," International Statistical Review, International Statistical Institute, vol. 82(2), pages 296-308, August.
Bartolucci, Francesco & Bacci, Silvia & Mira, Antonietta, 2018. "On the role of latent variable models in the era of big data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 165-169.
Maruotti, Antonello & Punzo, Antonio, 2017. "Model-based time-varying clustering of multivariate longitudinal data with covariates and outliers," Computational Statistics & Data Analysis, Elsevier, vol. 113(C), pages 475-496.
Gordon Anderson & Alessio Farcomeni & Grazia Pittau & Roberto Zelli, 2017. "Rectangular latent Markov models for time-specific clustering," Working Papers tecipa-589, University of Toronto, Department of Economics.
Kristian Gundersen & Timothée Bacri & Jan Bulla & Sondre Hølleland & Antonello Maruotti & Bård Støve, 2024. "Testing for time‐varying nonlinear dependence structures: Regime‐switching and local Gaussian correlation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 51(3), pages 1012-1060, September.
Elliott, Robert J. & Chen, Zhiping & Duan, Qihong, 2009. "Insurance claims modulated by a hidden Brownian marked point process," Insurance: Mathematics and Economics, Elsevier, vol. 45(2), pages 163-172, October.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Printechapat, Tanes & Aiewsakun, Pakorn & Krityakierne, Tipaluck, 2025. "Caged Markov process – A continuous-time framework for modeling a constrained Markov process within a freely-evolving Markov process," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 230(C), pages 350-369.
Shu Yang & Jae Kwang Kim, 2016. "Likelihood-based Inference with Missing Data Under Missing-at-Random," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(2), pages 436-454, June.
María Luz Gámiz & Nikolaos Limnios & Mari Carmen Segovia-García, 2023. "The continuous-time hidden Markov model based on discretization. Properties of estimators and applications," Statistical Inference for Stochastic Processes, Springer, vol. 26(3), pages 525-550, October.
Roberto Mari & Zsuzsa Bakk & Jennifer Oser & Jouni Kuha, 2023. "A two-step estimator for multilevel latent class analysis with covariates," Psychometrika, Springer;The Psychometric Society, vol. 88(4), pages 1144-1170, December.
Ahmed Belhadjayed & Grégoire Loeper & Frédéric Abergel, 2016. "Forecasting Trends With Asset Prices," Post-Print hal-01512431, HAL.
Aknouche, Abdelhakim & Dimitrakopoulos, Stefanos, 2020. "On an integer-valued stochastic intensity model for time series of counts," MPRA Paper 105406, University Library of Munich, Germany.
Zhou, Lin & Tang, Yayong, 2021. "Linearly preconditioned nonlinear conjugate gradient acceleration of the PX-EM algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
BjÃ¶rn Andersson & Tao Xin, 2021. "Estimation of Latent Regression Item Response Theory Models Using a Second-Order Laplace Approximation," Journal of Educational and Behavioral Statistics, , vol. 46(2), pages 244-265, April.
Genon-Catalot, Valentine, 2003. "A non-linear explicit filter," Statistics & Probability Letters, Elsevier, vol. 61(2), pages 145-154, January.
Jörn Dannemann & Hajo Holzmann, 2008. "Likelihood Ratio Testing for Hidden Markov Models Under Non‐standard Conditions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 35(2), pages 309-321, June.
Massimo Guidolin, 2013. "Markov switching models in asset pricing research," Chapters, in: Adrian R. Bell & Chris Brooks & Marcel Prokopczuk (ed.), Handbook of Research Methods and Applications in Empirical Finance, chapter 1, pages 3-44, Edward Elgar Publishing.
Anton Molyboha & Michael Zabarankin, 2012. "Stochastic Optimization of Sensor Placement for Diver Detection," Operations Research, INFORMS, vol. 60(2), pages 292-312, April.
Pierre Guérin & Danilo Leiva-Leon & Massimiliano Marcellino, 2020. "Markov-Switching Three-Pass Regression Filter," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 285-302, April.
- Pierre Guerin & Danilo Leiva-Leon & Massimiliano Marcellino, 2016. "Markov-Switching Three-Pass Regression Filter," Working Papers 591, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
- Pierre Guérin & Danilo Leiva-Leon & Massimiliano Marcellino, 2017. "Markov-Switching Three-Pass Regression Filter," Staff Working Papers 17-13, Bank of Canada.
- Pierre Guérin & Danilo Leiva-Leon & Massimiliano Marcellino, 2017. "Markov-switching three-pass regression filter," Working Papers 1748, Banco de España.
Giorgio Eduardo Montanari & Marco Doretti & Maria Francesca Marino, 2022. "Model-based two-way clustering of second-level units in ordinal multilevel latent Markov models," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 457-485, June.
Bacci, Silvia & Bartolucci, Francesco & Pieroni, Luca, 2012. "A causal analysis of mother’s education on birth inequalities," MPRA Paper 38754, University Library of Munich, Germany.
Poncela, Pilar & Ruiz, Esther & Miranda, Karen, 2021. "Factor extraction using Kalman filter and smoothing: This is not just another survey," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1399-1425.
- Poncela, Pilar & Ruiz Ortega, Esther & Miranda Gualdrón, Karen Alejandra, 2020. "Factor extraction using Kalman filter and smoothing: this is not just another survey," DES - Working Papers. Statistics and Econometrics. WS 30644, Universidad Carlos III de Madrid. Departamento de EstadÃstica.
David Aristei & Silvia Bacci & Francesco Bartolucci & Silvia Pandolfi, 2021. "A bivariate finite mixture growth model with selection," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(3), pages 759-793, September.
Linda Möstel & Marius Pfeuffer & Matthias Fischer, 2020. "Statistical inference for Markov chains with applications to credit risk," Computational Statistics, Springer, vol. 35(4), pages 1659-1684, December.
Mogens Bladt & Michael SØrensen, 2009. "Efficient estimation of transition rates between credit ratings from observations at discrete time points," Quantitative Finance, Taylor & Francis Journals, vol. 9(2), pages 147-160.
Aknouche, Abdelhakim & Dimitrakopoulos, Stefanos & Touche, Nassim, 2019. "Integer-valued stochastic volatility," MPRA Paper 91962, University Library of Munich, Germany, revised 04 Feb 2019.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:52:y:2008:i:9:p:4147-4160. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Direct maximization of the likelihood of a hidden Markov model

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data