Direct maximization of the likelihood of a hidden Markov model
Ever since the introduction of hidden Markov models by Baum and his co-workers, the method of choice for fitting such models has been maximum likelihood via the EM algorithm. In recent years it has been noticed that the gradient and Hessian of the log likelihood of hidden Markov and related models may be calculated in parallel with the filtering process by which the likelihood itself is calculated. Various authors have used, or suggested the use of, this idea in order to maximize the likelihood directly, without using the EM algorithm. In this paper we discuss an implementation of such an approach. We have found that a straightforward implementation of Newton's method sometimes works but is unreliable. A form of the Levenberg-Marquardt algorithm appears to provide excellent reliability. Two rather complex examples are given of applying this algorithm to the fitting of hidden Markov models. In the first, a better than 6-fold increase in speed over the EM algorithm was achieved. The second example turned out to be problematic (somewhat interestingly) in that the maximum likelihood estimator appears to be inconsistent. Whatever its merit, this estimator is calculated much faster by Levenberg-Marquardt than by EM. We also compared the Levenberg-Marquardt algorithm, applied to the first example, with a generic numerical maximization procedure. The Levenberg-Marquardt algorithm appeared to perform almost three times better than the generic procedure when analytic derivatives were provided to that procedure, and 19 times better when they were not.
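The idea of maximizing the likelihood directly can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a toy 2-state model with 0/1 emissions, a hypothetical logit parametrization, and finite-difference derivatives in place of the analytic gradient and Hessian that the abstract says can be computed alongside the filter. The function names (hmm_loglik, grad_hess, solve, lm_maximize) are made up for this example. The damping rule is the simplest Levenberg-Marquardt-style scheme: shrink the damping after an accepted step, grow it after a rejected one.

```python
import math
import random

def hmm_loglik(theta, obs):
    """Log likelihood of a 2-state HMM with 0/1 emissions via the scaled
    forward (filtering) recursion. theta holds logits of (a01, a10, p0, p1);
    this parametrization is an assumption made for the sketch."""
    a01, a10, p0, p1 = (1 / (1 + math.exp(-max(-30.0, min(30.0, t)))) for t in theta)
    A = [[1 - a01, a01], [a10, 1 - a10]]           # transition probabilities
    p = [p0, p1]                                   # P(obs = 1 | state)
    pred = [a10 / (a01 + a10), a01 / (a01 + a10)]  # stationary initial law
    ll = 0.0
    for y in obs:
        filt = [pred[s] * (p[s] if y == 1 else 1 - p[s]) for s in (0, 1)]
        c = filt[0] + filt[1]                      # P(y_t | y_1, ..., y_{t-1})
        ll += math.log(c)
        pred = [(filt[0] * A[0][s] + filt[1] * A[1][s]) / c for s in (0, 1)]
    return ll

def grad_hess(f, x, h=1e-3):
    """Central finite-difference gradient and Hessian (stand-in for analytic ones)."""
    n = len(x)
    def at(shifts):
        y = list(x)
        for i, d in shifts:
            y[i] += d
        return f(y)
    f0 = f(x)
    g = [(at([(i, h)]) - at([(i, -h)])) / (2 * h) for i in range(n)]
    H = [[0.0] * n for _ in range(n)]
    for i in range(n):
        H[i][i] = (at([(i, h)]) - 2 * f0 + at([(i, -h)])) / h ** 2
        for j in range(i + 1, n):
            H[i][j] = H[j][i] = (at([(i, h), (j, h)]) - at([(i, h), (j, -h)])
                                 - at([(i, -h), (j, h)]) + at([(i, -h), (j, -h)])) / (4 * h * h)
    return g, H

def solve(M, b):
    """Solve M x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(M)]
    for k in range(n):
        piv = max(range(k, n), key=lambda r: abs(aug[r][k]))
        aug[k], aug[piv] = aug[piv], aug[k]
        for r in range(k + 1, n):
            m = aug[r][k] / aug[k][k]
            for c in range(k, n + 1):
                aug[r][c] -= m * aug[k][c]
    x = [0.0] * n
    for k in reversed(range(n)):
        x[k] = (aug[k][n] - sum(aug[k][c] * x[c] for c in range(k + 1, n))) / aug[k][k]
    return x

def lm_maximize(f, x0, lam=1.0, max_iter=30, tol=1e-5):
    """Damped Newton ascent: solve (-H + lam*I) step = g, accept only if f increases."""
    x, fx = list(x0), f(x0)
    history, n = [fx], len(x0)
    for _ in range(max_iter):
        g, H = grad_hess(f, x)
        if max(abs(gi) for gi in g) < tol:
            break
        while lam < 1e12:
            M = [[-H[i][j] + (lam if i == j else 0.0) for j in range(n)] for i in range(n)]
            try:
                step = solve(M, g)
            except ZeroDivisionError:              # singular system: damp more
                lam *= 10
                continue
            x_new = [xi + si for xi, si in zip(x, step)]
            f_new = f(x_new)
            if f_new > fx:                         # accept step, relax damping
                x, fx, lam = x_new, f_new, max(lam / 10, 1e-10)
                history.append(fx)
                break
            lam *= 10                              # reject step, damp more
        else:
            break                                  # no improving step was found
    return x, fx, history

# Simulated data from an illustrative 2-state chain (all values are made up).
random.seed(0)
state, obs = 0, []
for _ in range(200):
    obs.append(1 if random.random() < (0.2, 0.8)[state] else 0)
    if random.random() < 0.1:
        state = 1 - state

theta0 = [-1.0, -1.0, -1.0, 1.0]
theta_hat, ll_hat, history = lm_maximize(lambda t: hmm_loglik(t, obs), theta0)
print("log likelihood: start %.3f, end %.3f" % (history[0], ll_hat))
```

Because a step is accepted only when it raises the log likelihood, the recorded values in history increase monotonically. In a closer analogue of the approach described in the abstract, grad_hess would be replaced by derivative recursions run in parallel with the forward pass.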
RePEc handle: RePEc:eee:csdana:v:52:y:2008:i:9:p:4147-4160