IDEAS home Printed from https://ideas.repec.org/a/taf/gnstxx/v20y2008i7p553-571.html
   My bibliography  Save this article

Estimating and clustering curves in the presence of heteroscedastic errors

Author

Listed:
  • Nicoleta Serban

Abstract

The technique introduced in this paper is a means for estimating and discovering underlying patterns for a large number of curves observed with heteroscedastic errors. Therefore, both the mean and the variance functions of each curve are assumed unknown and varying over time. The method consists of a series of steps. We transform using an orthonormal basis of functions in L2. In the transform domain, the non-parametric regression is reduced to a means model. To estimate the means in the transform domain, we consider the class of linear or modulation estimators and proceed as in Beran and Dümbgen (R. Beran and L. Dümbgen, Modulation of estimators and confidence sets, Ann. Stat. 26(5) (1998), pp. 1826–1856.) by minimising the Stein's unbiased risk estimate. By minimising the risk over a nested subset selection of modulators, we reduce the dimensionality of the means space. We show that in the transform space, the risk estimate is asymptotically optimal in the Pinsker's minimax sense over Sobolev ellipsoids under heteroscedastic errors. Coefficient estimation and dimensionality reduction via optimal risk estimation is essential for accurate clustering membership estimation. We illustrate our technique by estimating and clustering a large number of curves both within a synthetic example and within a specific application. In this application, we analyse the research and development expenditure of a subset of companies in the Compustat Global database. We show that our method compares favourably to two alternative approaches.

Suggested Citation

  • Nicoleta Serban, 2008. "Estimating and clustering curves in the presence of heteroscedastic errors," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 20(7), pages 553-571.
  • Handle: RePEc:taf:gnstxx:v:20:y:2008:i:7:p:553-571
    DOI: 10.1080/10485250802348742
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/10485250802348742
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/10485250802348742?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Robert Tibshirani & Guenther Walther & Trevor Hastie, 2001. "Estimating the number of clusters in a data set via the gap statistic," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(2), pages 411-423.
    2. Hardle, W. & Tsybakov, A., 1997. "Local polynomial estimators of the volatility function in nonparametric autoregression," Journal of Econometrics, Elsevier, vol. 81(1), pages 223-242, November.
    3. Fan, Jianqing & Yao, Qiwei, 1998. "Efficient estimation of conditional variance functions in stochastic regression," LSE Research Online Documents on Economics 6635, London School of Economics and Political Science, LSE Library.
    4. Jerome H. Friedman & Jacqueline J. Meulman, 2004. "Clustering objects on subsets of attributes (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(4), pages 815-849, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Withers Christopher S. & Nadarajah Saralees, 2011. "Expansions for the risk of Stein type estimates for non-normal data," Statistics & Risk Modeling, De Gruyter, vol. 28(2), pages 81-95, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martins-Filho, Carlos & Yao, Feng & Torero, Maximo, 2018. "Nonparametric Estimation Of Conditional Value-At-Risk And Expected Shortfall Based On Extreme Value Theory," Econometric Theory, Cambridge University Press, vol. 34(1), pages 23-67, February.
    2. Heather L. R. Tierney, 2019. "Forecasting with the Nonparametric Exclusion-from-Core Inflation Persistence Model Using Real-Time Data," International Advances in Economic Research, Springer;International Atlantic Economic Society, vol. 25(1), pages 39-63, February.
    3. Jianqing Fan, 2004. "A selective overview of nonparametric methods in financial econometrics," Papers math/0411034, arXiv.org.
    4. Biqing Cai & Dag Tjøstheim, 2015. "Nonparametric Regression Estimation for Multivariate Null Recurrent Processes," Econometrics, MDPI, vol. 3(2), pages 1-24, April.
    5. Tierney, Heather L.R., 2011. "Real-time data revisions and the PCE measure of inflation," Economic Modelling, Elsevier, vol. 28(4), pages 1763-1773, July.
    6. Scholz, Achim & Neumeyer, Natalie & Munk, Axel, 2004. "Nonparametric Analysis of Covariance : the Case of Inhomogeneous and Heteroscedastic Noise," Technical Reports 2004,28, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    7. Enno Mammen & Jens Perch Nielsen & Michael Scholz & Stefan Sperlich, 2019. "Conditional Variance Forecasts for Long-Term Stock Returns," Risks, MDPI, vol. 7(4), pages 1-22, November.
    8. Chen, Gongmeng & Choi, Yoon K. & Zhou, Yong, 2008. "Detections of changes in return by a wavelet smoother with conditional heteroscedastic volatility," Journal of Econometrics, Elsevier, vol. 143(2), pages 227-262, April.
    9. Xu, Ke-Li & Phillips, Peter C. B., 2011. "Tilted Nonparametric Estimation of Volatility Functions With Empirical Applications," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(4), pages 518-528.
    10. Chaouch, Mohamed, 2019. "Volatility estimation in a nonlinear heteroscedastic functional regression model with martingale difference errors," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 129-148.
    11. Benjamin R. Auer & Benjamin Mögel, 2016. "How Accurate are Modern Value-at-Risk Estimators Derived from Extreme Value Theory?," CESifo Working Paper Series 6288, CESifo.
    12. Comte, F. & Rozenholc, Y., 2002. "Adaptive estimation of mean and volatility functions in (auto-)regressive models," Stochastic Processes and their Applications, Elsevier, vol. 97(1), pages 111-145, January.
    13. Floriello, Davide & Vitelli, Valeria, 2017. "Sparse clustering of functional data," Journal of Multivariate Analysis, Elsevier, vol. 154(C), pages 1-18.
    14. Pérez-González, A. & Vilar-Fernández, J.M. & González-Manteiga, W., 2010. "Nonparametric variance function estimation with missing data," Journal of Multivariate Analysis, Elsevier, vol. 101(5), pages 1123-1142, May.
    15. Spokoiny, Vladimir, 2002. "Variance Estimation for High-Dimensional Regression Models," Journal of Multivariate Analysis, Elsevier, vol. 82(1), pages 111-133, July.
    16. Chen, Gongmeng & Choi, Yoon K. & Zhou, Yong, 2005. "Nonparametric estimation of structural change points in volatility models for time series," Journal of Econometrics, Elsevier, vol. 126(1), pages 79-114, May.
    17. Josephine Njeri Ngure & Anthony Gichuhi Waititu, 2021. "Consistency of an Estimator for Change Point in Volatility of Financial Returns," Journal of Mathematics Research, Canadian Center of Science and Education, vol. 13(1), pages 1-56, February.
    18. Joseph Ngatchou-Wandji & Marwa Ltaifa & Didier Alain Njamen Njomen & Jia Shen, 2022. "Nonparametric Estimation of the Density Function of the Distribution of the Noise in CHARN Models," Mathematics, MDPI, vol. 10(4), pages 1-20, February.
    19. Li, Degui & Li, Runze, 2016. "Local composite quantile regression smoothing for Harris recurrent Markov processes," Journal of Econometrics, Elsevier, vol. 194(1), pages 44-56.
    20. Gaynor, Sheila & Bair, Eric, 2017. "Identification of relevant subtypes via preweighted sparse clustering," Computational Statistics & Data Analysis, Elsevier, vol. 116(C), pages 139-154.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:gnstxx:v:20:y:2008:i:7:p:553-571. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/GNST20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.