IDEAS home Printed from
   My bibliography  Save this paper

A prediction approach to sampling design




Standard approaches to sample surveys take as the point of departure the estimation of one or several population totals (or means), or a few predefined sub-totals (or sub-means). While the model-based prediction approach provides an attractive framework for estimation and inference, a model-based theory for the variety of randomization sampling designs has been lacking. In this paper we extend the model-based approach to the prediction of individuals in addition to totals and means. Since, given the sample, the conditional prediction error is zero for the selected units but positive for the units outside of the sample, it is possible to use the sampling design to control the unconditional individual prediction mean square errors. This immediately raises the need for probability sampling. It turns out that balancing between optimal prediction of the population total and control over individual predictions provides a fruitful model-based approach to sampling design. Apart from raising the need for probability sampling in general, it leads naturally to a number of important design features that are firmly established in the sampling practice, including the use of simple random sampling for homogeneous populations and unequal probability sampling otherwise, the division of a business population into the take-all, take-some and take-none units, the most common two-stage sampling designs, the use of stratification with proportional allocation, etc.. Most of them have not received adequate model-based treatment previously. Our approach enables us to give an appraisal of these methods from a prediction point of view.

Suggested Citation

  • Li-Chun Zhang & Ib Thomsen, 2005. "A prediction approach to sampling design," Discussion Papers 440, Statistics Norway, Research Department.
  • Handle: RePEc:ssb:dispap:440

    Download full text from publisher

    File URL:
    Download Restriction: no

    More about this item


    Individual prediction; business survey; unequal probability sampling; two-stage sampling; linear regression population; common parameter model;

    NEP fields

    This paper has been announced in the following NEP Reports:


    Access and download statistics


    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ssb:dispap:440. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (L Maasø). General contact details of provider: .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.