A framework for cut-off sampling in business survey design
Abstract
In sampling theory, the high concentration of the population with respect to most surveyed variables is a problem that is difficult to tackle with classical tools. One possible solution is cut-off sampling, which explicitly prescribes discarding part of the population; in particular, if the population is composed of firms or establishments, the method results in the exclusion of the “smallest” firms. Although this sampling scheme is common among practitioners, its theoretical foundations tend to be considered weak, because the inclusion probability of some units is equal to zero. In this paper we propose a framework to justify cut-off sampling and to determine the census and cut-off thresholds. We use an estimation model which takes as known the weight of the discarded units with respect to each variable; we compute the variance of the estimator and its bias, which is caused by violations of this assumption. We develop an algorithm which minimizes the MSE as a function of multivariate auxiliary information at the population level. Given the combinatorial optimization nature of the model, we resort to the theory of stochastic relaxation: in particular, we use the simulated annealing algorithm.
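To make the idea in the abstract concrete, the following is a minimal sketch of how simulated annealing could search for a cut-off threshold and a census threshold. It is not the authors' algorithm: it uses a single auxiliary variable rather than multivariate information, and the objective `proxy_mse`, the parameter `rel_err` (an assumed relative error in the known weight of the discarded units), the fixed overall sample size `n_total`, and the cooling schedule are all hypothetical choices made for illustration.

```python
import math
import random


def proxy_mse(xs, i_cut, i_census, n_total, rel_err):
    """Illustrative MSE proxy for one auxiliary variable (an assumption, not the paper's formula).

    xs is sorted ascending. Units xs[:i_cut] are discarded, xs[i_cut:i_census]
    are sampled by simple random sampling, xs[i_census:] are enumerated completely.
    """
    n_census = len(xs) - i_census
    n_sample = n_total - n_census          # sample size left for the sampled stratum
    stratum = xs[i_cut:i_census]
    n_s = len(stratum)
    if n_s < 2 or n_sample < 2 or n_sample > n_s:
        return math.inf                    # infeasible design
    mean = sum(stratum) / n_s
    s2 = sum((v - mean) ** 2 for v in stratum) / (n_s - 1)
    # Variance of the expansion estimator of the stratum total under SRS.
    variance = n_s ** 2 * (1.0 - n_sample / n_s) * s2 / n_sample
    # Assumed bias: the discarded total is known only up to a relative error.
    bias = rel_err * sum(xs[:i_cut])
    return variance + bias ** 2


def anneal_thresholds(x, n_total, rel_err, steps=3000, cooling=0.998, seed=0):
    """Search the two threshold positions with a basic simulated annealing loop."""
    rng = random.Random(seed)
    xs = sorted(x)
    n = len(xs)
    state = (0, n - n_total // 2)          # start: no cut-off, half the budget in the census
    cost = proxy_mse(xs, state[0], state[1], n_total, rel_err)
    best_state, best_cost = state, cost
    temp = cost                            # initial temperature on the scale of the objective
    for _ in range(steps):
        step = rng.randint(-5, 5)
        if rng.random() < 0.5:
            cand = (state[0] + step, state[1])
        else:
            cand = (state[0], state[1] + step)
        if not (0 <= cand[0] < cand[1] <= n):
            continue                       # thresholds out of range, try another move
        cand_cost = proxy_mse(xs, cand[0], cand[1], n_total, rel_err)
        delta = cand_cost - cost
        # Metropolis rule: always accept improvements, sometimes accept worse moves.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            state, cost = cand, cand_cost
            if cost < best_cost:
                best_state, best_cost = state, cost
        temp *= cooling
    return best_state, best_cost


# Synthetic skewed population standing in for firm sizes (made-up data).
rng = random.Random(1)
population = [rng.lognormvariate(0.0, 1.5) for _ in range(300)]
(i_cut, i_census), mse = anneal_thresholds(population, n_total=40, rel_err=0.05)
print("discarded units:", i_cut, "census units:", len(population) - i_census)
```

The sketch shows the trade-off the abstract describes: moving the cut-off upward shrinks the sampled stratum and its variance but increases the bias term, while enlarging the census removes the largest units from the sampled stratum at the cost of sample size.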
Bibliographic Info
Paper provided by Department of Economics, University of Trento, Italia in its series Department of Economics Working Papers with number 0709.
Date of creation: 2007
Keywords: Cut-off sampling; skewed populations; model-based estimation; optimal stratification; simulated annealing
JEL classification:
- C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
- D92 - Microeconomics - - Intertemporal Choice - - - Intertemporal Firm Choice, Investment, Capacity, and Financing
- L60 - Industrial Organization - - Industry Studies: Manufacturing - - - General
- O18 - Economic Development, Technological Change, and Growth - - Economic Development - - - Urban, Rural, Regional, and Transportation Analysis; Housing; Infrastructure
- R12 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General Regional Economics - - - Size and Spatial Distributions of Regional Economic Activity; Interregional Trade (economic geography)
This paper has been announced in the following NEP Reports:
- NEP-ALL-2007-05-12 (All new papers)
- NEP-CMP-2007-05-12 (Computational Economics)
- NEP-ECM-2007-05-12 (Econometrics)
Cited by:
- Orietta Luzi & Gianni Seri & Viviana De Giorgi & Giampiero Siesto, 2013. "Estimating Business Statistics by integrating administrative and survey data: an experimental study on small and medium enterprises," Rivista di statistica ufficiale, ISTAT - Italian National Institute of Statistics - (Rome, ITALY), vol. 15(2-3), pages 31-50.