Emma De Angelis (ISAE - Institute for Studies and Economic Analyses) Carmine Pappalardo
Abstract
The aim of the paper is to develop algorithms for the matching of strings from two different registers of Italian enterprises. This makes effective the possibility of integrating the firm-level information collected by ISAE (Institute for Studies and Economic Analyses) with the one gathered by the Italian NSI. The name of the company is the reference information used for string matching. The first procedure is based on a backward recursive application of the critical factorization method. In the second algorithm, a shifting rule is defined based on the positions of blanks within the reference text T. About 80% of the units in ISAE register of business enterprises have been exactly matched. This suggests that the role of firms’ demography should be adequately taken into account to get satisfactory results and confirms the reliability of the proposed algorithms. Some structural characteristics of ISAE sampled firms can be now accounted for.
Download Info
To download:
If you experience problems downloading a file, check if you have the
proper application to
view it first. Information about this may be contained
in the File-Format links below. In case of further problems read
the IDEAS help
page. Note that these files are not on the IDEAS
site. Please be patient as the files may be large.
Publisher Info
Paper provided by ISAE - Institute for Studies and Economic Analyses - (Rome, ITALY) in its series ISAE Working Papers with number
115.
Find related papers by JEL classification: C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Microeconomic Data C87 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Econometric Software L60 - Industrial Organization - - Industry Studies: Manufacturing - - - General
This paper has been announced in the following NEP Reports: