(String Matching Algorithms,An Applicatione ti ISAE and ISTAT Firms's Registers)
The aim of the paper is to develop algorithms for the matching of strings from two different registers of Italian enterprises. This makes effective the possibility of integrating the firm-level information collected by ISAE (Institute for Studies and Economic Analyses) with the one gathered by the Italian NSI. The name of the company is the reference information used for string matching. The first procedure is based on a backward recursive application of the critical factorization method. In the second algorithm, a shifting rule is defined based on the positions of blanks within the reference text T. About 80% of the units in ISAE register of business enterprises have been exactly matched. This suggests that the role of firms’ demography should be adequately taken into account to get satisfactory results and confirms the reliability of the proposed algorithms. Some structural characteristics of ISAE sampled firms can be now accounted for.
|Date of creation:||Jun 2009|
|Contact details of provider:|| Postal: Via Cesare Balbo 16, Roma|
Web page: http://www.istat.it/en/
More information through EDIRC
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Marco Malgarini & Patrizia Margani & Bianca Maria Martelli, 2005.
"Re-engineering the ISAE manufacturing survey,"
ISAE Working Papers
47, ISTAT - Italian National Institute of Statistics - (Rome, ITALY).
- Malgarini, Marco & Margani, Patrizia & Martelli, Bianca Maria, 2005. "Re-engineering the ISAE manufacturing survey," MPRA Paper 42440, University Library of Munich, Germany.
- Jovanovic, Boyan, 1982. "Selection and the Evolution of Industry," Econometrica, Econometric Society, vol. 50(3), pages 649-670, May. Full references (including those not matched with items on IDEAS)
When requesting a correction, please mention this item's handle: RePEc:isa:wpaper:115. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Stefania Rossetti)
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.