This file is part of IDEAS, which uses RePEc data



String Matching Algorithms: An Application to ISAE and ISTAT Firms' Registers

Author Info
Emma De Angelis (ISAE - Institute for Studies and Economic Analyses)
Carmine Pappalardo
Abstract

The aim of the paper is to develop algorithms for matching strings from two different registers of Italian enterprises. This makes it possible to integrate the firm-level information collected by ISAE (Institute for Studies and Economic Analyses) with that gathered by the Italian national statistical institute (ISTAT). The name of the company is the reference information used for string matching. The first procedure is based on a backward recursive application of the critical factorization method. In the second algorithm, a shifting rule is defined based on the positions of blanks within the reference text T. About 80% of the units in the ISAE register of business enterprises have been exactly matched. This suggests that the role of firms’ demography should be adequately taken into account to get satisfactory results, and it confirms the reliability of the proposed algorithms. Some structural characteristics of ISAE sampled firms can now be accounted for.
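
The abstract does not spell out either algorithm, so the following is only a minimal sketch of one possible reading of a blank-based shifting rule, not the authors' implementation. It assumes that a company name (the pattern) is searched for in a reference text T and that, after a failed comparison, the search window is shifted to the character following the next blank, so that only word-aligned positions are tried. The function names `word_aligned_find` and `find_in_register`, the normalization step, and the sample firm names are all hypothetical and chosen for illustration.

```python
from typing import List


def normalize(name: str) -> str:
    """Upper-case a name and collapse runs of whitespace (assumed preprocessing)."""
    return " ".join(name.upper().split())


def word_aligned_find(text: str, pattern: str) -> int:
    """Return the first index where `pattern` occurs in `text` at a word start, or -1.

    Hypothetical shifting rule: after a mismatch, jump to the character
    that follows the next blank in `text` instead of advancing by one.
    """
    pos = 0
    while pos + len(pattern) <= len(text):
        if text.startswith(pattern, pos):
            return pos
        next_blank = text.find(" ", pos)
        if next_blank == -1:
            return -1  # no further word start to try
        pos = next_blank + 1
    return -1


def find_in_register(pattern: str, register: List[str]) -> List[str]:
    """Return the register entries that contain `pattern` at a word boundary."""
    key = normalize(pattern)
    return [name for name in register
            if word_aligned_find(normalize(name), key) != -1]


# Hypothetical register entries, invented for illustration only.
sample_register = ["Rossi Mario  S.r.l.", "Officine Bianchi SpA", "Bianchini Tessile"]
hits = find_in_register("bianchi", sample_register)
```

With these invented names, `hits` contains "Officine Bianchi SpA" and also "Bianchini Tessile", because the sketch checks only the start of a word and not its end. A stricter variant would also require a blank or the end of the text after the match.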

Download Info

File URL: http://www.isae.it/Working_Papers/WP_115_2009_DeAngelis_Pappalardo.pdf
File Format: application/pdf
Download Restriction: no

Publisher Info
Paper provided by ISAE - Institute for Studies and Economic Analyses - (Rome, ITALY) in its series ISAE Working Papers with number 115.

Length: 26 pages
Date of creation: Jun 2009
Handle: RePEc:isa:wpaper:115

Contact details of provider:
Postal: Piazza dell'Indipendenza, No. 4, 00185 Rome
Fax: +39-06-44482219
Web page: http://www.isae.it

For technical questions regarding this item, or to correct its listing, contact: (Anita Guelfi).

Related research
Keywords: String matching algorithms; critical factorization method; shifting rule; firms' registers; firms’ demography

Find related papers by JEL classification:
C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Microeconomic Data
C87 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Econometric Software
L60 - Industrial Organization - - Industry Studies: Manufacturing - - - General



This page was last updated on 2009-11-13.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.