Measures of lexical distance between languages

My bibliography Save this article

Measures of lexical distance between languages

Author

Listed:

Petroni, Filippo
Serva, Maurizio

Registered:

Filippo Petroni

Abstract

The idea of measuring distance between languages seems to have its roots in the work of the French explorer Dumont D’Urville (1832) [13]. He collected comparative word lists for various languages during his voyages aboard the Astrolabe from 1826 to 1829 and, in his work concerning the geographical division of the Pacific, he proposed a method for measuring the degree of relation among languages. The method used by modern glottochronology, developed by Morris Swadesh in the 1950s, measures distances from the percentage of shared cognates, which are words with a common historical origin. Recently, we proposed a new automated method which uses the normalized Levenshtein distances among words with the same meaning and averages on the words contained in a list. Recently another group of scholars, Bakker et al. (2009) [8] and Holman et al. (2008) [9], proposed a refined version of our definition including a second normalization. In this paper we compare the information content of our definition with the refined version in order to decide which of the two can be applied with greater success to resolve relationships among languages.

Suggested Citation

Petroni, Filippo & Serva, Maurizio, 2010. "Measures of lexical distance between languages," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(11), pages 2280-2283.

Handle: RePEc:eee:phsmap:v:389:y:2010:i:11:p:2280-2283
DOI: 10.1016/j.physa.2010.02.004

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Russell D. Gray & Quentin D. Atkinson, 2003. "Language-tree divergence times support the Anatolian theory of Indo-European origin," Nature, Nature, vol. 426(6965), pages 435-439, November.
Russell D. Gray & Fiona M. Jordan, 2000. "Language trees support the express-train sequence of Austronesian expansion," Nature, Nature, vol. 405(6790), pages 1052-1055, June.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Isphording, Ingo E. & Piopiunik, Marc & Rodríguez-Planas, Núria, 2016. "Speaking in numbers: The effect of reading performance on math performance among immigrants," Economics Letters, Elsevier, vol. 139(C), pages 52-56.
- Isphording, Ingo E. & Piopiunik, Marc & Rodríguez-Planas, Núria, 2015. "Speaking in Numbers: The Effect of Reading Performance on Math Performance among Immigrants," IZA Discussion Papers 9433, Institute of Labor Economics (IZA).
- Ingo E. Isphording & Marc Piopiunik & Núria Rodríguez-Planas, 2015. "Speaking in Numbers: The Effect of Reading Performance on Math Performance among Immigrants," CESifo Working Paper Series 5589, CESifo.
- Isphording, Ingo E. & Piopiunik, Marc & Rodríguez-Planas, Núria, 2016. "Speaking in numbers: The effect of reading performance on math performance among immigrants," Munich Reprints in Economics 43490, University of Munich, Department of Economics.
Ingo Eduard Isphording & Sebastian Otten, 2013. "The Costs of Babylon—Linguistic Distance in Applied Economics," Review of International Economics, Wiley Blackwell, vol. 21(2), pages 354-369, May.
- Isphording, Ingo E. & Otten, Sebastian, 2012. "The Costs of Babylon – Linguistic Distance in Applied Economics," Ruhr Economic Papers 337, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
repec:zbw:rwirep:0337 is not listed on IDEAS
Gamallo, Pablo & Pichel, José Ramom & Alegria, Iñaki, 2017. "From language identification to language distance," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 484(C), pages 152-162.
Isphording, Ingo E. & Otten, Sebastian, 2014. "Linguistic barriers in the destination language acquisition of immigrants," Journal of Economic Behavior & Organization, Elsevier, vol. 105(C), pages 30-50.
- Isphording, Ingo E. & Otten, Sebastian, 2011. "Linguistic Distance and the Language Fluency of Immigrants," Ruhr Economic Papers 274, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
- Isphording, Ingo E. & Otten, Sebastian, 2014. "Linguistic Barriers in the Destination Language Acquisition of Immigrants," IZA Discussion Papers 8090, Institute of Labor Economics (IZA).
Ibrahim Bousmah & Gilles Grenier & David M. Gray, 2021. "Linguistic Distance, Languages of Work and Wages of Immigrants in Montreal," Journal of Labor Research, Springer, vol. 42(1), pages 1-28, March.
- Ibrahim Bousmah & Gilles Grenier & David Gray, 2018. "Linguistic Distance, Languages of Work and Wages of Immigrants in Montreal," Working Papers 1805E, University of Ottawa, Department of Economics.
Erkan Gören, 2013. "Economic Effects of Domestic and Neighbouring Countries’ Cultural Diversity," Working Papers V-352-13, University of Oldenburg, Department of Economics, revised Mar 2013.
- Erkan Gören, 2013. "Economic Effects of Domestic and Neighbouring Countries' Cultural Diversity," ZenTra Working Papers in Transnational Studies 16 / 2013, ZenTra - Center for Transnational Studies, revised Apr 2013.
Mehri, Ali & Jamaati, Maryam, 2021. "Statistical metrics for languages classification: A case study of the Bible translations," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).
repec:zbw:hohpro:352 is not listed on IDEAS
Espitia, Diego & Larralde, Hernán, 2020. "Universal and non-universal text statistics: Clustering coefficient for language identification," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 553(C).
repec:old:wpaper:352 is not listed on IDEAS
Ingo Eduard Isphording & Sebastian Otten, 2013. "The Costs of Babylon—Linguistic Distance in Applied Economics," Review of International Economics, Wiley Blackwell, vol. 21(2), pages 354-369, 05.
- Isphording, Ingo E. & Otten, Sebastian, 2012. "The Costs of Babylon – Linguistic Distance in Applied Economics," Ruhr Economic Papers 337, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
Lorraine Wong, 2023. "The effect of linguistic proximity on the labour market outcomes of the asylum population," Journal of Population Economics, Springer;European Society for Population Economics, vol. 36(2), pages 609-652, April.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Matthew J. Baker, 2021. "Foundations of the Age-Area Hypothesis," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-17, December.
- Matthew J. Baker, 2020. "Foundations of the Age-Area Hypothesis," Economics Working Paper Archive at Hunter College 451, Hunter College Department of Economics, revised 2021.
Klaus Desmet & Ignacio Ortuño-Ortín & Romain Wacziarg, 2009. "The political economy of ethnolinguistic cleavages," Working Papers 2009-17, Instituto Madrileño de Estudios Avanzados (IMDEA) Ciencias Sociales.
- Klaus Desmet & Ignacio Ortuño-Ortín & Romain Wacziarg, 2009. "The Political Economy of Ethnolinguistic Cleavages," NBER Working Papers 15360, National Bureau of Economic Research, Inc.
- Wacziarg, Romain & Desmet, Klaus & OrtuÃ±o-Ortin, Ignacio, 2009. "The Political Economy of Ethnolinguistic Cleavages," CEPR Discussion Papers 7478, C.E.P.R. Discussion Papers.
Victor Ginsburgh & Shlomo Weber, 2020. "The Economics of Language," Journal of Economic Literature, American Economic Association, vol. 58(2), pages 348-404, June.
- Victor Ginsburgh & Shlomo Weber, 2018. "The Economics of Language," Working Papers ECARES 2018-18, ULB -- Universite Libre de Bruxelles.
- Ginsburgh, Victor & Weber, Shlomo, 2020. "The Economics of Language," LIDAM Reprints CORE 3118, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Weber, Shlomo & Ginsburgh, Victor, 2018. "The Economics of Language," CEPR Discussion Papers 13002, C.E.P.R. Discussion Papers.
Aparicio Fenoll, Ainoa & Kuehn, Zoë, 2016. "Education Policies and Migration across European Countries," IZA Discussion Papers 9755, Institute of Labor Economics (IZA).
- Ainhoa Aparicio Fenoll & Zoe Kuehn, 2016. "Education Policies and Migration across European Countries," CHILD Working Papers Series 42 JEL Classification: J6, Centre for Household, Income, Labour and Demographic Economics (CHILD) - CCA.
Ainhoa Aparicio Fenoll & Zoë Kuehn, 2017. "Compulsory Schooling Laws and Migration Across European Countries," Demography, Springer;Population Association of America (PAA), vol. 54(6), pages 2181-2200, December.
Stanisz, Tomasz & Drożdż, Stanisław & Kwapień, Jarosław, 2023. "Universal versus system-specific features of punctuation usage patterns in major Western languages," Chaos, Solitons & Fractals, Elsevier, vol. 168(C).
Stelios Michalopoulos, 2012. "The Origins of Ethnolinguistic Diversity," American Economic Review, American Economic Association, vol. 102(4), pages 1508-1539, June.
- Stelios Michalopoulos, 2009. "The Origins of Ethnolinguistic Diversity," Carlo Alberto Notebooks 110, Collegio Carlo Alberto.
- Stelios Michalopoulos, 2011. "The Origins of Technolinguistic Diversity," Economics Working Papers 0095, Institute for Advanced Study, School of Social Science.
Carl MÃ¼ller-Crepon & Yannick Pengl & Nils-Christian Bormann, 2022. "Linking Ethnic Data from Africa (LEDA)," Journal of Peace Research, Peace Research Institute Oslo, vol. 59(3), pages 425-435, May.
Nico Neureiter & Peter Ranacher & Nour Efrat-Kowalsky & Gereon A. Kaiping & Robert Weibel & Paul Widmer & Remco R. Bouckaert, 2022. "Detecting contact in language trees: a Bayesian phylogenetic model with horizontal transfer," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-14, December.
Victor GINSBURGH & Shlomo WEBER, 2016. "Linguistic distances and ethnolinguistic fractionalization and disenfranchisement indices," LIDAM Reprints CORE 2855, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Victor Ginsburgh & Shlomo Weber, 2016. "Linguistic distances and ethnolinguistic fractionalization and disenfranchisement indices," ULB Institutional Repository 2013/236842, ULB -- Universite Libre de Bruxelles.
- Victor Ginsburgh & Shlomo Weber, 2016. "Linguistic Distances and Ethno-Linguistic Fractionalisation and Disenfranchisement Indices," Working Papers ECARES ECARES 2016-25, ULB -- Universite Libre de Bruxelles.
Aguilar, Elliot & Ghirlanda, Stefano, 2015. "Modeling the genealogy of a cultural trait," Theoretical Population Biology, Elsevier, vol. 101(C), pages 1-8.
Victor Zitian Chen & John Cantwell, 2022. "An evolutionary view of institutional complexity," Journal of Evolutionary Economics, Springer, vol. 32(3), pages 1071-1090, July.
Marcelo A Montemurro & Damián H Zanette, 2011. "Universal Entropy of Word Ordering Across Linguistic Families," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-9, May.
Taraka Rama, 2013. "Phonotactic Diversity Predicts the Time Depth of the World’s Language Families," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-9, May.
Arthur J. Robson, 2010. "A bioeconomic view of the Neolithic transition to agriculture," Canadian Journal of Economics, Canadian Economics Association, vol. 43(1), pages 280-300, February.
- Arthur J. Robson, 2010. "A bioeconomic view of the Neolithic transition to agriculture," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 43(1), pages 280-300, February.
Alexei S. Kassian & George Starostin, 2025. "Do ‘language trees with sampled ancestors’ really support a ‘hybrid model’ for the origin of Indo-European? Thoughts on the most recent attempt at yet another IE phylogeny," Palgrave Communications, Palgrave Macmillan, vol. 12(1), pages 1-10, December.
Marc Allassonnière-Tang & Olof Lundgren & Maja Robbers & Sandra Cronhamn & Filip Larsson & One-Soon Her & Harald Hammarström & Gerd Carling, 2021. "Expansion by migration and diffusion by contact is a source to the global diversity of linguistic nominal categorization systems," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-6, December.
Joseph Flavian Gomes, 2020. "The health costs of ethnic distance: evidence from sub-Saharan Africa," Journal of Economic Growth, Springer, vol. 25(2), pages 195-226, June.
- Gomes, Joseph, 2014. "The health costs of ethnic distance: evidence from Sub-Saharan Africa," ISER Working Paper Series 2014-33, Institute for Social and Economic Research.
- Joseph Flavian Gomes, 2020. "The Health Costs of Ethnic Distance: Evidence from Sub-Saharan Africa," LIDAM Discussion Papers IRES 2020005, Université catholique de Louvain, Institut de Recherches Economiques et Sociales (IRES).
- Joseph Flavian Gomes, 2017. "The Health Costs of Ethnic Distance: Evidence from Sub-Saharan Africa," NCID Working Papers 04/2017, Navarra Center for International Development, University of Navarra.
- Gomes, Joseph Flavian, 2020. "The Health Costs of Ethnic Distance: Evidence from Sub-Saharan Africa," CEPR Discussion Papers 14332, C.E.P.R. Discussion Papers.
Job Schepens & Ton Dijkstra & Franc Grootjen & Walter J B van Heuven, 2013. "Cross-Language Distributions of High Frequency and Phonetically Similar Cognates," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-15, May.
Paola Giuliano, 2016. "Review of Cultural Evolution: Society, Technology, Language, and Religion Edited by Peter J. Richerson and Morten H. Christiansen," Journal of Economic Literature, American Economic Association, vol. 54(2), pages 522-533, June.

More about this item

Keywords

; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:389:y:2010:i:11:p:2280-2283. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Measures of lexical distance between languages

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data