IDEAS home Printed from https://ideas.repec.org/p/arz/wpaper/eres2018_165.html
   My bibliography  Save this paper

Towards the broad application of machine learning for document classification and data migration in real estate

Author

Listed:
  • Mario Bodenbender
  • Björn-Martin Kurzrock

Abstract

Real estate is increasingly becoming an asset class subject to the same requirements as other capital investments. As a consequence, the strategic relevance of real estate portfolios has gained in importance for many businesses. The resulting large quantities of documentation and information require a structured database system, in which information and documents will remain permanently transparent, complete, and findable. Portfolio and operating documentation must be reliably and consistently available to a variety of actors, over a period of decades. In order to facilitate effective document protection, administration and access at all times, it is necessary to establish a unique structure and identification system for the information. In practice, however, there are a variety of existing standards relating to document structures for particular lifecycle phases and for transmission of the data between specific phases. The documents are consequently subject to repeated restructuring throughout their lifecycle - a process that is expensive and entails a risk of data loss.The paper describes an approach for unifying and establishing compatibility between the existing document structure standards throughout the property's lifecycle, making use of unique document classes. The goal is to achieve a stable, unique document classification, accompanied by a capacity to automatically classify relevant (and, in particular, unstructured) documents. In this way, in the course of digitalization or migration, it will be possible to directly associate documents with a document class and thus ensure that they have a single unique classification throughout their lifecycle; they can then be displayed (by the users) in restructured forms for specific use cases at any time without incurring additional costs.In order to determine to what extent this process can be automated with machine learning, a range of algorithms were applied to real building documentation, analyzed, tested for reliability and optimized to building-specific data. The analysis demonstrated that not all digitalized documents are directly suited to automated classification; the paper therefore illustrates the associated problems, presenting detailed recommendations for how to facilitate automated classification and migration using machine learning. In this way, major errors can be avoided from the very beginning of the digitalization process.

Suggested Citation

  • Mario Bodenbender & Björn-Martin Kurzrock, 2018. "Towards the broad application of machine learning for document classification and data migration in real estate," ERES eres2018_165, European Real Estate Society (ERES).
  • Handle: RePEc:arz:wpaper:eres2018_165
    as

    Download full text from publisher

    File URL: https://eres.architexturez.net/doc/oai-eres-id-eres2018-165
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    Artificial Intelligence; data room; Digitization; document classification; real estate data;
    All these keywords.

    JEL classification:

    • R3 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - Real Estate Markets, Spatial Production Analysis, and Firm Location

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arz:wpaper:eres2018_165. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Architexturez Imprints (email available below). General contact details of provider: https://edirc.repec.org/data/eressea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.