
Modern Methods of Extracting Key Information From Regulatory Documents

Author

Listed:
  • Maria A. Milkova
  • Ivan V. Nevolin
  • Dmitriy P. Pigorev

Abstract

This article seeks to identify the difficulties of analyzing legal documents in economic and interdisciplinary research and to propose approaches for overcoming them. The ultimate goal is to bring advances in computational linguistics and natural language analysis into the discourse of the digital economy, in order to develop methods that support decision-making and strategy development based on the analysis of textual information. When the volume of information is too large, is constantly updated, and/or the area of study is new, the most expedient first step is to obtain the general structure of the entire document collection, a kind of semantic compression of the information. The practical part develops an approach for analyzing regulations that govern food and nutrition, in particular those related to the prevention of iron deficiency anemia (IDA). The approach extracts key information (keywords and key sentences) from voluminous texts using the graph-based TextRank algorithm. Visualizing the semantic relationships between words within documents is another important aid to understanding. In our opinion, the combination of semantic compression and visualization as a "close-up" of text documents, together with the possibility of further detail through linear reading and analysis, is the most relevant approach under conditions of information overload and attention deficit. Actively introducing text analytics methods is especially important for systems that do not take part in attention markets and that lag significantly behind in the convenience of extracting meaningful information. Approaches that improve the understanding of large volumes of regulations will be of significant value to researchers in economic, legal, or multidisciplinary fields.
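
The abstract names TextRank for keyword extraction but gives no implementation details. The following is a minimal, generic sketch of the keyword-ranking idea, not the authors' pipeline: words become nodes of a co-occurrence graph, and a PageRank-style iteration scores them. The function name `textrank_keywords`, the tiny stopword list, the window size, and the damping factor are all illustrative assumptions.

```python
import re
from collections import defaultdict

# Tiny illustrative stopword list (an assumption; real pipelines use fuller lists).
STOPWORDS = {"the", "of", "and", "in", "to", "a", "is", "for", "on", "with", "by", "as"}


def textrank_keywords(text, window=3, damping=0.85, iterations=50, top_k=5):
    """Rank words by a PageRank-style score over an undirected co-occurrence graph."""
    words = [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

    # Link each word to the words that follow it inside a sliding window of `window` tokens.
    neighbors = defaultdict(set)
    for i, word in enumerate(words):
        for other in words[i + 1 : i + window]:
            if other != word:
                neighbors[word].add(other)
                neighbors[other].add(word)
    if not neighbors:
        return []

    # Power iteration: each word receives an equal share of every neighbor's score.
    scores = {w: 1.0 for w in neighbors}
    for _ in range(iterations):
        scores = {
            w: (1 - damping)
            + damping * sum(scores[n] / len(neighbors[n]) for n in neighbors[w])
            for w in neighbors
        }

    # Highest score first; ties broken alphabetically for a stable result.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]


if __name__ == "__main__":
    sample = "iron deficiency anemia iron prevention iron nutrition iron"
    print(textrank_keywords(sample, top_k=3))
```

Key sentences can be ranked the same way by treating sentences as nodes and weighting edges by a sentence-similarity measure; that variant is omitted here for brevity.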

Suggested Citation

  • Maria A. Milkova & Ivan V. Nevolin & Dmitriy P. Pigorev, 2021. "Modern Methods of Extracting Key Information From Regulatory Documents," Economics of Contemporary Russia, Regional Public Organization for Assistance to the Development of Institutions of the Department of Economics of the Russian Academy of Sciences, issue 2.
  • Handle: RePEc:ack:journl:y:2021:id:657
    DOI: 10.33293/1609-1442-2021-2(93)-101-114

    Download full text from publisher

    File URL: https://www.ecr-journal.ru/jour/article/viewFile/657/421
    Download Restriction: no

    File URL: https://libkey.io/10.33293/1609-1442-2021-2(93)-101-114?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

