Automatic recognition and classification of future work sentences from academic articles in a specific domain

Author

Listed:

Zhang, Chengzhi
Xiang, Yi
Hao, Wenke
Li, Zhicheng
Qian, Yuchen
Wang, Yuzhuo

Abstract

Future work sentences (FWS) are sentences in academic papers in which the authors describe their proposed follow-up research directions. This paper presents methods to automatically extract FWS from academic papers and classify them according to the future directions they describe. Automatic FWS recognition will enable subsequent researchers to locate future work sentences more accurately and quickly, and will reduce the time and cost of acquiring a corpus. It will also reveal how the content of future work changes, and lay a foundation for more in-depth semantic analysis of future work sentences. Existing work on the automatic identification of FWS is relatively scarce, and existing methods cannot accurately identify FWS in academic papers, which prevents data mining on a large scale. Furthermore, future work covers many aspects, and subdividing its content facilitates the analysis of specific development directions. In this paper, Natural Language Processing (NLP) is used as a case study: FWS are extracted from academic papers and classified into different types. We manually build an annotated corpus with six different types of FWS. Automatic recognition and classification of FWS are then implemented using machine learning models, and the performance of these models is compared using evaluation metrics. The results show that the Bernoulli Bayesian model performs best in the automatic recognition task, with a Macro F1 of 90.73%, and the SciBERT model performs best in the automatic classification task, with a weighted average F1 of 72.63%. Finally, we extract keywords from FWS to gain a deeper understanding of the key content they describe, and we demonstrate that the content of FWS is reflected in subsequent research work by measuring the similarity between future work sentences and the abstracts.
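
The abstract reports a Macro F1 for the recognition task and a weighted average F1 for the classification task. These are two different ways of averaging per-class F1 scores, so the two numbers are not directly comparable. The sketch below shows how they differ, using the standard definitions. The labels "fws" and "other" and the toy predictions are illustrative assumptions, not data or categories taken from the paper.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: F1} for every label seen in y_true or y_pred."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    scores = {}
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the label never occurs.
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1: every class counts equally."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


def weighted_f1(y_true, y_pred):
    """Mean of per-class F1 weighted by each class's count in y_true."""
    scores = per_class_f1(y_true, y_pred)
    support = Counter(y_true)
    return sum(scores[label] * support[label] for label in scores) / len(y_true)


if __name__ == "__main__":
    # Hypothetical labels: two true FWS, four other sentences, one FWS missed.
    y_true = ["fws", "fws", "other", "other", "other", "other"]
    y_pred = ["fws", "other", "other", "other", "other", "other"]
    print("macro F1:   ", round(macro_f1(y_true, y_pred), 4))
    print("weighted F1:", round(weighted_f1(y_true, y_pred), 4))
```

On imbalanced data such as this toy example, the macro average is pulled down more by the weaker minority class, while the weighted average is dominated by the majority class.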

Suggested Citation

Zhang, Chengzhi & Xiang, Yi & Hao, Wenke & Li, Zhicheng & Qian, Yuchen & Wang, Yuzhuo, 2023. "Automatic recognition and classification of future work sentences from academic articles in a specific domain," Journal of Informetrics, Elsevier, vol. 17(1).

Handle: RePEc:eee:infome:v:17:y:2023:i:1:s1751157722001262
DOI: 10.1016/j.joi.2022.101373

Citations

Cited by:

Li, Munan & Wang, Liang, 2025. "Leveraging patent classification based on deep learning: The case study on smart cities and industrial Internet of Things," Journal of Informetrics, Elsevier, vol. 19(1).
Biao Zhang & Yunwei Chen, 2024. "Automated recognition of innovative sentences in academic articles: semi-automatic annotation for cost reduction and SAO reconstruction for enhanced data," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(9), pages 5403-5432, September.
Yu, Dejian & Xiang, Bo, 2024. "An ESTs detection research based on paper entity mapping: Combining scientific text modeling and neural prophet," Journal of Informetrics, Elsevier, vol. 18(4).
