IDEAS home Printed from https://ideas.repec.org/a/spr/aodasc/v11y2024i1d10.1007_s40745-022-00424-6.html
   My bibliography  Save this article

Machine Learning Algorithms for Crime Prediction under Indian Penal Code

Author

Listed:
  • Rabia Musheer Aziz

    (VIT Bhopal University)

  • Prajwal Sharma

    (VIT Bhopal University)

  • Aftab Hussain

    (VIT Bhopal University)

Abstract

In this paper, the authors propose a data-driven approach to draw insightful knowledge from the Indian crime data. The proposed approach can be helpful for police and other law enforcement bodies in India for controlling and preventing crime region-wise. In the proposed approach different regression models are built based on different regression algorithms, viz., random forest regression (RFR), decision tree regression (DTR), multiple linear regression (MLR), simple linear regression (SLR), and support vector regression (SVR) after pre-processing the data using MySQL Workbench and R programming. These regression models can predict 28 different types of IPC cognizable crime counts and also a total number of Indian Penal Code (IPC) cognizable crime counts region-wise, state-wise, and year-wise (for all over the country) provided the desired inputs to the model. Data visualization techniques, namely, chord diagrams and map plots, are used to visualize pre-processed data (corresponding to the years 2014 to 2020) and predicted data by the relatively best regression model for the year 2022. For the chosen data, it is concluded that Random Forest Regression (RFR), which predicts total IPC cognizable crime, fits relatively the best, with a 0.96 adjusted r squared value and a MAPE value of 0.2, and among regression models predicting region-wise theft crime count, the random forest regression-based model relatively fits the best, with an adjusted R squared value of 0.96 and a MAPE value of 0.166. These regression models predict that Andhra Pradesh state will have the highest crime counts, with Adilabad district at the top, having 31,933 predicted crime counts.

Suggested Citation

  • Rabia Musheer Aziz & Prajwal Sharma & Aftab Hussain, 2024. "Machine Learning Algorithms for Crime Prediction under Indian Penal Code," Annals of Data Science, Springer, vol. 11(1), pages 379-410, February.
  • Handle: RePEc:spr:aodasc:v:11:y:2024:i:1:d:10.1007_s40745-022-00424-6
    DOI: 10.1007/s40745-022-00424-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-022-00424-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-022-00424-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kassem, Mohamad & Ali, Amjad & Audi, Marc, 2019. "Unemployment Rate, Population Density and Crime Rate in Punjab (Pakistan): An Empirical Analysis," MPRA Paper 95964, University Library of Munich, Germany.
    2. Mohamad Kassem & Amjad Ali & Marc Audi, 2019. "Unemployment Rate, Population Density and Crime Rate in Punjab (Pakistan): An Empirical Analysis," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 8(2), pages 92-104, June.
    3. Rabia Aziz & C. K. Verma & Namita Srivastava, 2018. "Artificial Neural Network Classification of High Dimensional Data with Novel Optimization Approach of Dimension Reduction," Annals of Data Science, Springer, vol. 5(4), pages 615-635, December.
    4. Javad Hosseinkhani & Hamed Taherdoost & Solmaz Keikhaee, 2021. "ANTON Framework Based on Semantic Focused Crawler to Support Web Crime Mining Using SVM," Annals of Data Science, Springer, vol. 8(2), pages 227-240, June.
    5. James M. Tien, 2017. "Internet of Things, Real-Time Decision Making, and Artificial Intelligence," Annals of Data Science, Springer, vol. 4(2), pages 149-178, June.
    6. Vojo Lakovic, 2020. "Modeling of Entrepreneurship Activity Crisis Management by Support Vector Machine," Annals of Data Science, Springer, vol. 7(4), pages 629-638, December.
    7. Mamta Mittal & Lalit Mohan Goyal & Jasleen Kaur Sethi & D. Jude Hemanth, 2019. "Monitoring the Impact of Economic Crisis on Crime in India Using Machine Learning," Computational Economics, Springer;Society for Computational Economics, vol. 53(4), pages 1467-1485, April.
    8. Suellen Teixeira Zavadzki de Pauli & Mariana Kleina & Wagner Hugo Bonat, 2020. "Comparing Artificial Neural Network Architectures for Brazilian Stock Market Prediction," Annals of Data Science, Springer, vol. 7(4), pages 613-628, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Amjad Ali & Marc Audi & Chan Bibi & Yannick Roussel, 2021. "The Impact of Gender Inequality and Environmental Degradation on Human Well-being in the Case of Pakistan: A Time Series Analysis," International Journal of Economics and Financial Issues, Econjournals, vol. 11(2), pages 92-99.
    2. Aftab Ahmad, 2020. "Poverty Terrorism Nexus: A Case Study Of Pakistan," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 9(4), pages 162-172, December.
    3. Muhammad Shahid & Khalil Ahmad & Muhammad Amir Inayat & Muhammad Kashif Bhatti, 2024. "Socio-Economic Determinants of Property Crime Across the Districts of Punjab: Highlighting the Role of Law Enforcement Agencies of Pakistan," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 13(2), pages 22-36.
    4. Muhammad Bilal Ahmad & Ghulam Mustafa & Dr. Muhammad Asif Shahzad, 2021. "A Comparative Study Of Public And Private Students’ Attitude Towards Learning English At Secondary School Level," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 10(4), pages 101-106, December.
    5. Farooq Ahmad & Amna Gul & Syed Ali Raza Hamid & Zunaira Mahmood & Shahida Mariam, 2021. "Employees’ Own Personality May Induce Their Victimization At Work: Evidence From Universities In Lahore," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 10(4), pages 13-21, December.
    6. Zerish Tasleem & Muhammad Hatim & Mahnoor Malik & Muhammad Nadeem & Muhammad Tariq Ramzan, 2022. "The Impact Of Health Facilities On Rural Poverty In Southern Punjab, Pakistan," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 11(2), pages 104-109, June.
    7. Arif Khan & Gul Zeb Chaudhary, 2020. "Determinants Of Inflation In Case Of Pakistan," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 9(4), pages 151-161, December.
    8. Roussel, Yannick & Ali, Amjad & Audi, Marc, 2021. "Measuring the Money Demand in Pakistan: A Time Series Analysis," MPRA Paper 106629, University Library of Munich, Germany.
    9. Arzoo Mushtaq & Shahnawaz Malik & Muhammad Hanif Akhtar, 2022. "Nonlinear Taylor Rule And Inflation-Targeting In Pakistan: A Time Series Analysis," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 11(2), pages 185-197, June.
    10. Manoj Verma & Harish Kumar Ghritlahre & Surendra Bajpai, 2023. "A Case Study of Optimization of a Solar Power Plant Sizing and Placement in Madhya Pradesh, India Using Multi-Objective Genetic Algorithm," Annals of Data Science, Springer, vol. 10(4), pages 933-966, August.
    11. Muhammad Rahat Abbas & Barkat Ullah, 2023. "The Impact of Credit and Liquidity Risk on Bank Performance," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 12(4), pages 205-218.
    12. repec:rfh:jprjor:v:6:y:2020:i:2:p:7-11 is not listed on IDEAS
    13. Huanyu Ma & Yan Xu & Yulong Liu, 2022. "Prediction of Listed Company Growth in Non-public Economy," Annals of Data Science, Springer, vol. 9(4), pages 847-861, August.
    14. SHAHID MANZOOR SHAH & Nooria Shams-U-Din, 2020. "Determinants Of Death Rates In Pakistan: An Empirical Analysis," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 9(3), pages 141-150, September.
    15. Naveed Mushtaq & Muhammad Asim & Mohsin Raza Khan & Tanveer Illahi & Abdul Qayyum, 2021. "Impact Of Employee Displayed Emotion On Perceived Waiting Time Of Clients Among Islamic Banks Of Paksitan," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 10(1), pages 99-113, March.
    16. Aftab Anwar & Mubashar Nadeem & Gulfam Nawaz & Ambreen Siddique, 2021. "Socio-Economic Crisis Of The Mothers Of Special Children During Covid-19: A Reflective Study," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 10(4), pages 22-27, December.
    17. repec:rfh:jprjor:v:8:y:2022:i:3:p:107-112 is not listed on IDEAS
    18. Ismail Senturk & Amna Shafiq Minhas, 2020. "Researching Shadow Education: Methodological Challenges And Directions," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 9(4), pages 173-182, December.
    19. Khalil Ahmad & Ismail Senturk, 2021. "Health Structure, Nutrition And Economic Growth In Pakistan: A Time Series Analysis," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 10(1), pages 42-50, March.
    20. repec:rfh:jprjor:v:6:y:2020:i:2:p:23-29 is not listed on IDEAS
    21. Muhammad Ashraf & Arslan Ali Raza & Muhammad Ishaq, 2022. "A Novel Approach Of Social Media Analytics For Predicting National Consumer Confidence Index," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 11(2), pages 220-234, June.
    22. Muhammad Hatim & Zerish Tasleem & Muhammad Nadeem, 2022. "The Influence Of Education And Health On Rural Household Poverty: A Moderating Role Of Culture In Punjab, Pakistan," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 11(2), pages 120-133, June.
    23. Fiaz Ahmad Sulehri & Usman Ahmed & Wajid Alim, 2021. "Black Economy, Financial Inclusion, Financial Liberalization Nexus: A Panel Analysis Of Developing Countries," Bulletin of Business and Economics (BBE), Research Foundation for Humanity (RFH), vol. 10(3), pages 65-77.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aodasc:v:11:y:2024:i:1:d:10.1007_s40745-022-00424-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.