IDEAS home Printed from https://ideas.repec.org/a/gam/jecnmx/v11y2023i3p22-d1229353.html

Detecting Pump-and-Dumps with Crypto-Assets: Dealing with Imbalanced Datasets and Insiders’ Anticipated Purchases

Author

Listed:
  • Dean Fantazzini

    (Moscow School of Economics, Moscow State University, 119992 Moscow, Russia
    Faculty of Economic Sciences, Higher School of Economics, 109028 Moscow, Russia
    These authors contributed equally to this work.)

  • Yufeng Xiao

    (Moscow School of Economics, Moscow State University, 119992 Moscow, Russia
    These authors contributed equally to this work.)

Abstract

Detecting pump-and-dump schemes involving cryptoassets with high-frequency data is challenging due to imbalanced datasets and the early occurrence of unusual trading volumes. To address these issues, we propose constructing synthetic balanced datasets using resampling methods and flagging a pump-and-dump from the moment of public announcement up to 60 min beforehand. We validated our proposals using data from Pumpolymp and the CryptoCurrency eXchange Trading Library to identify 351 pump signals relative to the Binance crypto exchange in 2021 and 2022. We found that the most effective approach was using the original imbalanced dataset with pump-and-dumps flagged 60 min in advance, together with a random forest model with data segmented into 30-s chunks and regressors computed with a moving window of 1 h. Our analysis revealed that a better balance between sensitivity and specificity could be achieved by simply selecting an appropriate probability threshold, such as setting the threshold close to the observed prevalence in the original dataset. Resampling methods were useful in some cases, but threshold-independent measures were not affected. Moreover, detecting pump-and-dumps in real-time involves high-dimensional data, and the use of resampling methods to build synthetic datasets can be time-consuming, making them less practical.

Suggested Citation

  • Dean Fantazzini & Yufeng Xiao, 2023. "Detecting Pump-and-Dumps with Crypto-Assets: Dealing with Imbalanced Datasets and Insiders’ Anticipated Purchases," Econometrics, MDPI, vol. 11(3), pages 1-73, August.
  • Handle: RePEc:gam:jecnmx:v:11:y:2023:i:3:p:22-:d:1229353
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2225-1146/11/3/22/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2225-1146/11/3/22/
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. King, Gary & Zeng, Langche, 2001. "Logistic Regression in Rare Events Data," Political Analysis, Cambridge University Press, vol. 9(2), pages 137-163, January.
    2. López-Ratón, Mónica & Rodríguez-Álvarez, María Xosé & Cadarso-Suárez, Carmen & Gude-Sampedro, Francisco, 2014. "OptimalCutpoints: An R Package for Selecting Optimal Cutpoints in Diagnostic Tests," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i08).
    3. Withanawasam, R.M. & Whigham, P.A. & Crack, T.F., 2013. "Characterising trader manipulation in a limit-order driven market," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 93(C), pages 43-52.
    4. Freeman, Elizabeth A. & Moisen, Gretchen G., 2008. "A comparison of the performance of threshold criteria for binary classification in terms of predicted prevalence and kappa," Ecological Modelling, Elsevier, vol. 217(1), pages 48-58.
    5. Jiahua Xu & Benjamin Livshits, 2018. "The Anatomy of a Cryptocurrency Pump-and-Dump Scheme," Papers 1811.10109, arXiv.org, revised Aug 2019.
    6. Ouyang, Liangyi & Cao, Bolong, 2020. "Selective pump-and-dump: The manipulation of their top holdings by Chinese mutual funds around quarter-ends," Emerging Markets Review, Elsevier, vol. 44(C).
    7. Gandal, Neil & Hamrick, JT & Rouhi, Farhang & Mukherjee, Arghya & Feder, Amir & Moore, Tyler & Vasek, Marie, 2018. "The Economics of Cryptocurrency Pump and Dump Schemes," CEPR Discussion Papers 13404, Centre for Economic Policy Research.
    8. Massimo La Morgia & Alessandro Mei & Francesco Sassi & Julinda Stefa, 2020. "Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations," Papers 2005.06610, arXiv.org, revised Sep 2024.
    9. Rosa A. Schiavo & David J. Hand, 2000. "Ten More Years of Error Rate Research," International Statistical Review, International Statistical Institute, vol. 68(3), pages 295-310, December.
    10. Anirudh Dhawan & Tālis J Putniņš, 2023. "A New Wolf in Town? Pump-and-Dump Manipulation in Cryptocurrency Markets," Review of Finance, European Finance Association, vol. 27(3), pages 935-975.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sihao Hu & Zhen Zhang & Shengliang Lu & Bingsheng He & Zhao Li, 2022. "Sequence-Based Target Coin Prediction for Cryptocurrency Pump-and-Dump," Papers 2204.12929, arXiv.org, revised Apr 2023.
    2. Mohammad Javad Rajaei & Qusay H. Mahmoud, 2023. "A Survey on Pump and Dump Detection in the Cryptocurrency Market Using Machine Learning," Future Internet, MDPI, vol. 15(8), pages 1-17, August.
    3. Dun Li & Dezhi Han & Zibin Zheng & Tien-Hsiung Weng & Kuan-Ching Li & Ming Li & Shaokang Cai, 2024. "Does Short-and-Distort Scheme Really Exist? A Bitcoin Futures Audit Scheme through BIRCH & BPNN Approach," Computational Economics, Springer;Society for Computational Economics, vol. 63(4), pages 1649-1671, April.
    4. Kaihua Qin & Liyi Zhou & Yaroslav Afonin & Ludovico Lazzaretti & Arthur Gervais, 2021. "CeFi vs. DeFi -- Comparing Centralized to Decentralized Finance," Papers 2106.08157, arXiv.org, revised Jun 2021.
    5. Hornuf, Lars & Momtaz, Paul P. & Nam, Rachel J. & Yuan, Ye, 2025. "Cybercrime on the ethereum blockchain," Journal of Banking & Finance, Elsevier, vol. 175(C).
    6. Taro Tsuchiya, 2021. "Profitability of cryptocurrency Pump and Dump schemes," Digital Finance, Springer, vol. 3(2), pages 149-167, June.
    7. Ardia, David & Bluteau, Keven, 2024. "Twitter and cryptocurrency pump-and-dumps," International Review of Financial Analysis, Elsevier, vol. 95(PB).
    8. David Ardia & Keven Bluteau, 2023. "The Role of Twitter in Cryptocurrency Pump-and-Dumps," Papers 2306.02148, arXiv.org.
    9. Xi Yang, 2016. "Predicting bank failures: The leverage versus the risk-weighted capital ratio," Working Papers hal-04141595, HAL.
    10. Kolbe, Maura & Mansouri, Sasan & Momtaz, Paul P., 2022. "Why do video pitches matter in crowdfunding?," Journal of Economics and Business, Elsevier, vol. 122(C).
    11. Lars Hornuf & Paul P. Momtaz & Rachel J. Nam & Ye Yuan, 2023. "Cybercrime on the Ethereum Blockchain," CESifo Working Paper Series 10598, CESifo.
    12. J.J. Prescott & Kathryn E. Spier & Albert Yoon, 2014. "Trial and Settlement: A Study of High-Low Agreements," NBER Working Papers 19873, National Bureau of Economic Research, Inc.
    13. Christian Hopp & Christian Lukas, 2014. "A Signaling Perspective on Partner Selection in Venture Capital Syndicates," Entrepreneurship Theory and Practice, , vol. 38(3), pages 635-670, May.
    14. Angel M. Morales & Patrick Tarwater & Indika Mallawaarachchi & Alok Kumar Dwivedi & Juan B. Figueroa-Casas, 2015. "Multinomial logistic regression approach for the evaluation of binary diagnostic test in medical research," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(2), pages 203-222, June.
    15. F. Gauthier & D. Germain & B. Hétu, 2017. "Logistic models as a forecasting tool for snow avalanches in a cold maritime climate: northern Gaspésie, Québec, Canada," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 89(1), pages 201-232, October.
    16. Douglas Cumming & Lars Hornuf & Moein Karami & Denis Schweizer, 2023. "Disentangling Crowdfunding from Fraudfunding," Journal of Business Ethics, Springer, vol. 182(4), pages 1103-1128, February.
    17. Wang, Xize & Liu, Tao, 2022. "Home-made blues: Residential crowding and mental health in Beijing, China," SocArXiv cy8hz, Center for Open Science.
    18. Kabo, Felichism W. & Cotton-Nessler, Natalie & Hwang, Yongha & Levenstein, Margaret C. & Owen-Smith, Jason, 2014. "Proximity effects on the dynamics and outcomes of scientific collaborations," Research Policy, Elsevier, vol. 43(9), pages 1469-1485.
    19. Fryges, Helmut, 2005. "The Change of Sales Modes in International Markets: Empirical Results for German and British High-Tech Firms," ZEW Discussion Papers 05-82, ZEW - Leibniz Centre for European Economic Research.
    20. James Bailey, 2017. "Health insurance and the supply of entrepreneurs: new evidence from the affordable care act," Small Business Economics, Springer, vol. 49(3), pages 627-646, October.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C25 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions; Probabilities
    • C35 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • C58 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Financial Econometrics
    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation
    • G32 - Financial Economics - - Corporate Finance and Governance - - - Financing Policy; Financial Risk and Risk Management; Capital and Ownership Structure; Value of Firms; Goodwill
    • K42 - Law and Economics - - Legal Procedure, the Legal System, and Illegal Behavior - - - Illegal Behavior and the Enforcement of Law

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jecnmx:v:11:y:2023:i:3:p:22-:d:1229353. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager The email address of this maintainer does not seem to be valid anymore. Please ask MDPI Indexing Manager to update the entry or send us the correct address (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.