IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v37y2025i4p874-893.html

Clustering Then Estimation of Spatio-Temporal Self-Exciting Processes

Author

Listed:
  • Haoting Zhang

    (Industrial Engineering and Operations Research Department, University of California Berkeley, Berkeley, California 94720)

  • Donglin Zhan

    (Department of Electrical Engineering, Columbia University, New York, New York 10027)

  • James Anderson

    (Department of Electrical Engineering, Columbia University, New York, New York 10027)

  • Rhonda Righter

    (Industrial Engineering and Operations Research Department, University of California Berkeley, Berkeley, California 94720)

  • Zeyu Zheng

    (Industrial Engineering and Operations Research Department, University of California Berkeley, Berkeley, California 94720)

Abstract

We propose a new estimation procedure for general spatio-temporal point processes that include a self-exciting feature. Estimating spatio-temporal self-exciting point processes with observed data is challenging, partly because of the difficulty in computing and optimizing the likelihood function. To circumvent this challenge, we employ a Poisson cluster representation for spatio-temporal self-exciting point processes to simplify the likelihood function and develop a new estimation procedure called “clustering-then-estimation” (CTE), which integrates clustering algorithms with likelihood-based estimation methods. Compared with the widely used expectation-maximization (EM) method, our approach separates the cluster structure inference of the data from the model selection. This has the benefit of reducing the risk of model misspecification. Our approach is computationally more efficient because it does not need to recursively solve optimization problems, which would be needed for EM. We also present asymptotic statistical results for our approach as theoretical support. Experimental results on several synthetic and real data sets illustrate the effectiveness of the proposed CTE procedure.

Suggested Citation

  • Haoting Zhang & Donglin Zhan & James Anderson & Rhonda Righter & Zeyu Zheng, 2025. "Clustering Then Estimation of Spatio-Temporal Self-Exciting Processes," INFORMS Journal on Computing, INFORMS, vol. 37(4), pages 874-893, July.
  • Handle: RePEc:inm:orijoc:v:37:y:2025:i:4:p:874-893
    DOI: 10.1287/ijoc.2022.0351
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijoc.2022.0351
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2022.0351?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Vladimir Filimonov & Didier Sornette, 2012. "Quantifying reflexivity in financial markets: towards a prediction of flash crashes," Papers 1201.3572, arXiv.org, revised Apr 2012.
    2. Onur Seref & Ya-Ju Fan & Wanpracha Art Chaovalitwongse, 2014. "Mathematical Programming Formulations and Algorithms for Discrete k-Median Clustering of Time-Series Data," INFORMS Journal on Computing, INFORMS, vol. 26(1), pages 160-172, February.
    3. Shao-Bo Lin & Shaojie Tang & Yao Wang & Di Wang, 2022. "Toward Efficient Ensemble Learning with Structure Constraints: Convergent Algorithms and Applications," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3096-3116, November.
    4. Martin Bichler & Alexander Hammerl & Thayer Morrill & Stefan Waldherr, 2021. "How to Assign Scarce Resources Without Money: Designing Information Systems that are Efficient, Truthful, and (Pretty) Fair," Information Systems Research, INFORMS, vol. 32(2), pages 335-355, June.
    5. Tingting Cui & Yanfeng Ouyang & Zuo-Jun Max Shen, 2010. "Reliable Facility Location Design Under the Risk of Disruptions," Operations Research, INFORMS, vol. 58(4-part-1), pages 998-1011, August.
    6. Kuhl, Michael E. & Wilson, James R., 2001. "Modeling and simulating Poisson processes having trends or nontrigonometric cyclic effects," European Journal of Operational Research, Elsevier, vol. 133(3), pages 566-582, September.
    7. Cui, Tingting & Ouyang, Yanfeng & Shen, Zuo-Jun Max J, 2010. "Reliable Facility Location Design under the Risk of Disruptions," University of California Transportation Center, Working Papers qt5sh2c7pw, University of California Transportation Center.
    8. Zhiling Guo & Jin Li & Ram Ramesh, 2019. "Optimal Management of Virtual Infrastructures Under Flexible Cloud Service Agreements," Information Systems Research, INFORMS, vol. 30(4), pages 1424-1446, April.
    9. Baris Ungun & Lei Xing & Stephen Boyd, 2019. "Real-Time Radiation Treatment Planning with Optimality Guarantees via Cluster and Bound Methods," INFORMS Journal on Computing, INFORMS, vol. 31(3), pages 544-558, July.
    10. Shutong Chen & Weijun Xie, 2022. "On Cluster-Aware Supervised Learning: Frameworks, Convergent Algorithms, and Applications," INFORMS Journal on Computing, INFORMS, vol. 34(1), pages 481-502, January.
    11. Shalinda Adikari & Kaushik Dutta, 2019. "A New Approach to Real-Time Bidding in Online Advertisements: Auto Pricing Strategy," INFORMS Journal on Computing, INFORMS, vol. 31(1), pages 66-82, February.
    12. Yosihiko Ogata, 1998. "Space-Time Point-Process Models for Earthquake Occurrences," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 50(2), pages 379-402, June.
    13. Vladimir Filimonov & Didier Sornette, 2012. "Quantifying Reflexivity in Financial Markets: Towards a Prediction of Flash Crashes," Swiss Finance Institute Research Paper Series 12-02, Swiss Finance Institute.
    14. Zhengyi Zhou & David S. Matteson & Dawn B. Woodard & Shane G. Henderson & Athanasios C. Micheas, 2015. "A Spatio-Temporal Point Process Model for Ambulance Demand," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 6-15, March.
    15. Veen, Alejandro & Schoenberg, Frederic P., 2008. "Estimation of SpaceTime Branching Process Models in Seismology Using an EMType Algorithm," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 614-624, June.
    16. Songhao Wang & Szu Hui Ng & William Benjamin Haskell, 2022. "A Multilevel Simulation Optimization Approach for Quantile Functions," INFORMS Journal on Computing, INFORMS, vol. 34(1), pages 569-585, January.
    17. Zhiling Guo & Jin Li & Ram Ramesh, 2020. "Scalable, Adaptable, and Fast Estimation of Transient Downtime in Virtual Infrastructures Using Convex Decomposition and Sample Path Randomization," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 321-345, April.
    18. Nan Chen & Yanchu Liu, 2014. "American Option Sensitivities Estimation via a Generalized Infinitesimal Perturbation Analysis Approach," Operations Research, INFORMS, vol. 62(3), pages 616-632, June.
    19. Mohler, G. O. & Short, M. B. & Brantingham, P. J. & Schoenberg, F. P. & Tita, G. E., 2011. "Self-Exciting Point Process Modeling of Crime," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 100-108.
    20. Qingxin Meng & Keli Xiao & Dazhong Shen & Hengshu Zhu & Hui Xiong, 2022. "Fine-Grained Job Salary Benchmarking with a Nonparametric Dirichlet Process–Based Latent Factor Model," INFORMS Journal on Computing, INFORMS, vol. 34(5), pages 2443-2463, September.
    21. Daniel Manrique‐Vallier & Jingchen Hu, 2018. "Bayesian non‐parametric generation of fully synthetic multivariate categorical data in the presence of structural zeros," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(3), pages 635-647, June.
    22. Ningyuan Chen & Ragıp Gürlek & Donald K. K. Lee & Haipeng Shen, 2024. "Can Customer Arrival Rates Be Modelled by Sine Waves?," Service Science, INFORMS, vol. 16(2), pages 70-84, June.
    23. repec:inm:orstsy:v:11:y:2021:i:13:p:264-283 is not listed on IDEAS
    24. Xuefeng Gao & Lingjiong Zhu, 2018. "Functional central limit theorems for stationary Hawkes processes and application to infinite-server queues," Queueing Systems: Theory and Applications, Springer, vol. 90(1), pages 161-206, October.
    25. Lawrence Brown & Noah Gans & Avishai Mandelbaum & Anat Sakov & Haipeng Shen & Sergey Zeltyn & Linda Zhao, 2005. "Statistical Analysis of a Telephone Call Center: A Queueing-Science Perspective," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 36-50, March.
    26. Mohler, George, 2014. "Marked point process hotspot maps for homicide and gun crime prediction in Chicago," International Journal of Forecasting, Elsevier, vol. 30(3), pages 491-497.
    27. Ramesh Bollapragada & Yanjun Li & Uday S. Rao, 2006. "Budget-Constrained, Capacitated Hub Location to Maximize Expected Demand Coverage in Fixed-Wireless Telecommunication Networks," INFORMS Journal on Computing, INFORMS, vol. 18(4), pages 422-432, November.
    28. Earvin Balderama & Frederic Paik Schoenberg & Erin Murray & Philip W. Rundel, 2012. "Application of Branching Models in the Study of Invasive Species," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 467-476, June.
    29. Pierre Brice & Wei Jiang & Guohua Wan, 2011. "A Cluster-Based Context-Tree Model for Multivariate Data Streams with Applications to Anomaly Detection," INFORMS Journal on Computing, INFORMS, vol. 23(3), pages 364-376, August.
    30. Shaokun Fan & Xin Li & J. Leon Zhao & Shaokun Fan & Xin Li & J. Leon Zhao, 2017. "Collaboration Process Pattern Approach to Improving Teamwork Performance: A Data Mining-Based Methodology," INFORMS Journal on Computing, INFORMS, vol. 29(3), pages 438-456, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baichuan Yuan & Frederic P. Schoenberg & Andrea L. Bertozzi, 2021. "Fast estimation of multivariate spatiotemporal Hawkes processes and network reconstruction," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(6), pages 1127-1152, December.
    2. P. Jeffrey Brantingham & Baichuan Yuan & Denise Herz, 2021. "Is Gang Violent Crime More Contagious than Non-Gang Violent Crime?," Journal of Quantitative Criminology, Springer, vol. 37(4), pages 953-977, December.
    3. Gresnigt, Francine & Kole, Erik & Franses, Philip Hans, 2015. "Interpreting financial market crashes as earthquakes: A new Early Warning System for medium term crashes," Journal of Banking & Finance, Elsevier, vol. 56(C), pages 123-139.
    4. Francesco Serafini & Finn Lindgren & Mark Naylor, 2023. "Approximation of Bayesian Hawkes process with inlabru," Environmetrics, John Wiley & Sons, Ltd., vol. 34(5), August.
    5. Anatoliy Swishchuk & Aiden Huffman, 2020. "General Compound Hawkes Processes in Limit Order Books," Risks, MDPI, vol. 8(1), pages 1-25, March.
    6. Chenlong Li & Zhanjie Song & Wenjun Wang, 2020. "Space–time inhomogeneous background intensity estimators for semi-parametric space–time self-exciting point process models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(4), pages 945-967, August.
    7. Eric W. Fox & Martin B. Short & Frederic P. Schoenberg & Kathryn D. Coronges & Andrea L. Bertozzi, 2016. "Modeling E-mail Networks and Inferring Leadership Using Self-Exciting Point Processes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 564-584, April.
    8. Maillart, Thomas & Sornette, Didier, 2019. "Aristotle vs. Ringelmann: On superlinear production in open source software," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 523(C), pages 964-972.
    9. Sobin Joseph & Shashi Jain, 2023. "A neural network based model for multi-dimensional nonlinear Hawkes processes," Papers 2303.03073, arXiv.org.
    10. Didier Sornette & Thomas Maillart & Giacomo Ghezzi, 2014. "How Much Is the Whole Really More than the Sum of Its Parts? 1 ⊞ 1 = 2.5: Superlinear Productivity in Collective Group Actions," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-15, August.
    11. Frederic Paik Schoenberg & Marc Hoffmann & Ryan J. Harrigan, 2019. "A recursive point process model for infectious diseases," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(5), pages 1271-1287, October.
    12. Stindl, Tom & Bi, Zelong & Grazian, Clara, 2025. "Bayesian forecasting of Italian seismicity using the spatiotemporal RETAS model," Computational Statistics & Data Analysis, Elsevier, vol. 212(C).
    13. Tomlinson, Matthew F. & Greenwood, David & Mucha-Kruczyński, Marcin, 2024. "2T-POT Hawkes model for left- and right-tail conditional quantile forecasts of financial log returns: Out-of-sample comparison of conditional EVT models," International Journal of Forecasting, Elsevier, vol. 40(1), pages 324-347.
    14. Jakob Gulddahl Rasmussen, 2013. "Bayesian Inference for Hawkes Processes," Methodology and Computing in Applied Probability, Springer, vol. 15(3), pages 623-642, September.
    15. Kim, Gunhee & Choe, Geon Ho, 2019. "Limit properties of continuous self-exciting processes," Statistics & Probability Letters, Elsevier, vol. 155(C), pages 1-1.
    16. Mohler, George & Carter, Jeremy & Raje, Rajeev, 2018. "Improving social harm indices with a modulated Hawkes process," International Journal of Forecasting, Elsevier, vol. 34(3), pages 431-439.
    17. Alex Reinhart & Joel Greenhouse, 2018. "Self‐exciting point processes with spatial covariates: modelling the dynamics of crime," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(5), pages 1305-1329, November.
    18. Weijun Xie & Yanfeng Ouyang & Sze Chun Wong, 2016. "Reliable Location-Routing Design Under Probabilistic Facility Disruptions," Transportation Science, INFORMS, vol. 50(3), pages 1128-1138, August.
    19. Zhuohan Wang & Carmine Ventre, 2025. "DiffVolume: Diffusion Models for Volume Generation in Limit Order Books," Papers 2508.08698, arXiv.org.
    20. An, Yu & Zhang, Yu & Zeng, Bo, 2015. "The reliable hub-and-spoke design problem: Models and algorithms," Transportation Research Part B: Methodological, Elsevier, vol. 77(C), pages 103-122.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:37:y:2025:i:4:p:874-893. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.