Report NEP-BIG-2025-08-18
This is the archive for NEP-BIG, a report on new working papers in the area of Big Data. Tom Coupé issued this report. It is usually issued weekly.
The following items were announced in this report:
- Duane, Jackson & Ren, Alicia & Zhang, Wei, 2025, "Deep Learning Models for Financial Data Analysis: A Focused Review of Recent Advances," OSF Preprints, Center for Open Science, number ctxf9_v1, Jul, DOI: 10.31219/osf.io/ctxf9_v1.
- Baptiste Lefort & Eric Benhamou & Beatrice Guez & Jean-Jacques Ohana & Ethan Setrouk & Alban Etienne, 2025, "FinMarBa: A Market-Informed Dataset for Financial Sentiment Classification," Papers, arXiv.org, number 2507.22932, Jul.
- Md Talha Mohsin, 2025, "Evaluating Large Language Models (LLMs) in Financial NLP: A Comparative Study on Financial Report Analysis," Papers, arXiv.org, number 2507.22936, Jul, revised Jan 2026.
- Richiardi, Matteo & Frimpong, Rejoice, 2025, "Machine learning regionalisation of input data for microsimulation models: An application of a hybrid GBM / IPF method to build a tax-benefit model for the Essex region in the UK," Centre for Microsimulation and Policy Analysis Working Paper Series, Centre for Microsimulation and Policy Analysis at the Institute for Social and Economic Research, number CEMPA9/25, Aug.
- Tingyu Yuan & Xi Zhang & Xuanjing Chen, 2025, "Machine Learning based Enterprise Financial Audit Framework and High Risk Identification," Papers, arXiv.org, number 2507.06266, Jul.
- Siyi Wu & Junqiao Wang & Zhaoyang Guan & Leyi Zhao & Xinyuan Song & Xinyu Ying & Dexu Yu & Jinhao Wang & Hanlin Zhang & Michele Pak & Yangfan He & Yi Xin & Jianhui Wang & Tianyu Shi, 2025, "MountainLion: A Multi-Modal LLM-Based Agent System for Interpretable and Adaptive Financial Trading," Papers, arXiv.org, number 2507.20474, Jul, revised Sep 2025.
- Ozili, Peterson K & Obiora, Kingsley I & Onuzo, Chinwendu, 2025, "Financial inclusion and large language models," MPRA Paper, University Library of Munich, Germany, number 125562.
- Tomaso Duso & Joseph E. Harrington, Jr. & Carl Kreuzberg & Geza Sapi, 2025, "Public Communication and Collusion: New Screening Tools for Competition Authorities," CESifo Working Paper Series, CESifo, number 12029.
- Schultze, Michelle, 2025, "In the Shadow of War: Assessing Conflict-Driven Disruptions in the Kyrgyzstan-Russia Labor Pipeline via a Gradient Boosting Approach to Nowcasting," SocArXiv, Center for Open Science, number z2wch_v1, Jul, DOI: 10.31219/osf.io/z2wch_v1.
- Jiayi Guo & Zhiyu Quan & Linfeng Zhang, 2025, "Entity-Specific Cyber Risk Assessment using InsurTech Empowered Risk Factors," Papers, arXiv.org, number 2507.08193, Jul, revised Mar 2026.
- Mori, Misato, 2025, "How AI Detects Financial Fraud: A Review of Emerging Deep Learning Methods," OSF Preprints, Center for Open Science, number 5yjm4_v1, Jul, DOI: 10.31219/osf.io/5yjm4_v1.
- Francis Boabang & Samuel Asante Gyamerah, 2025, "An Enhanced Focal Loss Function to Mitigate Class Imbalance in Auto Insurance Fraud Detection with Explainable AI," Papers, arXiv.org, number 2508.02283, Aug, revised Jan 2026.
- Golo Henseke & Rhys Davies & Alan Felstead & Duncan Gallie & Francis Green & Ying Zhou, 2025, "How Exposed Are UK Jobs to Generative AI? Developing and Applying a Novel Task-Based Index," Papers, arXiv.org, number 2507.22748, Jul, revised Apr 2026.
- Yingnan Yan & Tianming Liu & Yafeng Yin, 2025, "Valuing Time in Silicon: Can Large Language Models Replicate Human Value of Travel Time," Papers, arXiv.org, number 2507.22244, Jul, revised Dec 2025.
- Junjie Zhao & Chengxi Zhang & Chenkai Wang & Peng Yang, 2025, "Learning from Expert Factors: Trajectory-level Reward Shaping for Formulaic Alpha Mining," Papers, arXiv.org, number 2507.20263, Jul.
- Zeqi Wu & Meilin Wang & Wei Huang & Zheng Zhang, 2025, "A New and Efficient Debiased Estimation of General Treatment Models by Balanced Neural Networks Weighting," Papers, arXiv.org, number 2507.04044, Jul.
- Rachel Cho & Christoph Görtz & Danny McGowan & Max Schröder, 2025, "Defining Current and Expected Financial Constraints Using AI: Reinterpreting the Cash Flow Sensitivity of Cash," CESifo Working Paper Series, CESifo, number 12054.
- Zequn Jin & Gaoqian Xu & Xi Zheng & Yahong Zhou, 2025, "Policy Learning under Unobserved Confounding: A Robust and Efficient Approach," Papers, arXiv.org, number 2507.20550, Jul.
- Dhanashekar Kandaswamy & Ashutosh Sahoo & Akshay SP & Gurukiran S & Parag Paul & Girish G N, 2025, "Deep Reputation Scoring in DeFi: zScore-Based Wallet Ranking from Liquidity and Trading Signals," Papers, arXiv.org, number 2507.20494, Jul.
- Johannes Kruse, 2025, "The ordinary meaning bot: Simulating human surveys with LLMs," Discussion Paper Series of the Max Planck Institute for Behavioral Economics, Max Planck Institute for Behavioral Economics, number 2025_12, Aug.
- Seyed Mohammad Ali Jafari & Ali Mobini Dehkordi & Ehsan Chitsaz & Yadollah Yaghoobzadeh, 2025, "What Matters Most? A Quantitative Meta-Analysis of AI-Based Predictors for Startup Success," Papers, arXiv.org, number 2507.09675, Jul.
- Georges Sfeir & Gabriel Nova & Stephane Hess & Sander van Cranenburgh, 2025, "Can large language models assist choice modelling? Insights into prompting strategies and current models capabilities," Papers, arXiv.org, number 2507.21790, Jul, revised Mar 2026.
- Aaron Green & Zihan Nie & Hanzhen Qin & Oshani Seneviratne & Kristin P. Bennett, 2025, "FinSurvival: A Suite of Large Scale Survival Modeling Tasks from Finance," Papers, arXiv.org, number 2507.14160, Jul.
- Zehao Lin & Ying Liu & Congrong Pan & Lutz Sager, 2025, "Can Air Pollution Affect Our Sentiments: Social Media Evidence from Japan," CESifo Working Paper Series, CESifo, number 12030.
- Hoyoung Lee & Junhyuk Seo & Suhwan Park & Junhyeong Lee & Wonbin Ahn & Chanyeol Choi & Alejandro Lopez-Lira & Yongjae Lee, 2025, "Your AI, Not Your View: The Bias of LLMs in Investment Analysis," Papers, arXiv.org, number 2507.20957, Jul, revised Oct 2025.
- König, Leonard Maximilian, 2025, "Affective Signals and Issue Salience in Swiss Reddit Discourse: Insights on Public Reaction to Government Measures During COVID-19 and the Ukraine Crisis," SocArXiv, Center for Open Science, number 28exs_v2, Jul, DOI: 10.31219/osf.io/28exs_v2.
- Wei Lu & Amit Dhanda & Daniel L. Chen & Christian B. Hansen, 2025, "Aligning Large Language Model Agents with Rational and Moral Preferences: A Supervised Fine-Tuning Approach," Papers, arXiv.org, number 2507.20796, Jul, revised Mar 2026.
- Item repec:cam:camjip:2520 is not listed on IDEAS anymore
- Christoph Engel & Yoan Hermstrüwer & Alison Kim, 2025, "Human Realignment: An Empirical Study of LLMs as Legal Decision-Aids in Moral Dilemmas," Discussion Paper Series of the Max Planck Institute for Behavioral Economics, Max Planck Institute for Behavioral Economics, number 2025_03, Apr.
Printed from https://ideas.repec.org/n/nep-big/2025-08-18.html