IDEAS home Printed from https://ideas.repec.org/a/spr/elmark/v26y2016i2d10.1007_s12525-016-0217-2.html
   My bibliography  Save this article

The Ambiguous Identifier Clustering Technique

Author

Listed:
  • Michael Scholz

    (University of Passau)

  • Markus Franz

    (University of Passau)

  • Oliver Hinz

    (University of Passau)

Abstract

Investigations of online transaction data often face the problem that entries for identical products cannot be identified as such. There is, for example, typically no unique product identifier in online auctions; retailers make their offers at price comparison sites hardly comparable and online stores often use different identifiers for virtually equal products. Existing studies typically use data sets that are restricted to one or only a few products in order to avoid product heterogeneity if a unique product identifier is not available. We propose the Ambiguous Identifier Clustering Technique (AICT) that identifies online transaction data that refer to virtually the same product. Based on a data set of eBay auctions, we demonstrate that AICT clusters online transactions for identical products with high accuracy. We further show how researchers benefit from AICT and the reduced product heterogeneity when analyzing data with econometric models.

Suggested Citation

  • Michael Scholz & Markus Franz & Oliver Hinz, 2016. "The Ambiguous Identifier Clustering Technique," Electronic Markets, Springer;IIM University of St. Gallen, vol. 26(2), pages 143-156, May.
  • Handle: RePEc:spr:elmark:v:26:y:2016:i:2:d:10.1007_s12525-016-0217-2
    DOI: 10.1007/s12525-016-0217-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12525-016-0217-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12525-016-0217-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Debabrata Dey, 2003. "Record Matching in Data Warehouses: A Decision Model for Data Consolidation," Operations Research, INFORMS, vol. 51(2), pages 240-254, April.
    2. Lawrence Hubert & Phipps Arabie, 1985. "Comparing partitions," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 193-218, December.
    3. Mauricio Sadinle & Stephen E. Fienberg, 2013. "A Generalized Fellegi--Sunter Framework for Multiple Record Linkage With Application to Homicide Record Systems," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(502), pages 385-397, June.
    4. Ravi Bapna & Paulo Goes & Alok Gupta, 2003. "Replicating Online Yankee Auctions to Analyze Auctioneers' and Bidders' Strategies," Information Systems Research, INFORMS, vol. 14(3), pages 244-268, September.
    5. Michael Ostrovsky & Michael Schwarz, 2023. "Reserve Prices in Internet Advertising Auctions: A Field Experiment," Journal of Political Economy, University of Chicago Press, vol. 131(12), pages 3352-3376.
    6. Hinz, Oliver & Hill, Shawndra & Kim, Ju-Young, 2014. "TV's Dirty Little Secret: The Negative Effect of Popular TV on Online Auction Sales," Publications of Darmstadt Technical University, Institute for Business Studies (BWL) 69931, Darmstadt Technical University, Department of Business Administration, Economics and Law, Institute for Business Studies (BWL).
    7. Bajari, Patrick & Hortacsu, Ali, 2003. "The Winner's Curse, Reserve Prices, and Endogenous Entry: Empirical Insights from eBay Auctions," RAND Journal of Economics, The RAND Corporation, vol. 34(2), pages 329-355, Summer.
    8. David Lucking‐Reiley & Doug Bryan & Naghi Prasad & Daniel Reeves, 2007. "Pennies From Ebay: The Determinants Of Price In Online Auctions," Journal of Industrial Economics, Wiley Blackwell, vol. 55(2), pages 223-233, June.
    9. Wenyan Zhou & Oliver Hinz, 2016. "Determining profit-optimizing return policies – a two-step approach on data from taobao.com," Electronic Markets, Springer;IIM University of St. Gallen, vol. 26(2), pages 103-114, May.
    10. Ji-Liang Shiu & Chia-Hung D. Sun, 2014. "The determinants of price in online auctions: more evidence from unbalanced panel data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(2), pages 382-392, February.
    11. Ravi Bapna & Wolfgang Jank & Galit Shmueli, 2008. "Consumer Surplus in Online Auctions," Information Systems Research, INFORMS, vol. 19(4), pages 400-416, December.
    12. Chris Forman & Anindya Ghose & Batia Wiesenfeld, 2008. "Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets," Information Systems Research, INFORMS, vol. 19(3), pages 291-313, September.
    13. Liran Einav & Theresa Kuchler & Jonathan Levin & Neel Sundaresan, 2015. "Assessing Sale Strategies in Online Markets Using Matched Listings," American Economic Journal: Microeconomics, American Economic Association, vol. 7(2), pages 215-247, May.
    14. Larsen M. D & Rubin D. B, 2001. "Iterative Automated Record Linkage Using Mixture Models," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 32-41, March.
    15. Rong Zheng & Jiexun Li & Hsinchun Chen & Zan Huang, 2006. "A framework for authorship identification of online messages: Writing‐style features and classification techniques," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 57(3), pages 378-393, February.
    16. Daniel W. Elfenbein & Ray Fisman & Brian Mcmanus, 2012. "Charity as a Substitute for Reputation: Evidence from an Online Marketplace," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(4), pages 1441-1468.
    17. Amar Cheema & Dipankar Chakravarti & Atanu R. Sinha, 2012. "Bidding Behavior in Descending and Ascending Auctions," Marketing Science, INFORMS, vol. 31(5), pages 779-800, September.
    18. Mingfeng Lin & Henry C. Lucas & Galit Shmueli, 2013. "Research Commentary ---Too Big to Fail: Large Samples and the p -Value Problem," Information Systems Research, INFORMS, vol. 24(4), pages 906-917, December.
    19. Kannan Srinivasan & Xin Wang, 2010. "Commentary--Bidders' Experience and Learning in Online Auctions: Issues and Implications," Marketing Science, INFORMS, vol. 29(6), pages 988-993, 11-12.
    20. David Lucking-Reiley, 1999. "Using Field Experiments to Test Equivalence between Auction Formats: Magic on the Internet," American Economic Review, American Economic Association, vol. 89(5), pages 1063-1080, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fatao Wang & Lihui Ding & Hongxin Yu & Yuanjun Zhao, 2020. "Big data analytics on enterprise credit risk evaluation of e-Business platform," Information Systems and e-Business Management, Springer, vol. 18(3), pages 311-350, September.
    2. Benjamin M. Abdel-Karim & Nicolas Pfeuffer & Oliver Hinz, 2021. "Machine learning in information systems - a bibliographic review and open research issues," Electronic Markets, Springer;IIM University of St. Gallen, vol. 31(3), pages 643-670, September.
    3. Rainer Alt & Carsta Militzer-Horstmann & Hans-Dieter Zimmermann, 2016. "Electronic Markets on the impact factor," Electronic Markets, Springer;IIM University of St. Gallen, vol. 26(2), pages 95-101, May.
    4. Fatao Wang & Lihui Ding & Hongxin Yu & Yuanjun Zhao, 0. "Big data analytics on enterprise credit risk evaluation of e-Business platform," Information Systems and e-Business Management, Springer, vol. 0, pages 1-40.
    5. Blasco-Arcas, Lorena & Lee, Hsin-Hsuan Meg & Kastanakis, Minas N. & Alcañiz, Mariano & Reyes-Menendez, Ana, 2022. "The role of consumer data in marketing: A research agenda," Journal of Business Research, Elsevier, vol. 146(C), pages 436-452.
    6. Erik Ernesto Vazquez, 2021. "Effect of an e-retailer’s product category and social media platform selection on perceived quality of e-retail products," Electronic Markets, Springer;IIM University of St. Gallen, vol. 31(1), pages 139-157, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liran Einav & Theresa Kuchler & Jonathan Levin & Neel Sundaresan, 2015. "Assessing Sale Strategies in Online Markets Using Matched Listings," American Economic Journal: Microeconomics, American Economic Association, vol. 7(2), pages 215-247, May.
    2. Cong Feng & Scott Fay & K. Sivakumar, 2016. "Overbidding in electronic auctions: factors influencing the propensity to overbid and the magnitude of overbidding," Journal of the Academy of Marketing Science, Springer, vol. 44(2), pages 241-260, March.
    3. März, Armin & Lachner, Michael & Heumann, Christian G. & Schumann, Jan H. & von Wangenheim, Florian, 2021. "How You Remind Me! The Influence of Mobile Push Notifications on Success Rates in Last-Minute Bidding," Journal of Interactive Marketing, Elsevier, vol. 54(C), pages 11-24.
    4. Grégory Jolivet & Bruno Jullien & Fabien Postel-Vinay, 2014. "Reputation and Pricing on the e-Market: Evidence from a Major French Platform," PSE Working Papers hal-03460312, HAL.
    5. Katkar Rama & Reiley David H, 2007. "Public versus Secret Reserve Prices in eBay Auctions: Results from a Pokémon Field Experiment," The B.E. Journal of Economic Analysis & Policy, De Gruyter, vol. 5(2), pages 1-25, January.
    6. Paul Resnick & Richard Zeckhauser & John Swanson & Kate Lockwood, 2006. "The value of reputation on eBay: A controlled experiment," Experimental Economics, Springer;Economic Science Association, vol. 9(2), pages 79-101, June.
    7. Marie BLUM & Régis BLAZY, 2021. "The three stages of an auction: how do the bid dynamics influence auction prices? Evidence from live art auctions," Working Papers of LaRGE Research Center 2021-10, Laboratoire de Recherche en Gestion et Economie (LaRGE), Université de Strasbourg.
    8. Chen, Kong-Pin & Lai, Hung-pin & Yu, Ya-Ting, 2018. "The seller's listing strategy in online auctions: Evidence from eBay," International Journal of Industrial Organization, Elsevier, vol. 56(C), pages 107-144.
    9. Jolivet, Grégory & Jullien, Bruno & Postel-Vinay, Fabien, 2016. "Reputation and prices on the e-market: Evidence from a major French platform," International Journal of Industrial Organization, Elsevier, vol. 45(C), pages 59-75.
    10. Bolton, Gary E. & Ockenfels, Axel, 2014. "Does laboratory trading mirror behavior in real world markets? Fair bargaining and competitive bidding on eBay," Journal of Economic Behavior & Organization, Elsevier, vol. 97(C), pages 143-154.
    11. Dawn Gregg & Madhavan Parthasarathy, 2017. "Factors affecting the long-term survival of eBay ventures: a longitudinal study," Small Business Economics, Springer, vol. 49(2), pages 405-419, August.
    12. Shiu, Ji-Liang & Sun, Chia-Hung D., 2014. "Modeling and estimating returns to seller reputation with unobserved heterogeneity in online auctions," Economic Modelling, Elsevier, vol. 40(C), pages 59-67.
    13. Janne Tukiainen, 2017. "Effects of Minimum Bid Increments in Internet Auctions: Evidence from a Field Experiment," Journal of Industrial Economics, Wiley Blackwell, vol. 65(3), pages 597-622, September.
    14. Ronald Peeters & Martin Strobel & Dries Vermeulen & Markus Walzl, 2016. "The Impact of the Irrelevant: Temporary Buy-Options and Bidding Behavior in Auctions," Games, MDPI, vol. 7(1), pages 1-19, March.
    15. Rama Katkar & David Lucking-Reiley, 2000. "Public Versus Secret Reserve Prices in eBay Auctions: Results of Pok�mon Field Experiment," Vanderbilt University Department of Economics Working Papers 0026, Vanderbilt University Department of Economics.
    16. Dass, Mayukh & Reddy, Srinivas K. & Iacobucci, Dawn, 2014. "A Network Bidder Behavior Model in Online Auctions: A Case of Fine Art Auctions," Journal of Retailing, Elsevier, vol. 90(4), pages 445-462.
    17. Anderson, Steven & Friedman, Daniel & Milam, Garrett & Singh, Nirvikar, 2007. "Seller strategies on eBay: Does size matter?," MPRA Paper 4324, University Library of Munich, Germany.
    18. repec:hal:spmain:info:hdl:2441/4firg7dmo68rdasf15hi85vn53 is not listed on IDEAS
    19. Paulo B. Goes & Gilbert G. Karuga & Arvind K. Tripathi, 2010. "Understanding Willingness-to-Pay Formation of Repeat Bidders in Sequential Online Auctions," Information Systems Research, INFORMS, vol. 21(4), pages 907-924, December.
    20. Podwol, Joseph Uri & Schneider, Henry S., 2016. "Nonstandard bidder behavior in real-world auctions," European Economic Review, Elsevier, vol. 83(C), pages 198-212.
    21. Genti Kostandini & Elton Mykerezi & Eftila Tanellari & Nour Dib, 2011. "Does Buyer Experience Pay Off? Evidence from eBay," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 39(3), pages 253-265, November.

    More about this item

    Keywords

    Product heterogeneity; Clustering; Online transaction data; E-commerce;
    All these keywords.

    JEL classification:

    • C18 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Methodolical Issues: General
    • D44 - Microeconomics - - Market Structure, Pricing, and Design - - - Auctions

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:elmark:v:26:y:2016:i:2:d:10.1007_s12525-016-0217-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.