IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v61y2013i1p17-31.html
   My bibliography  Save this article

Data Quality of Query Results with Generalized Selection Conditions

Author

Listed:
  • Debabrata Dey

    (Foster School of Business, University of Washington, Seattle, Washington 98195)

  • Subodha Kumar

    (Mays Business School, Texas A&M University, College Station, Texas 77845)

Abstract

Information systems play a very important role in managerial decision making within modern organizations. While making different types of decisions (at operational, tactical, and strategic levels), managers are increasingly relying on information gleaned from various databases, data warehouses, and data streams feeding them. The quality of organizational decisions, therefore, often depends on the quality of the information derived from these databases and data streams, and a manager is able to make better use of the information if she also understands the quality level of that information. Previous research has examined how the quality level of a database query output can be estimated based on the quality level of the input data. In this research, we generalize this stream of research and allow a query to have general selection conditions involving multiple attributes with any combination of conjunction or disjunction of subconditions that may include functions of multiple attributes. Results of this research can easily be implemented in real-world decision contexts.

Suggested Citation

  • Debabrata Dey & Subodha Kumar, 2013. "Data Quality of Query Results with Generalized Selection Conditions," Operations Research, INFORMS, vol. 61(1), pages 17-31, February.
  • Handle: RePEc:inm:oropre:v:61:y:2013:i:1:p:17-31
    DOI: 10.1287/opre.1120.1128
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.1120.1128
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.1120.1128?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Debabrata Dey, 2003. "Record Matching in Data Warehouses: A Decision Model for Data Consolidation," Operations Research, INFORMS, vol. 51(2), pages 240-254, April.
    2. Donald P. Ballou & Harold L. Pazer, 1985. "Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems," Management Science, INFORMS, vol. 31(2), pages 150-162, February.
    3. Kon, Henry B. & Madnick, Stuart E. & Siegel, Michael D., 1995. "Good answers from bad data : a data management strategy," Working papers 3868-95., Massachusetts Institute of Technology (MIT), Sloan School of Management.
    4. Donald Ballou & Richard Wang & Harold Pazer & Giri Kumar Tayi, 1998. "Modeling Information Manufacturing Systems to Determine Information Product Quality," Management Science, INFORMS, vol. 44(4), pages 462-484, April.
    5. repec:mpr:mprres:3857 is not listed on IDEAS
    6. Nicole DeHoratius & Ananth Raman, 2008. "Inventory Record Inaccuracy: An Empirical Analysis," Management Science, INFORMS, vol. 54(4), pages 627-641, April.
    7. Amir Parssian & Sumit Sarkar & Varghese S. Jacob, 2004. "Assessing Data Quality for Information Products: Impact of Selection, Projection, and Cartesian Product," Management Science, INFORMS, vol. 50(7), pages 967-982, July.
    8. Xue Bai & Manuel Nunez & Jayant R. Kalagnanam, 2012. "Managing Data Quality Risk in Accounting Information Systems," Information Systems Research, INFORMS, vol. 23(2), pages 453-473, June.
    9. Debabrata Dey & Sumit Sarkar & Prabuddha De, 1998. "A Probabilistic Decision Model for Entity Matching in Heterogeneous Databases," Management Science, INFORMS, vol. 44(10), pages 1379-1395, October.
    10. Xue Bai, 2012. "A Mathematical Framework for Data Quality Management in Enterprise Systems," INFORMS Journal on Computing, INFORMS, vol. 24(4), pages 648-664, November.
    11. Zhengrui Jiang & Sumit Sarkar & Prabuddha De & Debabrata Dey, 2007. "A Framework for Reconciling Attribute Values from Multiple Data Sources," Management Science, INFORMS, vol. 53(12), pages 1946-1963, December.
    12. Amir Parssian & Sumit Sarkar & Varghese S. Jacob, 2009. "Impact of the Union and Difference Operations on the Quality of Information Products," Information Systems Research, INFORMS, vol. 20(1), pages 99-120, March.
    13. Debabrata Dey & Subodha Kumar, 2010. "Reassessing Data Quality for Information Products," Management Science, INFORMS, vol. 56(12), pages 2316-2322, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Qi Liu & Gengzhong Feng & Giri Kumar Tayi & Jun Tian, 2021. "Managing Data Quality of the Data Warehouse: A Chance-Constrained Programming Approach," Information Systems Frontiers, Springer, vol. 23(2), pages 375-389, April.
    2. Xiangyu Chang & Yinghui Huang & Mei Li & Xin Bo & Subodha Kumar, 2021. "Efficient Detection of Environmental Violators: A Big Data Approach," Production and Operations Management, Production and Operations Management Society, vol. 30(5), pages 1246-1270, May.
    3. Lu, Jizhou & Feng, Gengzhong & Shum, Stephen & Lai, Kin Keung, 2021. "On the value of information sharing in the presence of information errors," European Journal of Operational Research, Elsevier, vol. 294(3), pages 1139-1152.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiao, Yu & Lu, Louis Y.Y. & Liu, John S. & Zhou, Zhili, 2014. "Knowledge diffusion path analysis of data quality literature: A main path analysis," Journal of Informetrics, Elsevier, vol. 8(3), pages 594-605.
    2. Qi Liu & Gengzhong Feng & Giri Kumar Tayi & Jun Tian, 2021. "Managing Data Quality of the Data Warehouse: A Chance-Constrained Programming Approach," Information Systems Frontiers, Springer, vol. 23(2), pages 375-389, April.
    3. Hazen, Benjamin T. & Boone, Christopher A. & Ezell, Jeremy D. & Jones-Farmer, L. Allison, 2014. "Data quality for data science, predictive analytics, and big data in supply chain management: An introduction to the problem and suggestions for research and applications," International Journal of Production Economics, Elsevier, vol. 154(C), pages 72-80.
    4. Amir Parssian & Sumit Sarkar & Varghese S. Jacob, 2009. "Impact of the Union and Difference Operations on the Quality of Information Products," Information Systems Research, INFORMS, vol. 20(1), pages 99-120, March.
    5. Dominikus Kleindienst, 2017. "The data quality improvement plan: deciding on choice and sequence of data quality improvements," Electronic Markets, Springer;IIM University of St. Gallen, vol. 27(4), pages 387-398, November.
    6. Xiangyu Chang & Yinghui Huang & Mei Li & Xin Bo & Subodha Kumar, 2021. "Efficient Detection of Environmental Violators: A Big Data Approach," Production and Operations Management, Production and Operations Management Society, vol. 30(5), pages 1246-1270, May.
    7. Xitong Li & Hongwei Zhu & Luo Zuo, 2021. "Reporting Technologies and Textual Readability: Evidence from the XBRL Mandate," Information Systems Research, INFORMS, vol. 32(3), pages 1025-1042, September.
    8. Kartik Hosanagar, 2011. "Usercentric Operational Decision Making in Distributed Information Retrieval," Information Systems Research, INFORMS, vol. 22(4), pages 739-755, December.
    9. Shaobo Li & Matthew J. Schneider & Yan Yu & Sachin Gupta, 2023. "Reidentification Risk in Panel Data: Protecting for k -Anonymity," Information Systems Research, INFORMS, vol. 34(3), pages 1066-1088, September.
    10. Klein, B. D. & Rossin, D. F., 1999. "Data quality in neural network models: effect of error rate and magnitude of error on predictive accuracy," Omega, Elsevier, vol. 27(5), pages 569-582, October.
    11. Rajiv D. Banker & Robert J. Kauffman, 2004. "50th Anniversary Article: The Evolution of Research on Information Systems: A Fiftieth-Year Survey of the Literature in Management Science," Management Science, INFORMS, vol. 50(3), pages 281-298, March.
    12. Debabrata Dey & Subodha Kumar, 2010. "Reassessing Data Quality for Information Products," Management Science, INFORMS, vol. 56(12), pages 2316-2322, December.
    13. Amir Parssian & Sumit Sarkar & Varghese S. Jacob, 2004. "Assessing Data Quality for Information Products: Impact of Selection, Projection, and Cartesian Product," Management Science, INFORMS, vol. 50(7), pages 967-982, July.
    14. Hazen, Benjamin T. & Weigel, Fred K. & Ezell, Jeremy D. & Boehmke, Bradley C. & Bradley, Randy V., 2017. "Toward understanding outcomes associated with data quality improvement," International Journal of Production Economics, Elsevier, vol. 193(C), pages 737-747.
    15. Choo Yeon Kim & Seong Soo Cha, 2023. "Effect of SNS Characteristics for Dining Out on Customer Satisfaction and Online Word of Mouth," SAGE Open, , vol. 13(3), pages 21582440231, September.
    16. Arzum Akkaş & Nachiketa Sahoo, 2020. "Reducing Product Expiration by Aligning Salesforce Incentives: A Data‐driven Approach," Production and Operations Management, Production and Operations Management Society, vol. 29(8), pages 1992-2009, August.
    17. Rikhardsson, Pall & Yigitbasioglu, Ogan, 2018. "Business intelligence & analytics in management accounting research: Status and future focus," International Journal of Accounting Information Systems, Elsevier, vol. 29(C), pages 37-58.
    18. Risto Silvola & Janne Harkonen & Olli Vilppola & Hanna Kropsu-Vehkapera & Harri Haapasalo, 2016. "Data quality assessment and improvement," International Journal of Business Information Systems, Inderscience Enterprises Ltd, vol. 22(1), pages 62-81.
    19. Mou, Shandong & Robb, David J. & DeHoratius, Nicole, 2018. "Retail store operations: Literature review and research directions," European Journal of Operational Research, Elsevier, vol. 265(2), pages 399-422.
    20. Thiesse, Frédéric & Buckel, Thomas, 2015. "A comparison of RFID-based shelf replenishment policies in retail stores under suboptimal read rates," International Journal of Production Economics, Elsevier, vol. 159(C), pages 126-136.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:61:y:2013:i:1:p:17-31. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.