Journey to Data Quality

Author : Yang W. Lee
Publisher : MIT Press (MA)
Total Pages : 248
Release : 2006
ISBN-10 : UOM:49015003152734
ISBN-13 :
Rating : 4/5 (34 Downloads)

Book Synopsis: Journey to Data Quality by Yang W. Lee

Journey to Data Quality, written by Yang W. Lee and published by MIT Press (MA), was released in 2006 and runs 248 pages. All organizations today confront data quality problems, both systemic and structural. Neither ad hoc approaches nor fixes at the systems level (installing the latest software or developing an expensive data warehouse) solve the basic problem of bad data quality practices. Journey to Data Quality offers a roadmap that can be used by practitioners, executives, and students for planning and implementing a viable data and information quality management program. This practical guide, based on rigorous research and informed by real-world examples, describes the challenges of data management and provides the principles, strategies, tools, and techniques necessary to meet them. The authors, all leaders in the data quality field for many years, discuss how to make the economic case for data quality and the importance of getting an organization's leaders on board. They outline different approaches for assessing data, both subjectively (by users) and objectively (using sampling and other techniques). They describe real problems and solutions, including efforts to find the root causes of data quality problems at a healthcare organization and data quality initiatives taken by a large teaching hospital. They address setting company policy on data quality and, finally, they consider future challenges on the journey to data quality.

Executing Data Quality Projects

Author : Danette McGilvray
Publisher : Academic Press
Total Pages : 378
Release : 2021-05-27
ISBN-10 : 0128180161
ISBN-13 : 9780128180167
Rating : 4/5 (67 Downloads)

Book Synopsis: Executing Data Quality Projects by Danette McGilvray

Executing Data Quality Projects, written by Danette McGilvray and published by Academic Press, was released on 2021-05-27 and runs 378 pages. Executing Data Quality Projects, Second Edition presents a structured yet flexible approach for creating, improving, sustaining and managing the quality of data and information within any organization. Studies show that data quality problems are costing businesses billions of dollars each year, with poor data linked to waste and inefficiency, damaged credibility among customers and suppliers, and an organizational inability to make sound decisions. Help is here! This book describes a proven Ten Steps approach that combines a conceptual framework for understanding information quality with techniques, tools, and instructions for practically putting the approach to work, with the end result of high-quality trusted data and information, so critical to today's data-dependent organizations. The Ten Steps approach applies to all types of data and all types of organizations: for-profit in any industry, non-profit, government, education, healthcare, science, research, and medicine. This book includes numerous templates, detailed examples, and practical advice for executing every step. At the same time, readers are advised on how to select relevant steps and apply them in different ways to best address the many situations they will face. The layout allows for quick reference with an easy-to-use format highlighting key concepts and definitions, important checkpoints, communication activities, best practices, and warnings. The experience of actual clients and users of the Ten Steps provides real examples of outputs for the steps plus highlighted, sidebar case studies called Ten Steps in Action.
This book uses projects as the vehicle for data quality work and uses the word broadly to include: 1) focused data quality improvement projects, such as improving data used in supply chain management, 2) data quality activities in other projects such as building new applications and migrating data from legacy systems, integrating data because of mergers and acquisitions, or untangling data due to organizational breakups, and 3) ad hoc use of data quality steps, techniques, or activities in the course of daily work. The Ten Steps approach can also be used to enrich an organization's standard SDLC (whether sequential or Agile) and it complements general improvement methodologies such as Six Sigma or Lean. No two data quality projects are the same, but the flexible nature of the Ten Steps means the methodology can be applied to all. The new Second Edition highlights topics such as artificial intelligence and machine learning, Internet of Things, security and privacy, analytics, legal and regulatory requirements, data science, big data, data lakes, and cloud computing, among others, to show their dependence on data and information and why data quality is more relevant and critical now than ever before.
- Includes concrete instructions, numerous templates, and practical advice for executing every step of The Ten Steps approach
- Contains real examples from around the world, gleaned from the author's consulting practice and from those who implemented the approach based on her training courses and the earlier edition of the book
- Allows for quick reference with an easy-to-use format highlighting key concepts and definitions, important checkpoints, communication activities, and best practices
- A companion Web site includes links to numerous data quality resources, including many of the templates featured in the text, quick summaries of key ideas from the Ten Steps methodology, and other tools and information that are available online

Executing Data Quality Projects

Author : Danette McGilvray
Publisher : Elsevier
Total Pages : 353
Release : 2008-09-01
ISBN-10 : 0080558399
ISBN-13 : 9780080558394
Rating : 4/5 (94 Downloads)

Book Synopsis: Executing Data Quality Projects by Danette McGilvray

Executing Data Quality Projects, written by Danette McGilvray and published by Elsevier, was released on 2008-09-01 and runs 353 pages. Information is currency. Recent studies show that data quality problems are costing businesses billions of dollars each year, with poor data linked to waste and inefficiency, damaged credibility among customers and suppliers, and an organizational inability to make sound decisions. In this important and timely new book, Danette McGilvray presents her "Ten Steps" approach to information quality, a proven method for both understanding and creating information quality in the enterprise. Her trademarked approach, in which she has trained Fortune 500 clients and hundreds of workshop attendees, applies to all types of data and to all types of organizations.
* Includes numerous templates, detailed examples, and practical advice for executing every step of the "Ten Steps" approach.
* Allows for quick reference with an easy-to-use format highlighting key concepts and definitions, important checkpoints, communication activities, and best practices.
* A companion Web site includes links to numerous data quality resources, including many of the planning and information-gathering templates featured in the text, quick summaries of key ideas from the Ten Steps methodology, and other tools and information available online.

Handbook of Data Quality

Author : Shazia Sadiq
Publisher : Springer Science & Business Media
Total Pages : 440
Release : 2013-08-13
ISBN-10 : 3642362575
ISBN-13 : 9783642362576
Rating : 4/5 (76 Downloads)

Book Synopsis: Handbook of Data Quality by Shazia Sadiq

Handbook of Data Quality, written by Shazia Sadiq and published by Springer Science & Business Media, was released on 2013-08-13 and runs 440 pages. The issue of data quality is as old as data itself. However, the proliferation of diverse, large-scale and often publicly available data on the Web has increased the risk of poor data quality and misleading data interpretations. On the other hand, data is now exposed at a much more strategic level, e.g. through business intelligence systems, increasing manifold the stakes involved for individuals, corporations as well as government agencies. There, the lack of knowledge about data accuracy, currency or completeness can have erroneous and even catastrophic results. With these changes, traditional approaches to data management in general, and data quality control specifically, are challenged. There is an evident need to incorporate data quality considerations into the whole data cycle, encompassing managerial/governance as well as technical aspects. Data quality experts from research and industry agree that a unified framework for data quality management should bring together organizational, architectural and computational approaches. Accordingly, Sadiq structured this handbook in four parts: Part I is on organizational solutions, i.e. the development of data quality objectives for the organization, and the development of strategies to establish roles, processes, policies, and standards required to manage and ensure data quality. Part II, on architectural solutions, covers the technology landscape required to deploy developed data quality management processes, standards and policies. Part III, on computational solutions, presents effective and efficient tools and techniques related to record linkage, lineage and provenance, data uncertainty, and advanced integrity constraints.
Finally, Part IV is devoted to case studies of successful data quality initiatives that highlight the various aspects of data quality in action. The individual chapters present both an overview of the respective topic in terms of historical research and/or practice and state of the art, as well as specific techniques, methodologies and frameworks developed by the individual contributors. Researchers and students of computer science, information systems, or business management as well as data professionals and practitioners will benefit most from this handbook not only by focusing on the various sections relevant to their research area or particular practical work, but also by studying chapters that they may initially consider not to be directly relevant to them, as there they will learn about new perspectives and approaches.

Information Quality

Author : Ron S. Kenett
Publisher : John Wiley & Sons
Total Pages : 381
Release : 2016-12-19
ISBN-10 : 1118874447
ISBN-13 : 9781118874448
Rating : 4/5 (48 Downloads)

Book Synopsis: Information Quality by Ron S. Kenett

Information Quality, written by Ron S. Kenett and published by John Wiley & Sons, was released on 2016-12-19 and runs 381 pages. Provides an important framework for data analysts in assessing the quality of data and its potential to provide meaningful insights through analysis. Analytics and statistical analysis have become pervasive topics, mainly due to the growing availability of data and analytic tools. Technology, however, fails to deliver insights with added value if the quality of the information it generates is not assured. Information Quality (InfoQ) is a tool developed by the authors to assess the potential of a dataset to achieve a goal of interest, using data analysis. Whether the information quality of a dataset is sufficient is of practical importance at many stages of the data analytics journey, from the pre-data collection stage to the post-data collection and post-analysis stages. It is also critical to various stakeholders: data collection agencies, analysts, data scientists, and management. This book:
- Explains how to integrate the notions of goal, data, analysis and utility that are the main building blocks of data analysis within any domain.
- Presents a framework for integrating domain knowledge with data analysis.
- Provides a combination of both methodological and practical aspects of data analysis.
- Discusses issues surrounding the implementation and integration of InfoQ in both academic programmes and business / industrial projects.
- Showcases numerous case studies in a variety of application areas such as education, healthcare, official statistics, risk management and marketing surveys.
- Presents a review of software tools from the InfoQ perspective along with example datasets on an accompanying website.
This book will be beneficial for researchers in academia and in industry, analysts, consultants, and agencies that collect and analyse data as well as undergraduate and postgraduate courses involving data analysis.

The Self-Service Data Roadmap

Author : Sandeep Uttamchandani
Publisher : O'Reilly Media, Inc.
Total Pages : 297
Release : 2020-09-10
ISBN-10 : 1492075205
ISBN-13 : 9781492075202
Rating : 4/5 (02 Downloads)

Book Synopsis: The Self-Service Data Roadmap by Sandeep Uttamchandani

The Self-Service Data Roadmap, written by Sandeep Uttamchandani and published by O'Reilly Media, Inc., was released on 2020-09-10 and runs 297 pages. Data-driven insights are a key competitive advantage for any industry today, but deriving insights from raw data can still take days or weeks. Most organizations can’t scale data science teams fast enough to keep up with the growing amounts of data to transform. What’s the answer? Self-service data. With this practical book, data engineers, data scientists, and team managers will learn how to build a self-service data science platform that helps anyone in your organization extract insights from data. Sandeep Uttamchandani provides a scorecard to track and address bottlenecks that slow down time to insight across data discovery, transformation, processing, and production. This book bridges the gap between data scientists bottlenecked by engineering realities and data engineers unclear about ways to make self-service work.
- Build a self-service portal to support data discovery, quality, lineage, and governance
- Select the best approach for each self-service capability using open source cloud technologies
- Tailor self-service for the people, processes, and technology maturity of your data platform
- Implement capabilities to democratize data and reduce time to insight
- Scale your self-service portal to support a large number of users within your organization

Data Quality for Analytics Using SAS

Author : Gerhard Svolba
Publisher : SAS Institute
Total Pages : 356
Release : 2012-04-01
ISBN-10 : 1612902278
ISBN-13 : 9781612902272
Rating : 4/5 (72 Downloads)

Book Synopsis: Data Quality for Analytics Using SAS by Gerhard Svolba

Data Quality for Analytics Using SAS, written by Gerhard Svolba and published by SAS Institute, was released on 2012-04-01 and runs 356 pages. Analytics offers many capabilities and options to measure and improve data quality, and SAS is perfectly suited to these tasks. Gerhard Svolba's Data Quality for Analytics Using SAS focuses on selecting the right data sources and ensuring data quantity, relevancy, and completeness. The book is made up of three parts. The first part, which is conceptual, defines data quality and contains text, definitions, explanations, and examples. The second part shows how the data quality status can be profiled and the ways that data quality can be improved with analytical methods. The final part details the consequences of poor data quality for predictive modeling and time series forecasting. With this book you will learn how you can use SAS to perform advanced profiling of data quality status and how SAS can help improve your data quality. This book is part of the SAS Press program.