product
Title: Deequ for Scalable Data Quality Assurance
Author: William Smith
Publisher: HiTeX Press
Publication date: 2025-07-24
Format: Ebook
Language: English
Country: Mexico
ISBN: 6610000973354
Category: /Ebooks/
Price: 186 MXN
Availability: InStock
Seller: Gandhi
URL: https://www.gandhi.com.mx/deequ-for-scalable-data-quality-assurance-6610000973354/p
Image: https://gandhi.vtexassets.com/arquivos/ids/7286484/image.jpg?v=638889772952900000

<p>"Deequ for Scalable Data Quality Assurance"</p>
<p>In an era where data powers decision-making at every level, ensuring the quality of massive and fast-growing datasets poses unprecedented challenges. "Deequ for Scalable Data Quality Assurance" addresses this critical need by exploring not only the evolving standards and requirements for data quality in large-scale, modern systems but also the profound business and technical risks of neglecting it. The book begins by framing the dimensions of data quality (accuracy, completeness, consistency, timeliness, and validity) and critically evaluates traditional approaches, making a compelling case for automation and scalable, data-driven architectures.</p>
<p>At the heart of this work is a comprehensive exploration of Deequ, an open-source library purpose-built for automated, scalable data quality checks on distributed platforms such as Apache Spark. Through clear architectural exposition, the book demystifies Deequ's foundational abstractions (metrics, checks, constraints, and analyzers), then guides readers in designing expressive, reusable, and parameterized validations. Advanced chapters reveal how to extend Deequ with custom metrics, orchestrate robust quality workflows in production, and integrate with CI/CD, monitoring, and audit frameworks, all while upholding security and regulatory compliance in areas such as GDPR and HIPAA.</p>
<p>Drawing from hands-on case studies in enterprise environments, the book illustrates the end-to-end lifecycle of data quality management, from automated detection and remediation to storytelling with actionable insights. Readers gain practical knowledge in deployment strategies, visualization, and root cause analytics while also being introduced to future trends in automated quality assurance and intelligent profiling. Whether you are a data engineer, architect, or leader, this book is an essential guide to mastering scalable data quality in the era of big data.</p>