product
2077145Advanced Analytics with PySparkhttps://www.gandhi.com.mx/advanced-analytics-with-pyspark/phttps://gandhi.vtexassets.com/arquivos/ids/1431563/f8a81ccd-27bf-417d-936b-5d06a7729a7f.jpg?v=638338105763430000490681MXNOReilly MediaInStock/Ebooks/<p>The amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Sparks Python API, and other best practices in Spark programming.</p><p>Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques-including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing.</p><p>If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis.</p><ul><li>Familiarize yourself with Sparks programming model and ecosystem</li><li>Learn general approaches in data science</li><li>Examine complete implementations that analyze large public datasets</li><li>Discover which machine learning tools make sense for particular problems</li><li>Explore code that can be adapted to many uses</li></ul>...2033793Advanced Analytics with PySpark490681https://www.gandhi.com.mx/advanced-analytics-with-pyspark/phttps://gandhi.vtexassets.com/arquivos/ids/1431563/f8a81ccd-27bf-417d-936b-5d06a7729a7f.jpg?v=638338105763430000InStockMXN99999DIEbook20229781098103606_W3siaWQiOiIwMGJjM2RjOS05OTdkLTRkOGUtODZjYS0xMTM4OTA3MzE4M2IiLCJsaXN0UHJpY2UiOjY4MSwiZGlzY291bnQiOjE5MSwic2VsbGluZ1ByaWNlIjo0OTAsImluY2x1ZGVzVGF4Ijp0cnVlLCJwcmljZVR5cGUiOiJXaG9sZXNhbGUiLCJjdXJyZW5jeSI6Ik1YTiIsImZyb20iOiIyMDI0LTA1LTIxVDA5OjAwOjAwWiIsInJlZ2lvbiI6Ik1YIiwiaXNQcmVvcmRlciI6ZmFsc2V9XQ==9781098103606_<p>The amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Sparks Python API, and other best practices in Spark programming.</p><p>Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques-including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing.</p><p>If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis.</p><ul><li>Familiarize yourself with Sparks programming model and ecosystem</li><li>Learn general approaches in data science</li><li>Examine complete implementations that analyze large public datasets</li><li>Discover which machine learning tools make sense for particular problems</li><li>Explore code that can be adapted to many uses</li></ul>...9781098103606_OReilly Medialibro_electonico_a554cca4-c17a-3983-bd44-9ad3af1f78e3_9781098103606;9781098103606_9781098103606Josh WillsInglésMéxicohttps://getbook.kobo.com/koboid-prod-public/oreilly-epub-b9f99e5c-ecf8-46ab-9d9f-c2e634d6a797.epub2022-06-14T00:00:00+00:00OReilly Media