TY - BOOK T1 - Data science for dummies T2 - For dummies A1 - Pierson, Liliian., author LA - English PP - Hoboken, NJ PB - John Wiley and Sons, Inc. YR - 2017 ED - Second edition. UL - https://tuklas.up.edu.ph/Record/UP-99796217613163324 AB - Begins by explaining large data sets and data formats, including sample Python code for manipulating data. The book explains how to work with relational databases and unstructured data, including NoSQL. The book then moves into preparing data for analysis by cleaning it up or "munging" it. From there the book explains data visualization techniques and types of data sets. Part II of the book is all about supervised machine learning, including regression techniques and model validation techniques. Part III explains unsupervised machine learning, including clustering and recommendation engines. Part IV overviews big data processing, including MapReduce, Hadoop, Dremel, Storm, and Spark. The book finishes up with real world applications of data science and how data science fits into organizations. OP - 364 CN - T 58.5 P54 2017 SN - 9781119327639 (paperbackl) KW - Information retrieval. KW - Data mining. KW - Information technology. KW - Databases. KW - COMPUTERS -- Data Processing. ER -