Cleaning Data for Effective Data Science
Cleaning Data for Effective Data Science
Paperback
Couldn't load pickup availability
Join our rewards scheme and earn 126 reward points on this purchase!
Earn 126 points on this!
Sign in or Sign up!- Release Date: 31/03/2021
- Barcode: 9781801071291
- Genre: Computing & Internet

Cleaning Data for Effective Data Science
Couldn't load pickup availability
Collapsible content
DESCRIPTION
Doing the other 80% of the work with Python, R, and command-line tools. Data in its raw state is rarely ready for productive analysis. This book not only teaches you data preparation, but also what questions you should ask of your data. It focuses on the thought processes necessary for successful data cleaning as much as on concise and precise code examples that express these thoughts. Think about your data intelligently and ask the right questions Data cleaning is the all-important first step to successful data science, data analysis, and machine learning. If you work with any kind of data, this book is your go-to resource, arming you with the insights and heuristics experienced data scientists had to learn the hard way. In a light-hearted and engaging exploration of different tools, techniques, and datasets real and fictitious, Python veteran David Mertz teaches you the ins and outs of data preparation and the essential questions you should be asking of every piece of data you work with. Using a mixture of Python, R, and common command-line tools, Cleaning Data for Effective Data Science follows the data cleaning pipeline from start to end, focusing on helping you understand the principles underlying each step of the process. You'll look at data ingestion of a vast range of tabular, hierarchical, and other data formats, impute missing values, detect unreliable data and statistical anomalies, and generate synthetic features. The long-form exercises at the end of each chapter let you get hands-on with the skills you've acquired along the way, also providing a valuable resource for academic courses. This book is designed to benefit software developers, data scientists, aspiring data scientists, teachers, and students who work with data. If you want to improve your rigor in data hygiene or are looking for a refresher, this book is for you. Basic familiarity with statistics, general concepts in machine learning, knowledge of a programming language (Python or R), and some exposure to data science are helpful.
Book Description
Who this book is for
DELIVERY & RETURNS
UK Delivery:
- Free delivery on all orders of £10 or more.
- £1.49 delivery fee on orders below £10.
- UK orders are shipped via Royal Mail 2nd Class.
International Delivery:
- Flat rate delivery charges vary by country.
Dispatch and Delivery Times:
- All orders are shipped from our warehouse in Northampton, UK within 48 hours of receipt during working hours.
- UK mainland orders typically arrive within 3-5 working days via Royal Mail 2nd Class.
- International estimated delivery times:
- Europe & Channel Islands: 7 to 10 working days
- USA: 7 to 15 working days
- Rest of the World: 9 to 21 working days
View our full delivery infomation here.
-
OVER
2 MILLION PRODUCTS
-
60 MILLION CUSTOMERS
ACROSS 190 COUNTRIES
You might also like
Loading recommendations...