About the Book:
Data is everywhere and it’s growing at an unprecedented rate. But making sense of all that data is a challenge. Data Mining is the process of discovering patterns and knowledge from large data sets, and Data Mining with Python focuses on the hands-on approach to learning Data Mining. It showcases how to use Python Packages to fulfill the Data Mining pipeline, which is to collect, integrate, manipulate, clean, process, organize, and analyze data for knowledge.
The contents are organized based on the Data Mining pipeline, so readers can naturally progress step by step through the process. Topics, methods, and tools are explained in three aspects: “What it is” as a theoretical background, “why we need it” as an application orientation, and “how we do it” as a case study.
Contents:
Section I. Data Wrangling
1. Data Collection
2. Data Integration
3. Data Statistics
4. Data Visualization
5. Data Preprocessing
Section II. Data Analysis
6. Classification
7. Regression
8. Clustering
9. Frequent Patterns
About the Author: