This two-day course reviews in-depth how to prepare data for a successful data mining project. Included are examples of appending and merging files, sampling and partitioning records from files, handling missing data, and working with dates and sequence data.
Prerequisite: General computer literacy. Some experience with using Clementine, including familiarity with the Clementine environment, creating streams, reading in data files, and doing simple data exploration and manipulation. Prior completion of the Introduction to Clementine and Data Mining course is strongly encouraged.
- Introduction to Data Preparation
- Combining Data Files
- Sampling Data & Missing Data
- Outliers and Anomalous Data
- Working with Dates and String Data
- Data Transformations
- Working with Sequence Data
- Aggregating Data
- Exporting Data Files
- Efficiency with Clementine
Course Duration: 2 Full Days
Venues: Gauteng and Cape Town

.jpg)