Day 1: Basic data mining concepts
At Harrisburg University 🙂 Note from Dough Rumbaugh
Statistical analysis vs data mining
statistical analysis: tests a specific claim, e.g. do the two datasets have the same distribution? (t-test, which compares their means)
data mining: not a specific claim; looks for general correlations without an initial hypothesis.
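To make the "specific claim" side concrete, here is a minimal sketch of a two-sample t-statistic. It assumes Welch's variant (unequal variances), which the notes do not specify, and the sample values are made up for illustration. It uses only the standard library and stops at the statistic; turning it into a p-value needs the t-distribution (e.g. from a stats package).

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t-statistic for the difference in means of two samples."""
    va, vb = variance(a), variance(b)  # sample variances (n - 1 denominator)
    return (mean(a) - mean(b)) / sqrt(va / len(a) + vb / len(b))


# Hypothetical measurements from two groups (illustrative data only).
group_a = [5.1, 4.9, 5.0, 5.2, 4.8]
group_b = [5.9, 6.1, 6.0, 5.8, 6.2]

t = welch_t(group_a, group_b)
print(f"t = {t:.2f}")  # a large |t| suggests the means differ
```

A t-statistic near zero is consistent with equal means; data mining, by contrast, would scan the data for relationships without fixing such a hypothesis first.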
Data structure:
structured data: relational database
unstructured data: handwriting
7 steps of data processing:
1. Cleaning
2. Integration
3. Selection
4. Transformation
5. Data mining
6. Pattern evaluation
7. Presentation (show off)
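The seven steps can be sketched as a chain of small functions. This is a toy illustration, not a real pipeline: the records, field names, and the "total quantity per item" mining step are all assumptions chosen to keep the example short.

```python
from collections import Counter

# Hypothetical raw records from two sources; None marks a missing value.
source_a = [{"id": 1, "item": "milk", "qty": 2}, {"id": 2, "item": None, "qty": 1}]
source_b = [
    {"id": 3, "item": "Milk ", "qty": 1},
    {"id": 4, "item": "bread", "qty": 3},
    {"id": 5, "item": "eggs", "qty": 1},
]


def clean(rows):
    # 1. Cleaning: drop rows that contain missing values.
    return [r for r in rows if all(v is not None for v in r.values())]


def integrate(*sources):
    # 2. Integration: combine several sources into one dataset.
    return [r for src in sources for r in src]


def select(rows, fields):
    # 3. Selection: keep only the fields relevant to the analysis.
    return [{f: r[f] for f in fields} for r in rows]


def transform(rows):
    # 4. Transformation: normalize item names into a consistent form.
    return [{**r, "item": r["item"].strip().lower()} for r in rows]


def mine(rows):
    # 5. Data mining: here, simply total quantity per item.
    totals = Counter()
    for r in rows:
        totals[r["item"]] += r["qty"]
    return totals


def evaluate(totals, min_qty):
    # 6. Pattern evaluation: keep only patterns above a threshold.
    return {item: qty for item, qty in totals.items() if qty >= min_qty}


def present(patterns):
    # 7. Presentation: format the surviving patterns for a reader.
    return [f"{item}: {qty}" for item, qty in sorted(patterns.items())]


rows = integrate(clean(source_a), clean(source_b))
rows = transform(select(rows, ["item", "qty"]))
report = present(evaluate(mine(rows), min_qty=2))
print(report)
```

Each function maps to one step, so a step can be swapped out (e.g. a different mining method) without touching the others.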
Data mining tasks:
1. Pattern mining
2. Classification/regression
3. Cluster analysis
4. Outlier analysis (ex: credit card fraud)
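As a rough illustration of outlier analysis in the credit card setting, the sketch below flags transaction amounts that sit far from the median. The method (modified z-score based on the median absolute deviation), the 3.5 cutoff, and the amounts are all assumptions for the example; real fraud detection uses far richer features.

```python
from statistics import median


def mad_outliers(values, threshold=3.5):
    """Return values whose modified z-score exceeds the threshold.

    The modified z-score uses the median and the median absolute
    deviation (MAD), which a single extreme value barely shifts.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        return []  # no spread around the median; nothing can be scored
    return [v for v in values if abs(0.6745 * (v - med) / mad) > threshold]


# Hypothetical card transaction amounts; one is suspiciously large.
amounts = [12, 15, 11, 14, 13, 950, 12, 16]
flagged = mad_outliers(amounts)
print(flagged)
```

The median-based score is used here instead of a mean-based z-score because one huge transaction inflates the mean and standard deviation, which can hide the very outlier being looked for.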