Data Mining and network analysis IDN0110
Fall 2018/2019
IDN0110 / ITI8730: Data Mining and network analysis Taught by: Sven Nõmm EAP: 6.0
Examination (final project defense) and Consultations times
4.01 16:00-17:00 Consultation, make up for Closed book test 2. (If you wish to defend your project this date please contact lecturer in advance.)
10.01 16:00- 17:45 Examination (final project defense)
17.01 16:00 – 17:00 Consultation
23.01 16:00- 17:45 Examination (final project defense)
Lectures: Wednesdays 14:00-15:30 ICT-A1 Labs: Thursdays 16:00-17:30 ICT-401
Consultation: by appointment only Please do not hesitate to ask for appointment!!!
For communication please use the following e-mail: sven.nomm@ttu.ee
Overview
The course aims to provide knowledge of theory behind different methods of data mining and develop practical skills in applying those methods on practice. Is is spanned around four "super problems" of data mining:
- Clustering
- Classification
- Association pattern mining
- Outlier analysis
Main topics of the course:
- Data types and Data Preparation
- Similarity and Distances, Association Pattern Mining,
- Cluster Analysis, Classification, Outlier analysis
- Data streams, Text Data, Time Series, Discrete Sequences,
- Spatial Data, Graph Data, Web Data, Social Network Analysis
Evaluation
- 2x mandatory closed book tests. Each test gives 10% of the final grade.
- 3x mandatory home assignments (Computational assignment +short write up.) Each assignment gives 10% of the final grade.
- final exam (gives 50 % of the final grade): Written report on assigned topic + discussion with lecturer.
Exam prerequisites: both closed book tests are accepted (graded as 51 or higher), all 3 home assignments are accepted (graded as 51 or higher).
Home assignments, code examples, data files and useful links will be distributed by means of ained.ttu.ee environment. Course enrollment process will be conducted during the first lecture.
Lectures
Lecture 1 Introduction and data preparation
Lecture 2 Similarity and distance
Lecture 3 Cluster analysis
Lecture 4 Classification
Lecture 5 Anomaly and Outlier Analysis
Practice 5 Implementation of EM - Algorithm
Lecture 6 Association Pattern Mining
Lecture 7 Similarity and Distance 2
Lecture 8 Text Data Mining
Lecture 9 Time Series Mining
Lecture 10 Data Streams Mining
Lecture 11 Graph Data Mining
Lecture 12 Social Network Analysis