Difference between revisions of "Data Mining (ITI8730)"
(11 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
− | <span style="color:red"> Information for perspective students: | + | <span style="color:red"> Information for perspective students:</span> |
− | + | ||
− | + | <span style="color:red"> Lecture schedule and slides content are tentative. Please follow the course page in TalTech Moodle for up to date information and lecture content!!!</span> | |
− | The course is open to students with valid TalTech UniID! | + | |
+ | <span style="color:red"> The course is open to students with valid TalTech UniID! | ||
The course targets M.Sc. curricula students. It is expected that the students are familiar with the Calculus, Linear algebra, Probability, Statistics and possess basic to intermediate knowledge of at least one programming language. This course is not recommended for students of B.Sc. curricula. | The course targets M.Sc. curricula students. It is expected that the students are familiar with the Calculus, Linear algebra, Probability, Statistics and possess basic to intermediate knowledge of at least one programming language. This course is not recommended for students of B.Sc. curricula. | ||
</span> | </span> | ||
+ | <span style="color:red"> | ||
+ | Code to join course page in Moodle and MS Teams will be provided to the students via ÕIS e-mail on Monday September the 4th. | ||
+ | </span> | ||
+ | |||
+ | <span style="color:red"> | ||
+ | Those planning to use their own computers please install "R" and "R-studio". | ||
+ | </span> | ||
Line 16: | Line 24: | ||
Taught by: Sven Nõmm | Taught by: Sven Nõmm | ||
+ | |||
+ | Teaching assistants Ilja Matjas, Rajesh Kalakoti | ||
EAP: 6.0 | EAP: 6.0 | ||
Line 43: | Line 53: | ||
* Cluster Analysis, Classification, Outlier analysis | * Cluster Analysis, Classification, Outlier analysis | ||
* Data streams, Text Data, Time Series, Discrete Sequences, | * Data streams, Text Data, Time Series, Discrete Sequences, | ||
− | * | + | * Graph Data, Social Network Analysis |
==Evaluation== | ==Evaluation== | ||
Line 53: | Line 63: | ||
Home assignments, code examples, data files and useful links will be distributed by means of Moodle environment. Course enrollment process in Moodle TBA. | Home assignments, code examples, data files and useful links will be distributed by means of Moodle environment. Course enrollment process in Moodle TBA. | ||
− | =Lectures = | + | =Lectures and Time line = |
== 05.09.23 Distance function == | == 05.09.23 Distance function == | ||
[[Media:Lecture_01_DM2023_Introduction_distance_functions.pdf |Slides]] | [[Media:Lecture_01_DM2023_Introduction_distance_functions.pdf |Slides]] | ||
Line 95: | Line 105: | ||
== 05.12.23 Graph data mining and Social analysis == | == 05.12.23 Graph data mining and Social analysis == | ||
− | [[Media: | + | [[Media:Lecture_13_DM2023_Mining_Data_Graph_Data.pdf |Slides]] |
− | [[Media: | + | [[Media:Lecture_13_DM2023_Social_Network_analysis.pdf |Slides]] |
== 12.12.23 Privacy preserving data mining== | == 12.12.23 Privacy preserving data mining== | ||
− | [[Media: | + | [[Media:Lecture_14_DM2023_Privacy_preserving_data_mining.pdf |Slides]] |
== 19.12.23 Closed Book Test II == | == 19.12.23 Closed Book Test II == |
Revision as of 08:15, 31 August 2023
Information for prospective students:
The lecture schedule and slide content are tentative. Please follow the course page in TalTech Moodle for up-to-date information and lecture content!
The course is open to students with a valid TalTech UniID. The course targets M.Sc. students. Students are expected to be familiar with calculus, linear algebra, probability and statistics, and to have basic to intermediate knowledge of at least one programming language. This course is not recommended for B.Sc. students.
The code to join the course page in Moodle and MS Teams will be sent to students via ÕIS e-mail on Monday, September 4th.
Those planning to use their own computers should install R and RStudio.
Fall 2023
ITI8730: Data Mining and network analysis
Old code for this course is IDN0110
Taught by: Sven Nõmm
Teaching assistants: Ilja Matjas, Rajesh Kalakoti
EAP: 6.0
Lectures: Tuesdays 12:15 - 13:45 ICT-A1
Labs (practices): Thursdays 14:00 - 15:30 ICT-404
Link to join MS Teams
Consultation: by appointment only. Please do not hesitate to ask for an appointment! For communication, please use the following e-mail: sven.nomm@taltech.ee
Prerequisites to join the course
Students are expected to be familiar with the foundations of calculus, linear algebra, probability theory and statistics, and to know at least one programming language.
Overview
The course aims to provide knowledge of the theory behind different data mining methods and to develop practical skills in applying those methods. It is built around four "super problems" of data mining:
- Clustering
- Classification
- Association pattern mining
- Outlier analysis
Main topics of the course:
- Data types and Data Preparation
- Similarity and Distances, Association Pattern Mining
- Cluster Analysis, Classification, Outlier analysis
- Data streams, Text Data, Time Series, Discrete Sequences
- Graph Data, Social Network Analysis
Evaluation
- Two mandatory closed-book tests. Each test contributes 10% of the final grade. One make-up attempt is allowed for each test.
- Three mandatory home assignments (computational assignment + short write-up). Each assignment contributes 10% of the final grade. Late assignments (submitted after the deadline) are accepted with a penalty of 10% for each day, excluding Saturdays and Sundays.
- Final exam (50% of the final grade): written report on an assigned topic + discussion with the lecturer.
Exam prerequisites: both closed-book tests are accepted (graded 51 or higher) and all three home assignments are accepted (graded 51 or higher).
Home assignments, code examples, data files and useful links will be distributed via the Moodle environment. The course enrollment process in Moodle is TBA.
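The grading rules above can be sketched as a short calculation. This is only an illustration under stated assumptions, not the official grading procedure: all scores are assumed to be on a 0-100 scale, the late penalty is read as 10 points per late weekday (the page does not say whether the 10% is absolute or relative), and the function names are hypothetical.

```python
from datetime import date, timedelta

# Weights taken from the evaluation list; they sum to 100%.
TEST_WEIGHT = 0.10        # each of the 2 closed-book tests
ASSIGNMENT_WEIGHT = 0.10  # each of the 3 home assignments
EXAM_WEIGHT = 0.50        # final exam
PASS_THRESHOLD = 51       # minimum score for a test or assignment to be accepted


def late_weekdays(deadline: date, submitted: date) -> int:
    """Count days after the deadline, skipping Saturdays and Sundays."""
    days = 0
    current = deadline
    while current < submitted:
        current += timedelta(days=1)
        if current.weekday() < 5:  # Monday=0 ... Friday=4
            days += 1
    return days


def penalized_score(raw_score: float, deadline: date, submitted: date) -> float:
    """Assumption: each late weekday removes 10 points from a 0-100 score."""
    return max(0.0, raw_score - 10.0 * late_weekdays(deadline, submitted))


def exam_allowed(tests, assignments) -> bool:
    """Exam prerequisite: both tests and all three assignments score 51 or higher."""
    scores = list(tests) + list(assignments)
    return (len(tests) == 2 and len(assignments) == 3
            and all(s >= PASS_THRESHOLD for s in scores))


def final_grade(tests, assignments, exam: float) -> float:
    """Weighted sum: 2 x 10% tests + 3 x 10% assignments + 50% exam."""
    return (TEST_WEIGHT * sum(tests)
            + ASSIGNMENT_WEIGHT * sum(assignments)
            + EXAM_WEIGHT * exam)
```

For instance, with test scores of 80 and 90, assignment scores of 70, 60 and 100, and an exam score of 75, `final_grade` gives roughly 77.5. An assignment due on a Friday and submitted the following Monday counts as one late day under this reading.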