Undergraduate
Faculty of Engineering and Architecture
Industrial Engineering
Anlık RSS Bilgilendirmesi İçin Tıklayınız.Düzenli bilgilendirme E-Postaları almak için listemize kaydolabilirsiniz.


Introduction to Data Mining

Course CodeSemester Course Name LE/RC/LA Course Type Language of Instruction ECTS
IE0104 Introduction to Data Mining 3/0/0 DE English 6
Course Goals
The main objective of this course is to provide students with the theoretical background and practical experience necessary to the application of data mining techniques to real world problems.
Prerequisite(s) IE3101 Introduction to Probability
Corequisite(s) -
Special Requisite(s) -
Instructor(s) Professor Ayça Çakmak Pehlivanlı
Course Assistant(s) -
Schedule This course is not offered in this semester.
Office Hour(s) This course is not offered in this semester.
Teaching Methods and Techniques Lectures, Project, Discussion, Homeworks

 
Principle Sources P.N. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining, Addison Wesley, Pearson Education, 2005. 
Other Sources C. Shmueli, N. R. Patel, P. C. Bruce, Data Mining for Business Intelligence: Concepts, Techniques and Applications in Microsoft Office Excel with XLMiner, Wiley-Interscience, 2007.

J. Han and M. Kamber, Data Mining - concepts and Techniques, 2nd Edition, Morgan Kaufmann, 2006.

R. J. Roiger and M. W. Geatz, Data Mining - A Tutorial Based Primer, Addison Wesley, Pearson Education, 2003.
Course Schedules
Week Contents Learning Methods
1. Week Overview of the course (syllbus, teaching methods, sources, outcomes) Introduction and Motivation to Data Mining - DM tools Presentation
2. Week Data Mining: Definitions, Motivation, Concepts and Techniques Data: Types of Data, Data Quality, Data Preprocessing, Measures of Similarity and Dissimilarity Presentation
3. Week Data Preprocessing: Data Sampling, Data Cleaning, Feature Selection and Dimensionality Reduction Presentation, Practice 1
4. Week Review of related and basic Statistics Supervised Learning - Prediction : Linear Regression and applications Classification: Tree-based Methods & Rule-based Methods & Bayesian Presentation
5. Week Supervised Learning - Classifications: Basic Concepts, Decision Trees and applications Presentation, Practice 2
6. Week Supervised Learning - Classifications: Bayes Theorem - Naive Bayes and applications Presentation, Practice 3
7. Week Review and Case Studies: Supervised Learning: problem definition and solution Presentation
8. Week Midterm Exam
9. Week Model Evaluation (Evaluating the Performance of a Classifier, Methods for Comparing Classifiers) Presentation, Practice 4
10. Week Association Analysis: Basic Concepts and Apriori Algorithm Presentation
11. Week Classification algorithms and Association Analysis by WEKA Presentation, Practice 5
12. Week Unsupervised Learning - Clustering: Hierarchical Clustering Methods Presentation, Practice 6
13. Week Unsupervised Learning - Clustering: K-Means algorithms and applications Presentation, Practice 7
14. Week Review Presentation
15. Week Final Exam
16. Week Final Exam
17. Week Final Exam
Assessments
Evaluation tools Quantity Weight(%)
Midterm(s) 1 35
Homework / Term Projects / Presentations 1 25
Final Exam 1 40


Program Outcomes
PO-1Ability to apply theoretical and practical knowledge gained by Mathematics, Science and their engineering fields and ability to use their knowledge in solving complex engineering problems.
PO-2Ability of determining, defining, formulating and solving complex engineering problems; for that purpose develop the ability of selecting and implementing suitable models and methods of analysis.
PO-3Ability of designing a complex system, process, device or product under real world constraints and conditions serving certain needs; for this purpose ability of applying modern design techniques
PO-4Ability of selecting and using the modern techniques and devices which are necessary for analyzing and solving complex problems in engineering implementations; ability of efficient usage of information technologies.
PO-5Ability of designing experiments, conducting tests, collecting data and analyzing and interpreting the solutions to investigate of complex engineering problems or discipline-specific research topics.
PO-6Ability of working efficiently in intra-disciplinary and multi-disciplinary teams; individual working ability and habits.
PO-7Ability of verbal and written communication skills; and at least one foreign language skills, ability to write effective reports and understand written reports, ability to prepare design and production reports, ability to make impressive presentation, ability to give and receive clear and understandable instructions
PO-8Awareness of importance of lifelong learning; ability to access data, to follow up the recent innovation in science and technology for continuous self-improvement.
PO-9Conformity to ethical principles; knowledge about occupational and ethical responsibility, and standards used in engineering applications.
PO-10Knowledge about work life implementations such as project management, risk management and change management; awareness about entrepreneurship and innovativeness; knowledge about sustainable development.
PO-11Knowledge about effects of engineering applications on health, environment and security in global and social dimensions, and on the problems of the modern age in engineering; awareness about legal outcomes of engineering solutions.
Learning Outcomes
LO-1Explain what data mining is and how data mining can be employed to solve real life problems
LO-2Recognize whether a data mining solution is a feasible alternative for a specific problem
LO-3Apply statistical and nonstatistical techniques to evaluate the results of a data mining session
LO-4Recognize several data mining strategies and know when each strategy is appropriate, Explain and apply how several data mining techniques build models to solve problems
LO-5Learn how to preprocess data before applying data mining techniques, Practice data mining techniques on different data sets using a software package, Evaluate the data mining performance
LO-6Discuss ethical and professional issues in data mining, Recognize the usage and the limitations of data mining algorithms
Course Assessment Matrix:
Program Outcomes - Learning Outcomes Matrix
 PO 1PO 2PO 3PO 4PO 5PO 6PO 7PO 8PO 9PO 10PO 11
LO 1
LO 2
LO 3
LO 4
LO 5
LO 6