NEC CSE 3-1 R20 Data Warehouse and Data Mining Syllabus and Materials PDF

0
UNIT– I

Introduction to data mining: -What Is Data Mining, -Motivating Challenges, -The Origins of Data Mining, -Data Mining Tasks, -Types of Data: Attributes and Measurement, Types of Data Sets, -Data Quality: Measurement and Data Collection Issues, Issues Related to Applications.

Download UNIT-I Material Here 

UNIT–II

 Data Preprocessing: Aggregation, Sampling, Dimensionality Reduction, Feature Subset Selection, Feature Creation, Discretization and Binarization, Variable Transformation, -Measures of similarity and dissimilarity: Basics, Similarity and Dissimilarity between Simple Attributes, Dissimilarities between Data Objects, Similarities between Data Objects, Examples of Proximity Measures.

Download UNIT-II Data Preprocessing Material Here 

Exploring data
: -The Iris Data set, -Summary Statistics: Frequencies and the Mode, Percentiles, Measures of Location: Mean and Median, Measures of Spread: Range and Variance, Multivariate Summary Statistics, Other Ways to Summarize the Data.

Download UNIT-II Exploring Data Material Here 




UNIT–III

Data Warehouse and OLAP Technology for Data Mining: -What is a Data Warehouse, -A Multidimensional Data Model: From tables to data cubes, Stars, snowflake, and fact constellations(schemas for multidimensional databases), Examples for defining star, snowflake, and fact constellation schemas, Measures(their categorization and computation), Introducing concept hierarchies, OLAP operations in the multidimensional data model, A starnet query model for querying multidimensional databases, -Data Warehouse Architecture: Steps for the design and construction of data warehouses, A three-tier data warehouse architecture, OLAP server


Architectures: ROLAP vs. MOLAP vs. HOLAP, SQL extensions to support OLAP operations, Data Warehouse implementation: Efficient computation of data cubes, Indexing OLAP data, Efficient processing of OLAP queries, Metadata repository, Data warehouse back-end tools and utilities, -Further development of data cube technology: Discovery-driven exploration of data cubes, Complex aggregation at multiple granularities (Multifeature cubes), -From data warehousing to data mining: Data warehouse usage, From online analytical processing to online analytical mining.


UNIT-IV

Classification: Basic Concepts, Decision Trees, and Model Evaluation: -Preliminaries, - General Approach to Solving a Classification Problem, Decision Tree Induction: How a Decision Tree Works, How to Build a Decision Tree, Methods for Expressing Attribute Test Conditions, Measures for Selecting the Best Split, Algorithm for Decision Tree Induction, -Model Overfitting: Overfitting Due to Presence of Noise, Overfitting Due to Lack of Representative Samples, Evaluating the Performance of a Classifier: Holdout Method, Random Subsampling, Cross Validation, Bootstrap.

Classification: alternative techniques: -Bayesian classifier: Bayes Theorem, Using Bayes theorem for classification, Naïve Bayesian classifier.

UNIT-V

Association Analysis: Basic Concepts and Algorithms: -Problem Definition, -Frequent Itemset Generation: The Apriori principle, Frequent Itemset Generation in the Apriori Algorithm, Candidate Generation and Pruning, Support Counting, -Rule Generation: Confidence-Based Pruning, -Compact Representation of Frequent Itemsets: Maximal Frequent Itemsets, Closed Frequent Itemsets, -FP-Growth algorithms: FP-Tree Representation, Frequent Itemset Generation in FP-Growth Algorithm.

Cluster Analysis: Basic Concepts and Algorithms: -Overview: What is Cluster Analysis? Different Types of Clustering, Different Types of Clusters, -K-means: The Basic K-means Algorithm, K-means Additional Issues, Bisecting K-means, K-means and Different Types of Clusters, Strengths and Weaknesses, -Agglomerative Hierarchical Clustering: Basic Agglomerative Hierarchical Clustering Algorithm, Specific Techniques, -DBSCAN: Traditional Density (center based approach), The DBSCAN algorithm, Strengths and Weaknesses.

Post a Comment

0Comments
Post a Comment (0)

Join CSE Team