Comparative Analysis : Classification Algorithms

3166 Words Feb 9th, 2015 13 Pages
Comparative Analysis of Classification Algorithms
Abstract
Data Mining is the non-trivial extraction of potentially useful information about data. In other words, Data Mining extracts the knowledge or interesting information from large set of structured data that are from different sources. There are various research domains in data mining specifically text mining, web mining, image mining, sequence mining, process mining, graph mining, etc. Data mining applications are used in a range of areas such as it is used for financial data analysis, retail and telecommunication industries, banking, health care and medicine. In health care, the data mining is mainly used for disease prediction. In data mining, there are several techniques have been developed and used for predicting the diseases that includes data preprocessing, classification, clustering, association rules and sequential patterns. This paper analyses the performance of two classification techniques such as Bayesian and Lazy classifiers for hepatitis dataset. In Bayesian classifier there are two algorithms namely BayesNet and NaiveBayes. In Lazy classifier we have two algorithms namely IBK and KStar. Comparative analysis is done by using the WEKA tool.It is open source software which consists of the collection of machine learning algorithms for data mining tasks.
Keywords: Data Mining, Classification, Bayesian, Lazy, BayesNet, NaiveBayes, IBK, KStar
I. Introduction Data mining refers to extracting knowledge from…
Open Document