Detection Using Principle Component Analysis And Case Based Reasoning With Support Vector Machine

2147 Words Feb 3rd, 2015 9 Pages
Splice site detection using principle component analysis and case based reasoning with support vector machine

Srabanti Maji*1 and Haripada Bhunia2

1 Computer Science Department
Sri Guru Harkrishan College of Management and Technology, Raipur, Bahadurgarh;
Dist: Patiala,Punjab, India

2 Department of Chemical Engineering
Thapar University, Patiala-147004, India

*Address Correspondence to this author at
Dr. Srabanti Maji
Computer Science Department,
Sri Guru Harkrishan College of Management and Technology, Raipur, Bahadurgarh;
District: Patiala, Punjab, India

E-mail address: srabantiindia@gmail.com, srabanti9@gmail.com
Tel: +91-9356006454

ABSTRACT

Identification of coding region from genomic DNA sequence is the foremost step in quest of the gene prediction. In the eukaryotic organism, the gene structure consists of promoter, intron, start codon, exons and stop codon, etc. In the prediction of splice site, which is the separating junction between exons and introns, though the sequences adjacent to the splice sites have a high conservation, but still, the accuracy of the tool is lower than 90%. Therefore, the algorithms used in the splice sites identification must be improved in order to recover the prediction accuracy and much attention has been paid during past few years in this area. This proposed method, SpliceCombo consists of three stages – initial stage, in which a Principal Component Analysis (PCA) is used, i.e. feature extraction; intermediate, or the…
Open Document