Skip to the main content

Original scientific paper

https://doi.org/10.20532/cit.2016.1002500

Knowledge-based Systems and Interestingness Measures: Analysis with Clinical Datasets

Jabez J. Christopher orcid id orcid.org/0000-0001-6744-9329 ; Ramanujan Computing Centre, Anna University, Chennai, India
Khanna H. Nehemiah ; Ramanujan Computing Centre Anna University Chennai, India
Kannan Arputharaj ; Department of Information Science and Technology Anna University Chennai, India


Full text: english pdf 414 Kb

page 65-78

downloads: 1.007

cite


Abstract

Knowledge mined from clinical data can be used for medical diagnosis and prognosis. By improving the quality of knowledge base, the efficiency of prediction of a knowledge-based system can be enhanced. Designing accurate and precise clinical decision support systems, which use the mined knowledge, is still a broad area of research. This work analyses the variation in classification accuracy for such knowledge-based systems using different rule lists. The purpose of this work is not to improve the prediction accuracy of a decision support system, but analyze the factors that influence the efficiency and design of the knowledge base in a rule-based decision support system. Three benchmark medical datasets are used. Rules are extracted using a supervised machine learning algorithm (PART). Each rule in the ruleset is validated using nine frequently used rule interestingness measures. After calculating the measure values, the rule lists are used for performance evaluation. Experimental results show variation in classification accuracy for different rule lists. Confidence and Laplace measures yield relatively superior accuracy: 81.188% for heart disease dataset and 78.255% for diabetes dataset. The accuracy of the knowledge-based prediction system is predominantly dependent on the organization of the ruleset. Rule length needs to be considered when deciding the rule ordering. Subset of a rule, or combination of rule elements, may form new rules and sometimes be a member of the rule list. Redundant rules should be eliminated. Prior knowledge about the domain will enable knowledge engineers to design a better knowledge base.

Keywords

knowledge base; decision support systems; rule-based classification; rule list; interestingness measures

Hrčak ID:

155087

URI

https://hrcak.srce.hr/155087

Publication date:

25.3.2016.

Visits: 1.868 *