Skip navigation

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12128/16855
Title: Exploration of outliers in if-then rule-based knowledge bases
Authors: Nowak-Brzezińska, Agnieszka
Horyń, Czesław
Keywords: rule-based knowledge base; outliers detection; cluster validity; data clustering; AHC; LOF; COF; K-MEANS; SMALL CLUSTERS
Issue Date: 2020
Citation: "Entropy" 2020, iss. 10, art. no. 1096
Abstract: The article presents both methods of clustering and outlier detection in complex data, such as rule-based knowledge bases. What distinguishes this work from others is, first, the application of clustering algorithms to rules in domain knowledge bases, and secondly, the use of outlier detection algorithms to detect unusual rules in knowledge bases. The aim of the paper is the analysis of using four algorithms for outlier detection in rule-based knowledge bases: Local Outlier Factor (LOF), Connectivity-based Outlier Factor (COF), K-MEANS, and SMALL CLUSTERS. The subject of outlier mining is very important nowadays. Outliers in rules If-Then mean unusual rules, which are rare in comparing to others and should be explored by the domain expert as soon as possible. In the research, the authors use the outlier detection methods to find a given number of outliers in rules (1% , 5%, 10%), while in small groups, the number of outliers covers no more than 5% of the rule cluster. Subsequently, the authors analyze which of seven various quality indices, which they use for all rules and after removing selected outliers, improve the quality of rule clusters. In the experimental stage, the authors use six different knowledge bases. The best results (the most often the clusters quality was improved) are achieved for two outlier detection algorithms LOF and COF.
URI: http://hdl.handle.net/20.500.12128/16855
DOI: 10.3390/e22101096
ISSN: 1099-4300
Appears in Collections:Artykuły (WNŚiT)

Files in This Item:
File Description SizeFormat 
Nowak-Brzezinska_Exploration_of_Outliers_in_If.pdf2,02 MBAdobe PDFView/Open
Show full item record


Uznanie Autorstwa 3.0 Polska Creative Commons License Creative Commons