
Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12128/20581
Full metadata record
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Nowak-Brzezińska, Agnieszka | - |
| dc.contributor.author | Łazarz, Weronika | - |
| dc.date.accessioned | 2021-07-09T12:02:21Z | - |
| dc.date.available | 2021-07-09T12:02:21Z | - |
| dc.date.issued | 2021 | - |
| dc.identifier.citation | Entropy, 2021, Vol. 23, art. no. 869 | pl_PL |
| dc.identifier.issn | 1099-4300 | - |
| dc.identifier.uri | http://hdl.handle.net/20.500.12128/20581 | - |
| dc.description.abstract | Detecting outliers is a widely studied problem in many disciplines, including statistics, data mining, and machine learning. All anomaly detection activities aim to identify cases of unusual behavior compared to most observations. Many methods address this issue, and their applicability depends on the size of the data set, the way it is stored, and the type of attributes and their values. Most of them focus on traditional datasets with a large number of quantitative attributes. While there are many solutions for detecting outliers in quantitative sets, detecting outliers in data containing only qualitative variables remains a problem with few research solutions. This article compares three categorical data clustering algorithms: the K-modes algorithm, derived from MacQueen’s K-means algorithm, and the STIRR and ROCK algorithms. The comparison concerned the method of dividing the set into clusters and, in particular, the outliers detected by the algorithms. During the research, the authors analyzed the clusters detected by these algorithms on several datasets that differ in the number of objects and variables, and conducted experiments on the parameters of the algorithms. The study made it possible to check whether the algorithms detect outliers in the data similarly and how much they depend on individual algorithm parameters and on properties of the set, such as the number of variables, tuples, and categories of a qualitative variable. | pl_PL |
| dc.language.iso | en | pl_PL |
| dc.rights | Attribution 3.0 Poland | * |
| dc.rights.uri | http://creativecommons.org/licenses/by/3.0/pl/ | * |
| dc.subject | qualitative data | pl_PL |
| dc.subject | outliers detection | pl_PL |
| dc.subject | data clustering | pl_PL |
| dc.subject | K-modes | pl_PL |
| dc.subject | ROCK | pl_PL |
| dc.subject | STIRR | pl_PL |
| dc.title | Qualitative Data Clustering to Detect Outliers | pl_PL |
| dc.type | info:eu-repo/semantics/article | pl_PL |
| dc.relation.journal | Entropy | pl_PL |
| dc.identifier.doi | 10.3390/e23070869 | - |
Appears in collection: Artykuły (WNŚiT)

Files in this item:

| File | Description | Size | Format |
| --- | --- | --- | --- |
| Nowak_Brzezinska_Qualitative_Data_Clustering.pdf | | 1.77 MB | Adobe PDF |


License: Creative Commons Attribution 3.0 Poland