Skip navigation

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12128/17028
Title: Reduct-based ranking of attributes
Authors: Zielosko, Beata
Stańczyk, Urszula
Keywords: feature reduction; reduct; rough sets; decision rules; classification; ranking of attributes; stylometry
Issue Date: 2020
Citation: "Procedia Computer Science" Vol. 176 (2020), s. 2576-2585
Abstract: The paper is dedicated to the area of feature selection, in particular a notion of attribute rankings that allow to estimate importance of variables. In the research presented for ranking construction a new weighting factor was defined, based on relative reducts. A reduct constitutes an embedded mechanism of feature selection, specific to rough set theory. The proposed factor takes into account the number of reducts in which a given attribute exists, as well as lengths of reducts. Two approaches for reduct generation were employed and compared, with search executed by a genetic algorithm. To validate the usefulness of the reduct-based rankings in the process of feature reduction, for gradually decreasing subsets of attributes, selected through rankings, sets of decision rules were induced in classical rough set approach. The performance of all rule classifiers was evaluated, and experimental results showed that the proposed rankings led to at least the same, or even increased classification accuracy for reduced sets of features than in the case of operating on the entire set of condition attributes. The experiments were performed on datasets from stylometry domain, with treating authorship attribution as a classification task, and stylometric descriptors as characteristic features defining writing styles.
URI: http://hdl.handle.net/20.500.12128/17028
DOI: 10.1016/j.procs.2020.09.315
ISSN: 1877-0509
Appears in Collections:Artykuły (WNŚiT)

Files in This Item:
File Description SizeFormat 
Zielosko_Reduct_based_ranking_of_attributes.pdf1,11 MBAdobe PDFView/Open
Show full item record


Uznanie autorstwa - użycie niekomercyjne, bez utworów zależnych 3.0 Polska Creative Commons License Creative Commons