Skip navigation

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12128/17715
Title: Parsing expression grammars and their induction algorithm
Authors: Wieczorek, Wojciech
Unold, Olgierd
Strąk, Łukasz
Keywords: classification; genetic programming; grammatical inference; parsing expression grammar
Issue Date: 2020
Citation: "Applied Science" (2020) iss. 23, art. no. 8747, s. 1-16
Abstract: Grammatical inference (GI), i.e., the task of finding a rule that lies behind given words, can be used in the analyses of amyloidogenic sequence fragments, which are essential in studies of neurodegenerative diseases. In this paper, we developed a new method that generates non-circular parsing expression grammars (PEGs) and compares it with other GI algorithms on the sequences from a real dataset. The main contribution of this paper is a genetic programming-based algorithm for the induction of parsing expression grammars from a finite sample. The induction method has been tested on a real bioinformatics dataset and its classification performance has been compared to the achievements of existing grammatical inference methods. The evaluation of the generated PEG on an amyloidogenic dataset revealed its accuracy when predicting amyloid segments. We show that the new grammatical inference algorithm achieves the best ACC (Accuracy), AUC (Area under ROC curve), and MCC (Mathew’s correlation coefficient) scores in comparison to five other automata or grammar learning methods.
URI: http://hdl.handle.net/20.500.12128/17715
DOI: 10.3390/app10238747
ISSN: 2076-3417
Appears in Collections:Artykuły (WNŚiT)

Files in This Item:
File Description SizeFormat 
Wieczorek_Strak_parsing_expressions_grammar.pdf970,43 kBAdobe PDFView/Open
Show full item record


Uznanie Autorstwa 3.0 Polska Creative Commons License Creative Commons