Basic information

Thesis type: Master's thesis
Topic title:
Mining textual data with using syntactic categories
Topic status:
approved (prof. Ing. Cyril Klimeš, CSc. - head of department)
Thesis supervisor:
Provozně ekonomická fakulta
Supervising department:
Ústav informatiky (PEF)
Max. number of students:
Mining knowledge from textual data is a highly topical problem. Machine learning is the dominant approach and achieves satisfactory results for many classes of problems. Enriching the input data (the content of the processed documents) with information about each word's syntactic category can potentially improve the results. The aim of the thesis is to implement a knowledge discovery process on a large collection of text data that applies part-of-speech tagging, and to compare the process and its results with an approach that does not consider syntactic categories. The knowledge mining process will use classification or clustering, or will focus on finding meaningful words and expressions that describe the selected domain well.

No restrictions have been set for this topic.