Summary of topics offered - Department of Informatics (FBE)


Basic information

Type of work:
Diploma thesis
Topic:
Intrinsic Plagiairsm Detection
State of topic:
approved (prof. Ing. Cyril Klimeš, CSc. - head of department)
Thesis supervisor:
Faculty:
Faculty of Business and Economics
Supervising department:
Department of Informatics - FBE
Max. no. of students:
--
Proposed by:
Summary:
There are two approaches to plagiarism detection. Extrinsic plagiarism detection looks for similarities across different documents, intrinsic plagiarism detection looks for dissimilarities within one document. Crucial presumption is that different authors have different style of writing, which allows their identification. Given suspicious document, the goal is to identify passages with different (stylometric) characteristic. Given the set of these passages, the next goal is to group them by authorship. The task of the student will be to identify prospective stylometric features, implement intrinsic plagiarism detector and test it on PAN corpus. Outperforming contesters in PAN competition is welcome, but not necessary :-) The thesis can be elaborated in Czech or English.



Limitations of the topic

To sign up for a topic it is necessary to fulfil one of the following restrictions

Restrictions by study
The table shows restrictions by study to which the student has to be enrolled in order to sign up for the given topic.

Programme
C-SE System engineering and informatics
C-II Engineering Informatics
C-SIA System Engineering and Informatics

Limit to courses
The table shows limitations of a course the student has to complete to be able to register for a given topic.

DepartmentCourse title
No suitable data found.