Plagiarism Catcher

Description

  • Rais Kazi

  • December 2020

I developed a program to compare multiple text files and check for plagiarism in C++. I stored the contents of each file into its own easily-parsible vector of characters and then used a hashset to store all possible sequences. I used the instantaneous look up time to find similarities between multiple sets. Pairs of sets with a number of collisions that indicate plagiarism are outputted alongside this number. I developed a solution with a runtime of 0.1 seconds, while the expected runtime was approximately 60 seconds.

Technology