Question

I'm storing papers in SQL Server 2005 and am looking for a way to paste in the text of a paper and then search for potential plagiarism (copied content) in the database.

What's the best way to go about this? Is there a way to get a gauge for the extent to which something is similar to something else using full-text indexing, for several paragraphs of content?

Was it helpful?

Solution

why don't you install google desktop and have it only index that one directory

then you can have google do the indexing for you

OTHER TIPS

This is not really the sort of problem that full-text indexing in SQL Server is designed to solve. There's nothing built in to SQL Server that you can really use to help with this.

There are a number of specialised plagiarism detection tools, which a Google search will turn up for you. That's probably your best bet.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top