Similarity index checker. Compiled by Kirsty Meddings, Product Manager at CrossRef

Similarity index checker. Compiled by Kirsty Meddings, Product Manager at CrossRef

—CrossCheck, the plagiarism assessment effort from CrossRef and iParadigms has welcomed its publisher that is 240th and becoming a well established area of the editorial procedure for most journals. CrossCheck users utilize the plagiarism that is iThenticate system to display submitted documents for originality and may quickly inform whether a paper contains passages of text which also come in other magazines or resources.

Whenever a manuscript is very first uploaded to iThenticate, a Similarity Score is came back showing the percentage of text within the uploaded document that matches text various other posted papers or webpages.

The similarity rating could be the thing that is first see whenever a document is prepared and, since it’s an easy task to give attention to this quantity as signifying an issue, a standard concern brand new users for the system ask is ‘what degree of similarity rating shows a problem?’

The solution to this real question is there is absolutely no such thing being a ‘magic number’ which will let you know whether a document contains problematic content. The similarity rating offers you a rough ‘headline’ that ensures heavily duplicated documents are brought right to your attention and enables you to quickly disregard documents with extremely little matches. Beyond that, the rating it self doesn’t provide you with definitive answers and definitely cannot inform you whether you have got an instance of plagiarism.

How come this?

Well, there are certain facets that have to be taken into consideration whenever evaluating a paper’s similarity score that is overall.

Firstly, it is crucial to see the similarity rating is letting you know the amount that is total of text. This can be most likely likely to be consists of a true quantity of smaller matches. It will be possible a 30% rating will grow to be a 30% match to 1 supply, however it’s greatly predisposed that whenever you appear during the reports you’ll find the 30% comprises of a true number of smaller matches, the biggest of which can be simply four or five%.

Needless to say, a paper with six split matches of 5% is possibly as problematic as you which have copied 30% of its content from a solitary supply, however it’s impractical to inform whether here is the situation without taking a look at the reports.

Next, where in actuality the match seems can be more important sometimes than how large the match is. As an example, editors in a few subject matter could be less concerned with sizable matches in techniques parts, where you will find only plenty approaches to explain a specific procedure. A match within the conversation or conclusions without any citation that is appropriate on one other hand, could set security bells ringing though it just makes up half the normal commission for the manuscript.

Likewise, acceptable thresholds for just one style of article may possibly not be suitable for another: Review articles might be likely to have a greater similarity that is overall than initial research articles.

It’s also crucial to remember there might be easy mistakes into the manuscript that is unedited mean matches are found wrongly. The exclude bibliography function of iThenticate hinges on the reference part having a name on its own line in the document. The references will not be excluded if this is omitted from the manuscript.

Similarly, the exclude quotes function searches for quote marks. The system will not recognize it as a quote, even though it might be apparent to the editor due to its layout and reference if the author has not used quotation marks or missed one at the start or end of the passage.

For many of those reasons it is essential to check out the reports as opposed to depend on the similarity rating alone.

Utilising the Information Monitoring Report

The default report in iThenticate may be the Similarity Report. This shows you content matches from highest to lowest. It highlights every area associated with the uploaded manuscript that match several sources in iThenticate’s comprehensive databases and provides you a good indicator of whether or not the paper contains significant sections of duplicate content.

A glance that is quick the Similarity Report are frequently all that is required to confirm a manuscript only contains tiny matches composed of commonly used terms or expressions, or at the worst, poorly cited content that may be corrected. If, but, the Similarity Report identifies more than one matches which can be quite big, or plenty of smaller matches even with the bibliography excluded, the information Tracking report should always be your port that is next of.

Content Tracking compares the uploaded manuscript to one supply at any given time. The Similarity Report combines the most effective matches from multiple sources into an overview, plus in performing this can only just attribute each match to 1 supply with regards to may in fact come in a few.

It is most useful explained utilizing an illustration. State a document posseses a general similarity rating of 25%, comprised into the Similarity Report of 1 match of 20% to supply A and an extra match of 5% to supply B. Switching to information monitoring reveals the next match to supply B is actually 15%, but 10% is a passing of text positioned in the match to source A and is consequently masked by the bigger match. This 10% cannot show as matching both supply papers into the Similarity Report you can toggle between individual sources with a radio button it will be attributed to each source separately because it can only be highlighted once, but in Content Tracking where.

A good example of where this is specially helpful occurs when there clearly was a mix of duplicate or redundant publication and plagiarism that is possible big matches towards the author’s previous work could conceal smaller passages copied off their articles. Content Tracking will set down the extent that is full of with every supply without any masking.

Side-by-Side Comparison

Finally, don’t forget that for just about any match you will see the entire text of this supply article or web site alongside

the manuscript that is uploaded hitting the highlighted passage into the left-hand display screen of either the Similarity Report or Content monitoring. Hitting the hyperlink into the pane that is right-hand simply simply take you to definitely the content or web site in its initial location, and that can be helpful for identifying site matches or checking unknown sources, but just the sideby-side view will highlight the matching passages next to one another.