Papermill Detection Software

It is not at all surprising in this era of “fake” everything, that there would suddenly be a business need for “fake paper” detection tools. Along with “plagiarism detection” and “image duplication or image manipulation detection”, another potential risk to the integrity of the scientific record that many publishers are now proactively on the lookout for during the manuscript submission process is “papermill detection”.  

According to this COPE blog post on “Potential paper mills“:

“This term describes the process by which manufactured manuscripts are submitted to a journal for a fee on behalf of researchers with the purpose of providing an easy publication for them, or to offer authorship for sale. The concerns with these submissions include faked or manipulated data/images, the use of stock images, substantial authorship changes, and plagiarism, which is not detected because it comes from a translated version of another article.” 

Publishers are already starting to incorporate these tools into their workflows. For example, a year ago, in April 2023, the STM Scientific Integrity hub (that provides tools/services for publishers in a cloud-based environment) launched their papermill detection tool as a:

“stand-alone application that allows publishers to automatically screen uploaded papers against key indicators that suggest that the manuscript has or may have originated from a paper mill”.

This year, in March 2024, Wiley announced that its journals will soon be piloting an “AI-powered Papermill Detection Service” integrated in their manuscript submission system.

Tools like “Papermill Alarm” have been reported on in the literature as far back as 2022:

Else H. ‘Papermill alarm’ software flags potentially fake papers. Nature. 2022 Sep 23. doi: 10.1038/d41586-022-02997-x. Epub ahead of print. PMID: 36151206. https://www.nature.com/articles/d41586-022-02997-x

Although this type of detection is not something that individual authors would need to use pre-emptively, other tools  – like iThenticate, a plagiarism detection tool – are now being subscribed to and made available to all potential authors in the MSK community via the MSK Library.

Questions? Be sure to Ask Us at the MSK Library!

An Article DOI is Not Proof of Digital Preservation

A March 2024 Nature paper entitled “More than 2 million research papers have disappeared from the Internet” shed some much needed light on the fact that just because an article is published in a journal that assigns DOIs (Digital Object Identifiers), authors should NOT automatically assume that this publisher is also taking the needed measures to ensure that the digital content (that its DOIs are pointing to) is being preserved.

And journal content that is not digitally preserved can – and has, as per these articles – gone missing from the Internet:

Learn more about how to ensure that your publications are digitally archived for posterity so that they never disappear from the Internet.

Questions? Ask Us at the MSK Library!

More on Avoiding Plagiarism

As plagiarism continues to be on peoples’ minds, it is probably a good time to take a moment to review the module on ethical writing provided on the website of the U.S. Department of Health and Human Services, Office of Research Integrity (ORI).

As per the ORI website – https://ori.hhs.gov/avoiding-plagiarism-self-plagiarism-and-other-questionable-writing-practices-guide-ethical-writing:

“Avoiding Plagiarism, Self-plagiarism, and Other Questionable Writing Practices: A Guide to Ethical Writing

The purpose of this module is to help students, as well as professionals, identify and prevent questionable practices and to develop an awareness of ethical writing. This guide was written by Miguel Roig, PhD, from St. Johns University with funding from ORI.

This module was originally created in 2003 and revised in 2006 and 2015.”

In addition to the HTML version, there is also a downloadable 71-page PDF version, as well as an abridged HTML version entitled 28 Guidelines at a Glance on Avoiding Plagiarism.

Questions related to citing references that often are asked of librarians are nicely addressed in these four items in particular that have been highlighted below:

From https://ori.hhs.gov/28-guidelines-glance-avoiding-plagiarism:

“Guideline 15: Authors are strongly urged to double-check their citations. Specifically, authors should always ensure that each reference notation appearing in the body of the manuscript corresponds to the correct citation listed in the reference section and vice versa and that each source listed in the reference section has been cited at some point in the manuscript. In addition, authors should also ensure that all elements of a citation (e.g., spelling of authors’ names, volume number of journal, pagination) are derived directly from the original paper, rather than from a citation that appears on a secondary source. Finally, when appropriate, authors should ensure that credit is given to those authors who first reported the phenomenon being studied.

Guideline 16: The references used in a paper should only be those that are directly related to its contents. The intentional inclusion of references of questionable relevance for purposes such as manipulating a journal’s or a paper’s impact factor or a paper’s chances of acceptance, is an unacceptable practice.

Guideline 17: Always cite the actual work that is consulted. When the published paper cannot be obtained, cite the specific version of the material being used whether it is conference presentation, abstract, or an unpublished manuscript. Ensure that the cited work has not been subsequently corrected or retracted.

Guideline 18: Generally, when describing others’ work, do not cite an original paper if you are only relying on a secondary summary of that paper. Doing so is a deceptive practice, reflects poor scholarly standards, and can lead to a flawed description of the work described.”

Questions? Ask Us at the MSK Library.