GPTSniffer: A CodeBERT-based classifier to detect source code written by ChatGPT

IRIS

Since its launch in November 2022, ChatGPT has gained popularity among users, especially programmers who use it to solve development issues. However, while offering a practical solution to programming problems, ChatGPT should be used primarily as a supporting tool (e.g., in software education) rather than as a replacement for humans. Thus, detecting automatically generated source code by ChatGPT is necessary, and tools for identifying AI -generated content need to be adapted to work effectively with code. This paper presents GPTSniffer- a novel approach to the detection of source code written by AI - built on top of CodeBERT. We conducted an empirical study to investigate the feasibility of automated identification of AI -generated code, and the factors that influence this ability. The results show that GPTSniffer can accurately classify whether code is human -written or AI -generated, outperforming two baselines, GPTZero and OpenAI Text Classifier. Also, the study shows how similar training data or a classification context with paired snippets helps boost the prediction. We conclude that GPTSniffer can be leveraged in different contexts, e.g., in software engineering education, where teachers use the tool to detect cheating and plagiarism, or in development, where AI -generated code may require peculiar quality assurance activities.

GPTSniffer: A CodeBERT-based classifier to detect source code written by ChatGPT

Nguyen, Phuong;Di Rocco, Juri;Di Sipio, Claudio;Rubei, Riccardo;Di Ruscio, Davide;Di Penta, Massimiliano

2024-01-01

Abstract

Since its launch in November 2022, ChatGPT has gained popularity among users, especially programmers who use it to solve development issues. However, while offering a practical solution to programming problems, ChatGPT should be used primarily as a supporting tool (e.g., in software education) rather than as a replacement for humans. Thus, detecting automatically generated source code by ChatGPT is necessary, and tools for identifying AI -generated content need to be adapted to work effectively with code. This paper presents GPTSniffer- a novel approach to the detection of source code written by AI - built on top of CodeBERT. We conducted an empirical study to investigate the feasibility of automated identification of AI -generated code, and the factors that influence this ability. The results show that GPTSniffer can accurately classify whether code is human -written or AI -generated, outperforming two baselines, GPTZero and OpenAI Text Classifier. Also, the study shows how similar training data or a classification context with paired snippets helps boost the prediction. We conclude that GPTSniffer can be leveraged in different contexts, e.g., in software engineering education, where teachers use the tool to detect cheating and plagiarism, or in development, where AI -generated code may require peculiar quality assurance activities.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Rivista
	
				THE JOURNAL OF SYSTEMS AND SOFTWARE
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.jss.2024.112059
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0164121224001043-main.pdf accesso aperto Tipologia: Documento in Versione Editoriale Licenza: Creative commons Dimensione 2.29 MB Formato Adobe PDF Visualizza/Apri	2.29 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11697/242199

Citazioni

ND

9

6

social impact