IRO Journals

Journal of Artificial Intelligence and Capsule Networks

Sentiment Analysis of Nepali COVID19 Tweets Using NB, SVM AND LSTM
Volume-3 | Issue-3

Blockchain-Enabled Federated Learning on Kubernetes for Air Quality Prediction Applications
Volume-3 | Issue-3

Smart Fashion: A Review of AI Applications in Virtual Try-On & Fashion Synthesis
Volume-3 | Issue-4

Deniable Authentication Encryption for Privacy Protection using Blockchain
Volume-3 | Issue-3

Hybrid Parallel Image Processing Algorithm for Binary Images with Image Thinning Technique
Volume-3 | Issue-3

Smart Medical Nursing Care Unit based on Internet of Things for Emergency Healthcare
Volume-3 | Issue-4

QoS-aware Virtual Machine (VM) for Optimal Resource Utilization and Energy Conservation
Volume-3 | Issue-3

Probabilistic Neural Network based Managing Algorithm for Building Automation System
Volume-3 | Issue-4

Fusion based Feature Extraction Analysis of ECG Signal Interpretation - A Systematic Approach
Volume-3 | Issue-1

Review on Image Recoloring Methods for Efficient Naturalness by Coloring Data Modeling Methods for Low Visual Deficiency
Volume-3 | Issue-3

Smart Fashion: A Review of AI Applications in Virtual Try-On & Fashion Synthesis
Volume-3 | Issue-4

Deniable Authentication Encryption for Privacy Protection using Blockchain
Volume-3 | Issue-3

Real Time Anomaly Detection Techniques Using PySpark Frame Work
Volume-2 | Issue-1

Sentiment Analysis of Nepali COVID19 Tweets Using NB, SVM AND LSTM
Volume-3 | Issue-3

Audio Tagging Using CNN Based Audio Neural Networks for Massive Data Processing
Volume-3 | Issue-4

Frontiers of AI beyond 2030: Novel Perspectives
Volume-4 | Issue-4

Smart Medical Nursing Care Unit based on Internet of Things for Emergency Healthcare
Volume-3 | Issue-4

Early Stage Detection of Crack in Glasses by Hybrid CNN Transformation Approach
Volume-3 | Issue-4

Artificial Intelligence Algorithm with SVM Classification using Dermascopic Images for Melanoma Diagnosis
Volume-3 | Issue-1

An Efficient Machine Learning based Model for Classification of Wearable Clothing
Volume-3 | Issue-4

Home / Archives / Volume-2 / Issue-3 / Article-5

Volume - 2 | Issue - 3 | september 2020

Plagiarism Detection in Programming Assignments using Machine Learning
Nishesh Awale, Mitesh Pandey, Anish Dulal, Bibek Timsina  271  184
Pages: 177-184
Cite this article
Awale, N., Pandey, M., Dulal, A. & Timsina, B. (2020). Plagiarism Detection in Programming Assignments using Machine Learning. Journal of Artificial Intelligence and Capsule Networks, 2(3), 177-184. doi:10.36548/jaicn.2020.3.005
Published
21 July, 2020
Abstract

Plagiarism in programming assignments has been increasing these days which affects the evaluation of students. Thispaper proposes a machine learning approach for plagiarism detection of programming assignments. Different features related to source code are computed based on similarity score of n-grams, code style similarity and dead codes. Then, xgboost model is used for training and predicting whether a pair of source code are plagiarised or not. Many plagiarism techniques ignores dead codes such as unused variables and functions in their predictions tasks. But number of unused variables and functions in the source code are considered in this paper. Using our features, the model achieved an accuracy score of 94% and average f1-score of 0.905 on the test set. We also compared the result of xgboost model with support vector machines(SVM) and report that xgboost model performed better on our dataset.

Keywords

Source Code Plagiarism Detection Xgboost Programming Assignments Plagiarism Source Code Features

Full Article PDF Download Article PDF 
×

Currently, subscription is the only source of revenue. The subscription resource covers the operating expenses such as web presence, online version, pre-press preparations, and staff wages.

Subscription Payment Details

townscript (INR / USD): click here

Subscription Fee

Annual Subscription 15,000 INR / 200 USD
Subscription form: click here