Enhancing generic pipeline model for code clone detection using divide and conquer approach

Code clone is known as identical copies of the same instances or fragments of source codes in software. Current code clone research focuses on the detection and analysis of code clones in order to help software developers identify code clones in source codes and reuse the source codes in order to de...

Full description

Saved in:
Bibliographic Details
Main Authors: Mubarak-Ali, Al-Fahim, Syed-Mohamad, Sharifah, Sulaiman, Shahida
Format: Article
Published: Zarka Private Univ 2015
Subjects:
Online Access:http://eprints.utm.my/id/eprint/55031/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.55031
record_format eprints
spelling my.utm.550312017-02-15T07:28:32Z http://eprints.utm.my/id/eprint/55031/ Enhancing generic pipeline model for code clone detection using divide and conquer approach Mubarak-Ali, Al-Fahim Syed-Mohamad, Sharifah Sulaiman, Shahida QA75 Electronic computers. Computer science Code clone is known as identical copies of the same instances or fragments of source codes in software. Current code clone research focuses on the detection and analysis of code clones in order to help software developers identify code clones in source codes and reuse the source codes in order to decrease the maintenance cost. Many approaches such as textual based comparison approach, token based comparison and tree based comparison approach have been used to detect code clones. As software grows and becomes a legacy system, the complexity of these approaches in detecting code clones increases. Thus, this scenario makes it more difficult to detect code clones. Generic pipeline model is the most recent code clone detection that comprises five processes which are parsing process, pre-processing process, pooling process, comparing processes and filtering process to detect code clone. This research highlights the enhancement of the generic pipeline model using divide and conquer approach that involves concatenation process. The aim of this approach is to produce a better input for the generic pipeline model by processing smaller part of source code files before focusing on the large chunk of source codes in a single pipeline. We implement and apply the proposed approach with the support of a tool called Java Code Clone Detector (JCCD). The result obtained shows an improvement in the rate of code clone detection and overall runtime performance as compared to the existing generic pipeline model. Zarka Private Univ 2015-09 Article PeerReviewed Mubarak-Ali, Al-Fahim and Syed-Mohamad, Sharifah and Sulaiman, Shahida (2015) Enhancing generic pipeline model for code clone detection using divide and conquer approach. International Arab Journal of Information Technology, 12 (5). pp. 510-517. ISSN 1683-3198
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Mubarak-Ali, Al-Fahim
Syed-Mohamad, Sharifah
Sulaiman, Shahida
Enhancing generic pipeline model for code clone detection using divide and conquer approach
description Code clone is known as identical copies of the same instances or fragments of source codes in software. Current code clone research focuses on the detection and analysis of code clones in order to help software developers identify code clones in source codes and reuse the source codes in order to decrease the maintenance cost. Many approaches such as textual based comparison approach, token based comparison and tree based comparison approach have been used to detect code clones. As software grows and becomes a legacy system, the complexity of these approaches in detecting code clones increases. Thus, this scenario makes it more difficult to detect code clones. Generic pipeline model is the most recent code clone detection that comprises five processes which are parsing process, pre-processing process, pooling process, comparing processes and filtering process to detect code clone. This research highlights the enhancement of the generic pipeline model using divide and conquer approach that involves concatenation process. The aim of this approach is to produce a better input for the generic pipeline model by processing smaller part of source code files before focusing on the large chunk of source codes in a single pipeline. We implement and apply the proposed approach with the support of a tool called Java Code Clone Detector (JCCD). The result obtained shows an improvement in the rate of code clone detection and overall runtime performance as compared to the existing generic pipeline model.
format Article
author Mubarak-Ali, Al-Fahim
Syed-Mohamad, Sharifah
Sulaiman, Shahida
author_facet Mubarak-Ali, Al-Fahim
Syed-Mohamad, Sharifah
Sulaiman, Shahida
author_sort Mubarak-Ali, Al-Fahim
title Enhancing generic pipeline model for code clone detection using divide and conquer approach
title_short Enhancing generic pipeline model for code clone detection using divide and conquer approach
title_full Enhancing generic pipeline model for code clone detection using divide and conquer approach
title_fullStr Enhancing generic pipeline model for code clone detection using divide and conquer approach
title_full_unstemmed Enhancing generic pipeline model for code clone detection using divide and conquer approach
title_sort enhancing generic pipeline model for code clone detection using divide and conquer approach
publisher Zarka Private Univ
publishDate 2015
url http://eprints.utm.my/id/eprint/55031/
_version_ 1643653673459384320
score 13.211869