Using regular expressions for mining data in large software repositories

The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this...

Full description

Saved in:
Bibliographic Details
Main Author: Awang Abu Bakar, Normi Sham
Format: Conference or Workshop Item
Language:English
English
Published: IEEE 2014
Subjects:
Online Access:http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
http://irep.iium.edu.my/42896/
http://ieeexplore.ieee.org/document/7020649/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.iium.irep.42896
record_format dspace
spelling my.iium.irep.428962017-09-20T01:07:10Z http://irep.iium.edu.my/42896/ Using regular expressions for mining data in large software repositories Awang Abu Bakar, Normi Sham T Technology (General) The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly. IEEE 2014 Conference or Workshop Item REM application/pdf en http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf application/pdf en http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf Awang Abu Bakar, Normi Sham (2014) Using regular expressions for mining data in large software repositories. In: 2014 The 5th International Conference on Information and Communication Technology for The Muslim World (ICT4M), 17th-18th November 2014, Kuching, Sarawak, Malaysia. http://ieeexplore.ieee.org/document/7020649/ 10.1109/ICT4M.2014.7020649
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
English
topic T Technology (General)
spellingShingle T Technology (General)
Awang Abu Bakar, Normi Sham
Using regular expressions for mining data in large software repositories
description The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly.
format Conference or Workshop Item
author Awang Abu Bakar, Normi Sham
author_facet Awang Abu Bakar, Normi Sham
author_sort Awang Abu Bakar, Normi Sham
title Using regular expressions for mining data in large software repositories
title_short Using regular expressions for mining data in large software repositories
title_full Using regular expressions for mining data in large software repositories
title_fullStr Using regular expressions for mining data in large software repositories
title_full_unstemmed Using regular expressions for mining data in large software repositories
title_sort using regular expressions for mining data in large software repositories
publisher IEEE
publishDate 2014
url http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
http://irep.iium.edu.my/42896/
http://ieeexplore.ieee.org/document/7020649/
_version_ 1643612278808903680
score 13.160551