Using regular expressions for mining data in large software repositories
The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this...
Saved in:
Main Author: | |
---|---|
Format: | Conference or Workshop Item |
Language: | English English |
Published: |
IEEE
2014
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf http://irep.iium.edu.my/42896/ http://ieeexplore.ieee.org/document/7020649/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.iium.irep.42896 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.428962017-09-20T01:07:10Z http://irep.iium.edu.my/42896/ Using regular expressions for mining data in large software repositories Awang Abu Bakar, Normi Sham T Technology (General) The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly. IEEE 2014 Conference or Workshop Item REM application/pdf en http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf application/pdf en http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf Awang Abu Bakar, Normi Sham (2014) Using regular expressions for mining data in large software repositories. In: 2014 The 5th International Conference on Information and Communication Technology for The Muslim World (ICT4M), 17th-18th November 2014, Kuching, Sarawak, Malaysia. http://ieeexplore.ieee.org/document/7020649/ 10.1109/ICT4M.2014.7020649 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English English |
topic |
T Technology (General) |
spellingShingle |
T Technology (General) Awang Abu Bakar, Normi Sham Using regular expressions for mining data in large software repositories |
description |
The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly. |
format |
Conference or Workshop Item |
author |
Awang Abu Bakar, Normi Sham |
author_facet |
Awang Abu Bakar, Normi Sham |
author_sort |
Awang Abu Bakar, Normi Sham |
title |
Using regular expressions for mining data in large software repositories |
title_short |
Using regular expressions for mining data in large software repositories |
title_full |
Using regular expressions for mining data in large software repositories |
title_fullStr |
Using regular expressions for mining data in large software repositories |
title_full_unstemmed |
Using regular expressions for mining data in large software repositories |
title_sort |
using regular expressions for mining data in large software repositories |
publisher |
IEEE |
publishDate |
2014 |
url |
http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf http://irep.iium.edu.my/42896/ http://ieeexplore.ieee.org/document/7020649/ |
_version_ |
1643612278808903680 |
score |
13.160551 |