Analyzing malware log files for internet access investigation using Hadoop

On the Internet, malicious software (malware) is one of the most serious threats to system security. Major complex issues and problems on any software systems are frequently caused by malware. Malware can infect any computer software that has connection to Internet infrastructure. There are many typ...

Full description

Saved in:
Bibliographic Details
Main Author: Mat Deli, Mohd. Sharudin
Format: Thesis
Language:English
Published: 2017
Subjects:
Online Access:http://eprints.utm.my/id/eprint/78843/1/MohdSharudinMatMAIS2017.pdf
http://eprints.utm.my/id/eprint/78843/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:110301
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.78843
record_format eprints
spelling my.utm.788432018-09-17T04:21:09Z http://eprints.utm.my/id/eprint/78843/ Analyzing malware log files for internet access investigation using Hadoop Mat Deli, Mohd. Sharudin QA76 Computer software On the Internet, malicious software (malware) is one of the most serious threats to system security. Major complex issues and problems on any software systems are frequently caused by malware. Malware can infect any computer software that has connection to Internet infrastructure. There are many types of malware and some of the popular malwares are botnet, trojans, viruses, spyware and adware. Internet users with lesser knowledge on the malware threats are susceptible to this issue. To protect and prevent the computer and internet users from exposing themselves towards malware attacks, identifying the attacks through investigating malware log file is an essential step to curb this threat. The log file exposes crucial information in identifying the malware, such as algorithm and functional characteristic, the network interaction between the source and the destination, and type of malware. By nature, the log file size is humongous and requires the investigation process to be executed on faster and stable platform such as big data environment. In this study, the authors had adopted Hadoop, an open source software framework to process and extract the information from the malware log files that obtains from university’s security equipment. The Python program was used for data transformation then analysis it in Hadoop simulation environment. The analysis includes assessing reduction of log files size, performance of execution time and data visualization using Microsoft Power BI (Business Intelligence). The results of log processing have reduced 50% of the original log file size, while the total execution time would not increase linearly with the size of the data. The information will be used for further prevention and protection from malware threats in university’s network. 2017-12 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/78843/1/MohdSharudinMatMAIS2017.pdf Mat Deli, Mohd. Sharudin (2017) Analyzing malware log files for internet access investigation using Hadoop. Masters thesis, Universiti Teknologi Malaysia, Advanced Informatics School. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:110301
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA76 Computer software
spellingShingle QA76 Computer software
Mat Deli, Mohd. Sharudin
Analyzing malware log files for internet access investigation using Hadoop
description On the Internet, malicious software (malware) is one of the most serious threats to system security. Major complex issues and problems on any software systems are frequently caused by malware. Malware can infect any computer software that has connection to Internet infrastructure. There are many types of malware and some of the popular malwares are botnet, trojans, viruses, spyware and adware. Internet users with lesser knowledge on the malware threats are susceptible to this issue. To protect and prevent the computer and internet users from exposing themselves towards malware attacks, identifying the attacks through investigating malware log file is an essential step to curb this threat. The log file exposes crucial information in identifying the malware, such as algorithm and functional characteristic, the network interaction between the source and the destination, and type of malware. By nature, the log file size is humongous and requires the investigation process to be executed on faster and stable platform such as big data environment. In this study, the authors had adopted Hadoop, an open source software framework to process and extract the information from the malware log files that obtains from university’s security equipment. The Python program was used for data transformation then analysis it in Hadoop simulation environment. The analysis includes assessing reduction of log files size, performance of execution time and data visualization using Microsoft Power BI (Business Intelligence). The results of log processing have reduced 50% of the original log file size, while the total execution time would not increase linearly with the size of the data. The information will be used for further prevention and protection from malware threats in university’s network.
format Thesis
author Mat Deli, Mohd. Sharudin
author_facet Mat Deli, Mohd. Sharudin
author_sort Mat Deli, Mohd. Sharudin
title Analyzing malware log files for internet access investigation using Hadoop
title_short Analyzing malware log files for internet access investigation using Hadoop
title_full Analyzing malware log files for internet access investigation using Hadoop
title_fullStr Analyzing malware log files for internet access investigation using Hadoop
title_full_unstemmed Analyzing malware log files for internet access investigation using Hadoop
title_sort analyzing malware log files for internet access investigation using hadoop
publishDate 2017
url http://eprints.utm.my/id/eprint/78843/1/MohdSharudinMatMAIS2017.pdf
http://eprints.utm.my/id/eprint/78843/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:110301
_version_ 1643658021819121664
score 13.15806