Implementation of Token Parsing Technique for Regex Based Classification of Unstructured Data for Cyber Threat Analysis

Data handling; Engines; Information use; Pattern matching; Cyber threats; Public resources; Structured data; Threat analysis; Unstructured data; Classification (of information)

Bibliographic Details
Main Authors: Mohd Pakhari M.H., Jamil N., Rusli M.E., Abdul Rahim A.A.
Other Authors: 57220805194
Format: Conference Paper
Published: Institute of Electrical and Electronics Engineers Inc. 2023
id my.uniten.dspace-25340
record_format dspace
spelling my.uniten.dspace-253402023-05-29T16:08:19Z Implementation of Token Parsing Technique for Regex Based Classification of Unstructured Data for Cyber Threat Analysis Mohd Pakhari M.H. Jamil N. Rusli M.E. Abdul Rahim A.A. 57220805194 36682671900 16246214600 57220806943 Data handling; Engines; Information use; Pattern matching; Cyber threats; Public resources; Structured data; Threat analysis; Unstructured data; Classification (of information) Cyber Threat Intelligence (CTI) is information about cyber threats that has been analysed, structured, and refined. Organizations use it to understand current risks of differing severity that could harm their enterprises, to plan defensive countermeasures, and to protect themselves from damaging attacks. In this paper, we introduce a token parsing technique for regex-based classification of unstructured data in a cyber threat analytic (CTA) engine that performs threat analysis on data crawled from several public resources. The engine crawls and fetches data from the public resources as a time series, analyses the data, and presents meaningful information to the user along with a timeline of the fetched parameter. The collected data, which arrives unstructured, is converted by the engine into structured data and inserted into the database. The engine then analyses the threat data by modelling it before returning useful information to the user. The challenge is to obtain structured data that is useful for analysis. This paper explains how our token parsing technique supports regex-based classification in converting unstructured data into useful structured data. © 2020 IEEE.
Final 2023-05-29T08:08:19Z 2023-05-29T08:08:19Z 2020 Conference Paper 10.1109/ICIMU49871.2020.9243415 2-s2.0-85097642842 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85097642842&doi=10.1109%2fICIMU49871.2020.9243415&partnerID=40&md5=72dc48523a5d04414d202f37cb40d776 https://irepository.uniten.edu.my/handle/123456789/25340 9243415 395 398 Institute of Electrical and Electronics Engineers Inc. Scopus
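The abstract above describes converting unstructured text into structured records by parsing tokens and classifying them with regular expressions. The record does not include the paper's actual patterns or data model, so the following is only a minimal sketch of the general idea. The token classes, the regexes, the sample line, and the function names (`tokenize`, `classify`, `to_record`) are all assumptions made for illustration, not the authors' implementation.

```python
import re
from typing import Dict, List

# Hypothetical token classes; the abstract does not list the paper's patterns.
# Order matters: a token takes the first label whose pattern fully matches it.
OCTET = r"(?:25[0-5]|2[0-4]\d|1?\d?\d)"
PATTERNS = [
    ("timestamp", re.compile(r"\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}Z")),
    ("cve", re.compile(r"CVE-\d{4}-\d{4,}", re.IGNORECASE)),
    ("url", re.compile(r"https?://\S+", re.IGNORECASE)),
    ("ipv4", re.compile(rf"(?:{OCTET}\.){{3}}{OCTET}")),
    ("sha256", re.compile(r"[0-9a-fA-F]{64}")),
    ("md5", re.compile(r"[0-9a-fA-F]{32}")),
]

# Punctuation trimmed from both ends of each token (an assumed choice).
STRIP_CHARS = ",;()[]<>\"'"


def tokenize(line: str) -> List[str]:
    """Split on whitespace and trim surrounding punctuation from each token."""
    tokens = (raw.strip(STRIP_CHARS) for raw in line.split())
    return [tok for tok in tokens if tok]


def classify(token: str) -> str:
    """Return the label of the first pattern that matches the whole token."""
    for label, pattern in PATTERNS:
        if pattern.fullmatch(token):
            return label
    return "other"


def to_record(line: str) -> Dict[str, List[str]]:
    """Group the classified tokens of one raw line into a structured record."""
    record: Dict[str, List[str]] = {}
    for token in tokenize(line):
        label = classify(token)
        if label != "other":
            record.setdefault(label, []).append(token)
    return record


if __name__ == "__main__":
    # Invented sample line, using reserved documentation addresses.
    sample = (
        "2020-08-24T10:15:00Z malware beacon from 203.0.113.7 to "
        "http://example.test/payload.bin, md5 "
        "d41d8cd98f00b204e9800998ecf8427e (CVE-2020-0601)"
    )
    print(to_record(sample))
```

A record shaped like this could then be written to a database row keyed by its timestamp, which is one plausible way to support the timeline view the abstract mentions. Matching whole tokens with `fullmatch`, rather than searching the raw line, keeps each pattern simple and makes the classification of any single token easy to test.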
institution Universiti Tenaga Nasional
building UNITEN Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tenaga Nasional
content_source UNITEN Institutional Repository
url_provider http://dspace.uniten.edu.my/
description Data handling; Engines; Information use; Pattern matching; Cyber threats; Public resources; Structured data; Threat analysis; Unstructured data; Classification (of information)
author2 57220805194
author_facet 57220805194
Mohd Pakhari M.H.
Jamil N.
Rusli M.E.
Abdul Rahim A.A.
format Conference Paper
author Mohd Pakhari M.H.
Jamil N.
Rusli M.E.
Abdul Rahim A.A.
spellingShingle Mohd Pakhari M.H.
Jamil N.
Rusli M.E.
Abdul Rahim A.A.
Implementation of Token Parsing Technique for Regex Based Classification of Unstructured Data for Cyber Threat Analysis
author_sort Mohd Pakhari M.H.
title Implementation of Token Parsing Technique for Regex Based Classification of Unstructured Data for Cyber Threat Analysis
title_sort implementation of token parsing technique for regex based classification of unstructured data for cyber threat analysis
publisher Institute of Electrical and Electronics Engineers Inc.
publishDate 2023