JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique

In the last decade, the volume of semantic data has increased exponentially, with the number of Resource Description Framework (RDF) datasets exceeding trillions of triples in RDF repositories. Hence, the size of RDF datasets continues to grow. However, with the increasing number of RDF triples, com...

Full description

Saved in:
Bibliographic Details
Main Authors: Nahla Mohammed Elzein, Mazlina Abdul Majid, Ibrahim Abaker Targio Hashem, Ashraf Osman Ibrahim Elsayed, Anas W. Abulfaraj, Faisal Binzagr
Format: Article
Language:English
English
Published: Multidisciplinary Digital Publishing Institute (MDPI) 2023
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/42224/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/42224/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/42224/
https://doi.org/10.3390/math11051275
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.ums.eprints.42224
record_format eprints
spelling my.ums.eprints.422242024-12-10T07:01:16Z https://eprints.ums.edu.my/id/eprint/42224/ JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique Nahla Mohammed Elzein Mazlina Abdul Majid Ibrahim Abaker Targio Hashem Ashraf Osman Ibrahim Elsayed Anas W. Abulfaraj Faisal Binzagr QA75.5-76.95 Electronic computers. Computer science T10.5-11.9 Communication of technical information In the last decade, the volume of semantic data has increased exponentially, with the number of Resource Description Framework (RDF) datasets exceeding trillions of triples in RDF repositories. Hence, the size of RDF datasets continues to grow. However, with the increasing number of RDF triples, complex multiple RDF queries are becoming a significant demand. Sometimes, such complex queries produce many common sub-expressions in a single query or over multiple queries running as a batch. In addition, it is also difficult to minimize the number of RDF queries and processing time for a large amount of related data in a typical distributed environment encounter. To address this complication, we introduce a join query processing model for big RDF data, called JQPro. By adopting a MapReduce framework in JQPro, we developed three new algorithms, which are hash-join, sort-merge, and enhanced MapReduce-join for join query processing of RDF data. Based on an experiment conducted, the result showed that the JQPro model outperformed the two popular algorithms, gStore and RDF-3X, with respect to the average execution time. Furthermore, the JQPro model was also tested against RDF-3X, RDFox, and PARJs using the LUBM benchmark. The result showed that the JQPro model had better performance in comparison with the other models. In conclusion, the findings showed that JQPro achieved improved performance with 87.77% in terms of execution time. Hence, in comparison with the selected models, JQPro performs better. Multidisciplinary Digital Publishing Institute (MDPI) 2023 Article NonPeerReviewed text en https://eprints.ums.edu.my/id/eprint/42224/1/ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/42224/2/FULL%20TEXT.pdf Nahla Mohammed Elzein and Mazlina Abdul Majid and Ibrahim Abaker Targio Hashem and Ashraf Osman Ibrahim Elsayed and Anas W. Abulfaraj and Faisal Binzagr (2023) JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique. Mathematics, 11. pp. 1-20. https://doi.org/10.3390/math11051275
institution Universiti Malaysia Sabah
building UMS Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
url_provider http://eprints.ums.edu.my/
language English
English
topic QA75.5-76.95 Electronic computers. Computer science
T10.5-11.9 Communication of technical information
spellingShingle QA75.5-76.95 Electronic computers. Computer science
T10.5-11.9 Communication of technical information
Nahla Mohammed Elzein
Mazlina Abdul Majid
Ibrahim Abaker Targio Hashem
Ashraf Osman Ibrahim Elsayed
Anas W. Abulfaraj
Faisal Binzagr
JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique
description In the last decade, the volume of semantic data has increased exponentially, with the number of Resource Description Framework (RDF) datasets exceeding trillions of triples in RDF repositories. Hence, the size of RDF datasets continues to grow. However, with the increasing number of RDF triples, complex multiple RDF queries are becoming a significant demand. Sometimes, such complex queries produce many common sub-expressions in a single query or over multiple queries running as a batch. In addition, it is also difficult to minimize the number of RDF queries and processing time for a large amount of related data in a typical distributed environment encounter. To address this complication, we introduce a join query processing model for big RDF data, called JQPro. By adopting a MapReduce framework in JQPro, we developed three new algorithms, which are hash-join, sort-merge, and enhanced MapReduce-join for join query processing of RDF data. Based on an experiment conducted, the result showed that the JQPro model outperformed the two popular algorithms, gStore and RDF-3X, with respect to the average execution time. Furthermore, the JQPro model was also tested against RDF-3X, RDFox, and PARJs using the LUBM benchmark. The result showed that the JQPro model had better performance in comparison with the other models. In conclusion, the findings showed that JQPro achieved improved performance with 87.77% in terms of execution time. Hence, in comparison with the selected models, JQPro performs better.
format Article
author Nahla Mohammed Elzein
Mazlina Abdul Majid
Ibrahim Abaker Targio Hashem
Ashraf Osman Ibrahim Elsayed
Anas W. Abulfaraj
Faisal Binzagr
author_facet Nahla Mohammed Elzein
Mazlina Abdul Majid
Ibrahim Abaker Targio Hashem
Ashraf Osman Ibrahim Elsayed
Anas W. Abulfaraj
Faisal Binzagr
author_sort Nahla Mohammed Elzein
title JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique
title_short JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique
title_full JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique
title_fullStr JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique
title_full_unstemmed JQPro: Join query processing in a distributed system for Big RDF data using the hash-merge join technique
title_sort jqpro: join query processing in a distributed system for big rdf data using the hash-merge join technique
publisher Multidisciplinary Digital Publishing Institute (MDPI)
publishDate 2023
url https://eprints.ums.edu.my/id/eprint/42224/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/42224/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/42224/
https://doi.org/10.3390/math11051275
_version_ 1818835192785141760
score 13.223943