Exploring permissions in android applications using ensemble-based extra tree feature selection

The fast development of mobile apps and its usage has led to an increase the risk of exploiting user privacy. One method used in the Android security mechanism is permission control that restricts the access of apps to core facilities of devices. However, that permissions could be exploited by attac...

Full description

Saved in:
Bibliographic Details
Main Authors: Alkaaf, Howida Abuabker, Ali, Aida, Shamsuddin, Siti Mariyam, Hassan, Shafaatunnur
Format: Article
Language:English
Published: Institute of Advanced Engineering and Science 2020
Subjects:
Online Access:http://eprints.utm.my/id/eprint/91409/1/AidaAli2020_ExploringPermissionsinAndroidApplicationsusingEnsemble-Based.pdf
http://eprints.utm.my/id/eprint/91409/
http://dx.doi.org/10.11591/ijeecs.v19.i1.pp543-552
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The fast development of mobile apps and its usage has led to an increase the risk of exploiting user privacy. One method used in the Android security mechanism is permission control that restricts the access of apps to core facilities of devices. However, that permissions could be exploited by attackers when granting certain combinations of permissions. So, this paper aims to explore the pattern of malware apps based on analyzing permissions by proposing a framework utilizing feature selection based on ensemble extra tree classifier method and machine learning classifier. The used dataset had 25458 samples (8643 malware apps & 16815 benign apps) with 173 features. Three datasets with 25458 samples and 5, 10 and 20 features respectively were generated after using the proposed feature selection method. All the dataset was fed to machine learning. Support Vector Machine (SVM), K Neighbors Classifier, Decision Tree, Naïve Bayes and Multilayer Perceptron (MLP) classifiers were used. The classifiers models were evaluated using true negative rate (TNR), false positive rate (FNR) and accuracy metrics. The experimental results obtained showed that Support Vector Machine and KNeighbors Classifiers with 20 features achieved the highest accuracy with 94 % and TNR with a rate of 89 % using the KNeighbors Classifier. The FNR rate is dropped to 0.001 using 5 features with Support Vector Machine (SVM) and Multilayer Perceptrons (MLP) classifiers. The result indicated that reducing permission features improved the performance of classification and reduced the computational overhead.