Benchmark Static Api Call Datasets for Malware Family Classification

dc.authorscopusid 57885628000
dc.authorscopusid 57370585700
dc.authorscopusid 57219836294
dc.authorscopusid 57887008300
dc.authorscopusid 56497768800
dc.authorscopusid 6507328166
dc.contributor.author Gencaydin, B.
dc.contributor.author Demirkıran, Ferhat
dc.contributor.author Kahya, C.N.
dc.contributor.author Demirkiran, F.
dc.contributor.author Duzgun, B.
dc.contributor.author Cayir, A.
dc.contributor.author Dag, H.
dc.contributor.other Management Information Systems
dc.date.accessioned 2023-10-19T15:05:38Z
dc.date.available 2023-10-19T15:05:38Z
dc.date.issued 2022
dc.department-temp Gencaydin, B., Computer Engineering Gebze Technical University, Kocaeli, Turkey; Kahya, C.N., Management Information Systems Kadir Has University, Istanbul, Turkey; Demirkiran, F., Kadir Has University, Department of Cyber Security, Istanbul, Turkey; Duzgun, B., Computer Engineering Gebze Technical University, Kocaeli, Turkey; Cayir, A., Huawei R&d Center, Istanbul, Turkey; Dag, H., Management Information Systems Kadir Has University, Istanbul, Turkey en_US
dc.description 7th International Conference on Computer Science and Engineering, UBMK 2022 --14 September 2022 through 16 September 2022 -- --183844 en_US
dc.description.abstract Nowadays, malware and malware incidents are increasing daily, even with various antivirus systems and malware detection or classification methodologies. Machine learning techniques have been the main focus of the security experts to detect malware and determine their families. Many static, dynamic, and hybrid techniques have been presented for that purpose. In this study, the static analysis technique has been applied to malware samples to extract API calls, which is one of the most used features in machine/deep learning models as it represents the behavior of malware samples. Since the rapid increase and continuous evolution of malware affect the detection capacity of antivirus scanners, recent and updated datasets of malicious software became necessary to overcome this drawback. This paper introduces two new datasets: One with 14,616 samples obtained and compiled from VirusShare and one with 9,795 samples from VirusSample. In addition, benchmark results based on static API calls of malware samples are presented using several machine and deep learning models on these datasets. We believe that these two datasets and benchmark results enable researchers to test and validate their methods and approaches in this field. © 2022 IEEE. en_US
dc.description.sponsorship Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK: 118E400 en_US
dc.description.sponsorship ACKNOWLEDGMENT This work is supported by The Scientific and Technological Research Council of Turkey under the grant number 118E400. en_US
dc.identifier.citationcount 0
dc.identifier.doi 10.1109/UBMK55850.2022.9919580 en_US
dc.identifier.endpage 141 en_US
dc.identifier.isbn 9781665470100
dc.identifier.scopus 2-s2.0-85141884823 en_US
dc.identifier.startpage 137 en_US
dc.identifier.uri https://doi.org/10.1109/UBMK55850.2022.9919580
dc.identifier.uri https://hdl.handle.net/20.500.12469/4975
dc.khas 20231019-Scopus en_US
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof Proceedings - 7th International Conference on Computer Science and Engineering, UBMK 2022 en_US
dc.relation.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.scopus.citedbyCount 3
dc.subject API call en_US
dc.subject dataset en_US
dc.subject deep learning en_US
dc.subject machine learning en_US
dc.subject Malware en_US
dc.subject Classification (of information) en_US
dc.subject Deep learning en_US
dc.subject Learning systems en_US
dc.subject Malware en_US
dc.subject Anti-virus systems en_US
dc.subject API calls en_US
dc.subject Dataset en_US
dc.subject Deep learning en_US
dc.subject Learning models en_US
dc.subject Machine-learning en_US
dc.subject Malware classifications en_US
dc.subject Malware detection en_US
dc.subject Malware families en_US
dc.subject Malwares en_US
dc.subject Static analysis en_US
dc.title Benchmark Static Api Call Datasets for Malware Family Classification en_US
dc.type Conference Object en_US
dspace.entity.type Publication
relation.isAuthorOfPublication 695a8adc-2330-4d32-ab37-8b781716d609
relation.isAuthorOfPublication.latestForDiscovery 695a8adc-2330-4d32-ab37-8b781716d609
relation.isOrgUnitOfPublication ff62e329-217b-4857-88f0-1dae00646b8c
relation.isOrgUnitOfPublication.latestForDiscovery ff62e329-217b-4857-88f0-1dae00646b8c

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
4975.pdf
Size:
392.58 KB
Format:
Adobe Portable Document Format
Description:
Tam Metin / Full Text