Explaining Machine Learning DGA Detectors from DNS Traffic Data

Published in In the proceedings of Proceedings of the Italian Conference on Cybersecurity (ITASEC 2022), Rome, Italy, June 20-23, 2022, 2022

Recommended citation: Giorgio Piras, Maura Pintor, Luca Demetrio, Battista Biggio, "Explaining Machine Learning DGA Detectors from DNS Traffic Data." In the proceedings of Proceedings of the Italian Conference on Cybersecurity (ITASEC 2022), Rome, Italy, June 20-23, 2022, 2022. http://ceur-ws.org/Vol-3260/paper11.pdf

Abstract:

One of the most common causes of lack of continuity of online systems stems from a widely popular Cyber Attack known as Distributed Denial of Service (DDoS), in which a network of infected devices (botnet) gets exploited to flood the computational capacity of services through the commands of an attacker. This attack is made by leveraging the Domain Name System (DNS) technology through Domain Generation Algorithms (DGAs), a stealthy connection strategy that yet leaves suspicious data patterns. To detect such threats, advances in their analysis have been made. For the majority, they found Machine Learning (ML) as a solution, which can be highly effective in analyzing and classifying massive amounts of data. Although strongly performing, ML models have a certain degree of obscurity in their decision-making process. To cope with this problem, a branch of ML known as Explainable ML tries to break down the black-box nature of classifiers and make them interpretable and human-readable. This work addresses the problem of Explainable ML in the context of botnet and DGA detection, which at the best of our knowledge, is the first to concretely break down the decisions of ML classifiers when devised for botnet/DGA detection, therefore providing global and local explanations.

Access paper here

BibTeX:

@inproceedings{piras22explaining,
author = {Piras, Giorgio and Pintor, Maura and Demetrio, Luca and Biggio, Battista},
editor = {Demetrescu, Camil and Mei, Alessandro},
title = {Explaining Machine Learning {DGA} Detectors from {DNS} Traffic Data},
booktitle = {Proceedings of the Italian Conference on Cybersecurity {(ITASEC} 2022), Rome, Italy, June 20-23, 2022},
series = {CEUR Workshop Proceedings},
volume = {3260},
pages = {150–168},
publisher = {CEUR-WS.org},
year = {2022},
url = {http://ceur-ws.org/Vol-3260/paper11.pdf},
timestamp = {Sat, 05 Nov 2022 00:03:03 +0100},
biburl = {https://dblp.org/rec/conf/itasec/PirasPDB22.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}