Dataset of Publication "Malware Communication in Smart Factories: A Network Traffic Data Set"
Description
Note: If you use this dataset, please cite the following paper:
Brenner, B., Fabini, J., Offermanns, M., Semper, S., & Zseby, T. (2024). Malware communication in smart factories: A network traffic data set. Computer Networks, 255, 110804.
or in BibTeX:
@article{brenner2024malware,
title={Malware communication in smart factories: A network traffic data set},
author={Brenner, Bernhard and Fabini, Joachim and Offermanns, Magnus and Semper, Sabrina and Zseby, Tanja},
journal={Computer Networks},
volume={255},
pages={110804},
year={2024},
publisher={Elsevier}
}
Context and methodology
Machine learning-based intrusion detection requires suitable and realistic data sets for training and testing. However, data sets that originate from real networks are rare. Network data is considered privacy-sensitive, and the purposeful introduction of malicious traffic is usually not possible.
In this paper, we introduce a labeled data set captured at a smart factory located in Vienna, Austria, during normal operation and during penetration tests with different attack types. The data set contains 173 GB of PCAP files, representing 16 days (395 hours) of factory operation. It includes MQTT, OPC UA, and Modbus/TCP traffic.
The captured malicious traffic originated from a professional penetration tester who performed two types of attacks:
(a) Aggressive attacks that are easier to detect.
(b) Stealthy attacks that are harder to detect.
Our data set includes the raw PCAP files and extracted flow data. Labels for packets and flows indicate whether they originated from a specific attack or from benign communication.
We describe the methodology for creating the dataset, conduct an analysis of the data, and provide detailed information about the recorded traffic itself. The dataset is freely available to support reproducible research and the comparability of results in the area of intrusion detection in industrial networks.
Technical details
- readme.txt
- Information about the data collection, format, necessary software and versions to access it.
- license.txt:
- Licensing information.
- a_day1, a_day2, s_day1, s_day2, tf_a, and tf_s:
- Main dataset, where files starting with "tf" are training files containing only benign,
operational data. All other files are attack files containing both operational data and
attack data.
- Main dataset, where files starting with "tf" are training files containing only benign,
- images.zip:
- Contains descriptive images about the data.
- extractions.zip:
- Contains extracted packets and flows in both labeled and unlabeled form.
- a_day_tuesday_dos.zip:
- An additional day of attack traffic containing benign and attack data, including a DoS attack. This day is not labeled.
- list_of_extracted_features:
- A complete list of features we extracted from the PCAP Files. All flow files contain these features.
- list_of_identified_protocols.csv:
- A complete list of all protocols that we could identify within the PCAP files provided.
Files
a_day1.zip
Files
(136.6 GiB)
Name | Size | |
---|---|---|
md5:4f64a19cdf5d0806512cb2cb34bb09a5
|
2.8 GiB | Preview Download |
md5:e64e23cc64756365a207292bc11c8fc3
|
1.9 GiB | Preview Download |
md5:31c7c2d1186095372e917885fb0c7e0e
|
21.2 GiB | Preview Download |
md5:80e6f68d3a5024131fd7d8b69f788f20
|
5.2 GiB | Preview Download |
md5:ac225b1d33d74663cb13b5b9000098cd
|
4.3 MiB | Preview Download |
md5:da37ec7dd605f6cfdc1f038c469f5c9d
|
1.3 KiB | Preview Download |
md5:966b08996198c62e25825e6140936fe9
|
179 Bytes | Preview Download |
md5:713611bf677326992b194f46835054ed
|
1.1 KiB | Preview Download |
md5:ecad1cbe916cbce4a100f042b2299593
|
4.8 KiB | Preview Download |
md5:cad30e6c1d0e7feeb06c41e3674974d0
|
5.5 GiB | Preview Download |
md5:94c35bf961d07abb2617080fe4783d57
|
6.0 GiB | Preview Download |
md5:e1e689c877b45858fc6c41fd000f85da
|
40.3 GiB | Preview Download |
md5:1a809db9022a5073ff4f30f00639addb
|
53.7 GiB | Preview Download |
Additional details
Related works
- Is documented by
- Journal Article: 10.1016/j.comnet.2024.110804 (DOI)
Dates
- Submitted
-
2024-08-11First Submission of Paper. Dataset available for Editors and Reviewers only.