Datasets

This page shows the datasets & codes used in our papers.

#1. Dataset & Codes for a Federated Learning Study — Prophet

Background: This dataset is used for the experiments conducted in the proactive candidate-selection for the synchronous-fashioned Federated Learning. Based on the predicted future conditions of both network connections and computing capabilities within a training round, a good group of candidate participants can be selected to participate in each round of Federated Learning training.
  • This dataset is only a part of the raw data collected by volunteers in our campus from Oct.21~Oct.25, 2019. More details are upcoming soon.
  • The codes of experiments are enclosed in the following ZIP file. Please refer to the methodology in the enclosed “readme.md“.

#2. Dataset & Codes for Predicting Machine Failures

Background: This dataset is to implement the failure prediction using machine learning methods and AI approaches such as SVM, random forest, or deep learning algorithms. Besides the original dataset, I also provide two reports written by two visiting students when they performed a visiting-study in my lab in July 2019.
Huawei Huang, and Song Guo, “Proactive Failure Recovery for NFV in Distributed Edge Computing”, IEEE Communications Magazine, vol. 57, no. 5, pp. 131-137, March 2019
  • The dataset after preprocessing:

  • The related technique reports and codes from two visiting students:


#3. Dataset & Codes for Predicting Server Failures

Background: This dataset is used to predict the failures of server machines that occurred on a datacenter. The related published papers are as follows.

Huakun Huang, Lingjun Zhao, Huawei Huang, Song Guo, "Machine Fault Detection for Intelligent Self-Driving Networks", IEEE Communications Magazine, Vol. 58 , Issue No. 1, pp. 40-46, January 2020  [RG-Page]
Huakun Huang, Shuxue Ding, Lingjun Zhao, Huawei Huang, et al., "Real-Time Fault-Detection for IIoT Facilities using GBRBM-based DNN", IEEE Internet of Things Journal, Oct. 21, 2019. DOI: 10.1109/JIOT.2019.2948396 [RG-Page]
  • Original dataset and cleaned dataset:
  • Processing codes: