{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Introduction\n", "\n", "In this workshop, we will go through the steps of training, debugging, deploying and monitoring a **network traffic classification model**.\n", "\n", "For training our model we will be using datasets CSE-CIC-IDS2018 by CIC and ISCX which are used for security testing and malware prevention.\n", "These datasets include a huge amount of raw network traffic logs, plus pre-processed data where network connections have been reconstructed and relevant features have been extracted using CICFlowMeter, a tool that outputs network connection features as CSV files. Each record is classified as benign traffic, or it can be malicious traffic, with a total number of 15 classes.\n", "\n", "Starting from this featurized dataset, we have executed additional pre-processing for the purpose of this lab:\n", "