Abstract
Network traffic classification is a vital task for service operators, network engineers, and security specialists to manage network
traffic, design networks, and detect threats. Identifying the type/name of applications that generate traffic is a challenging task as
encrypting traffic becomes the norm for Internet communication. -erefore, relying on conventional techniques such as deep
packet inspection (DPI) or port numbers is not efficient anymore. -is paper proposes a novel flow statistical-based set of features
that may be used for classifying applications by leveraging machine learning algorithms to yield high accuracy in identifying the
type of applications that generate the traffic. -e proposed features compute different timings between packets and flows. -is
work utilises tcptrace to extract features based on traffic burstiness and periods of inactivity (idle time) for the analysed traffic,
followed by the C5.0 algorithm for determining the applications that generated it. -e evaluation tests performed on a set of real,
uncontrolled traffic, indicated that the method has an accuracy of 79% in identifying the correct network application.
traffic, design networks, and detect threats. Identifying the type/name of applications that generate traffic is a challenging task as
encrypting traffic becomes the norm for Internet communication. -erefore, relying on conventional techniques such as deep
packet inspection (DPI) or port numbers is not efficient anymore. -is paper proposes a novel flow statistical-based set of features
that may be used for classifying applications by leveraging machine learning algorithms to yield high accuracy in identifying the
type of applications that generate the traffic. -e proposed features compute different timings between packets and flows. -is
work utilises tcptrace to extract features based on traffic burstiness and periods of inactivity (idle time) for the analysed traffic,
followed by the C5.0 algorithm for determining the applications that generated it. -e evaluation tests performed on a set of real,
uncontrolled traffic, indicated that the method has an accuracy of 79% in identifying the correct network application.
Original language | English |
---|---|
Article number | 5758437 |
Number of pages | 11 |
Journal | Journal of Computer Networks and Communications |
Volume | 2019 |
DOIs | |
Publication status | Published - 20 Aug 2019 |
Externally published | Yes |