http://iet.metastore.ingenta.com
1887

## A semi-supervised approach for network traffic labeling

• Author(s):
• DOI:

$16.00 (plus tax if applicable) ##### Buy Knowledge Pack 10 chapters for$120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:

Network Classification for Traffic Management: Anomaly detection, feature selection, clustering and classification — Recommend this title to your library

## Thank you

As discussed in the previous two chapters, the recent promising studies for network classification have relied on the analysis of the statistics of traffic flows and the use of machine learning (ML) methods. However, due to the high cost of manual labeling, it is hard to obtain sufficient, reliable, and up-to-date labeled data for effective IP traffic classification. This chapter discusses a novel semi-supervised approach, called SemTra, which automatically alleviates the shortage of labeled flows for ML by exploiting the advantages of both supervised and unsupervised models. In particular, SemTra involves the followings: (i) generating multi-view representations of the original data based on dimensionality reduction methods to have strong discrimination ability; (ii) incorporating the generated representations into the ensemble clustering model to provide a combined clustering output with better quality and stability; (iii) adapting the concept of self-training to iteratively utilize the few labeled data along with unlabeled within local and global viewpoints; and (iv) obtaining the final class decision by combining the decisions of mapping strategy of clusters, the local self-training and global self-training approaches. Extensive experiments were carried out to compare the effectiveness of SemTra over representative semi-supervised methods using 16 network traffic datasets. The results clearly show that SemTra is able to yield noticeable improvement in accuracy (as high as 94.96%) and stability (as high as 95.04%) in the labeling process.

Chapter Contents:

• 9.1 Introduction
• 9.2 The semi-supervised traffic flow labeling
• 9.2.1 The multi-view layer
• 9.2.2 Initial clustering analysis
• 9.2.3 Ensemble clustering
• 9.2.4 Local self-training
• 9.2.5 Global self-training on meta-level features
• 9.2.6 Function agreement and labeling
• 9.3 Experimental evaluation
• 9.3.1 Datasets used in experiments
• 9.3.2 The baseline methods
• 9.3.3 The experimental setup
• 9.3.4 Performance metrics
• 9.3.5 Analysis of the experimental results
• 9.4 Conclusion

Preview this chapter:

A semi-supervised approach for network traffic labeling, Page 1 of 2

| /docserver/preview/fulltext/books/pc/pbpc032e/PBPC032E_ch9-1.gif /docserver/preview/fulltext/books/pc/pbpc032e/PBPC032E_ch9-2.gif

### Related content

content/books/10.1049/pbpc032e_ch9
pub_keyword,iet_inspecKeyword,pub_concept
6
6
This is a required field