Optimizing feature selection to improve transport layer statistics quality

Optimizing feature selection to improve transport layer statistics quality

For access to this article, please select a purchase option:

Buy chapter PDF
(plus tax if applicable)
Buy Knowledge Pack
10 chapters for $120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Your details
Why are you recommending this title?
Select reason:
Network Classification for Traffic Management: Anomaly detection, feature selection, clustering and classification — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

There is significant interest in the network management and industrial security community about the need to improve the quality of transport layer statistics (TLS) and to identify the “best” and most relevant features. The ability to eliminate redundant and irrelevant features is important in order to improve the classification accuracy and to reduce the computational complexity related to the construction of the classifier. In practice, several feature selection (FS) methods can be used as a preprocessing step to eliminate redundant and irrelevant features and as a knowledge discovery tool to reveal the “best” features in many soft computing applications. This chapter investigates the advantages and disadvantages of such FS methods with new proposed metrics, namely goodness, stability, and similarity. The aim here is to come up with an integrated FS method that is built on the key strengths of existing FS methods. A novel way is described to identify efficiently and accurately the “best” features by first combining the results of some well-known FS methods to find consistent features and then use the proposed concept of support to select the smallest set of features and cover data optimality. The empirical study over ten high-dimensional network traffic datasets demonstrates significant gain in accuracy and improved runtime performance of a classifier compared to individual results of well-known FS methods.

Chapter Contents:

  • 6.1 Introduction
  • 6.2 FS methods for benchmarking
  • 6.3 The new metrics
  • 6.4 Experimental methodology
  • 6.5 Preliminary experiments
  • 6.5.1 Experimental results
  • 6.5.2 Discussion
  • 6.6 The local optimization approach (LOA)
  • 6.6.1 The algorithm
  • 6.6.2 An illustrative example
  • 6.6.3 Result and analysis
  • 6.6.4 Choice of parameters
  • 6.6.5 Impact of FS methods on runtime
  • 6.6.6 Comparing FS methods computational performance
  • 6.6.7 Summary of results with different datasets and limitations of LOA approach
  • 6.7 Conclusion

Inspec keywords: pattern classification; computational complexity; computer network management; telecommunication traffic; feature selection; statistical analysis; data mining

Other keywords: soft computing applications; feature selection methods; knowledge discovery tool; computational complexity; consistent features; redundant features; transport layer statistics quality; improved runtime performance; well-known FS methods; integrated FS method; irrelevant features; network management; industrial security community

Subjects: Computer networks and techniques; Data handling techniques; Computer communications; Other topics in statistics; Knowledge engineering techniques; Other topics in statistics; Network management

Preview this chapter:
Zoom in

Optimizing feature selection to improve transport layer statistics quality, Page 1 of 2

| /docserver/preview/fulltext/books/pc/pbpc032e/PBPC032E_ch6-1.gif /docserver/preview/fulltext/books/pc/pbpc032e/PBPC032E_ch6-2.gif

Related content

This is a required field
Please enter a valid email address