High-accuracy document classification with a new algorithm

High-accuracy document classification with a new algorithm

For access to this article, please select a purchase option:

Buy article PDF
(plus tax if applicable)
Buy Knowledge Pack
10 articles for $120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Your details
Why are you recommending this title?
Select reason:
Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A new algorithm based on learning vector quantisation classifier is presented based on a modified proximity-measure, which enforces a predetermined correct classification level in training while using sliding-mode approach for stable variation in weight updates towards convergence. The proposed algorithm and some well-known counterparts are implemented by using Python libraries and compared in a task of text classification for document categorisation. Results reveal that the new classifier is a successful contender to those algorithms in terms of testing and training performances.


    1. 1)
      • 1. Joachims, T.: ‘A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization’. Proc. Int. Conf. Machine Learning, San Francisco, CA, USA, July 1997, pp. 143151.
    2. 2)
      • 2. Frank, E., Bouckaert, R.: ‘Naive Bayes for text classification with unbalanced classes’. Proc. European Conf. Principles and Practice of Knowledge Discovery in Databases, Berlin, Germany, September 2006, vol. 4213, pp. 503510.
    3. 3)
    4. 4)
      • 4. Gkanogiannis, A., Kalamboukis, T.: ‘A perceptron-like linear supervised algorithm for text classification’, in Cao, L., et al (Ed) ‘Advanced data mining and applications’ (Springer, Berlin, 6440, 2010), pp. 8697.
    5. 5)
    6. 6)
      • 6. Khan, A., Baharudin, B., Lee, L.H., et al: ‘A review of machine learning algorithms for text-documents classification’, J. Adv. Inf. Technol., 2010, 1, (1), pp. 420.
    7. 7)
    8. 8)
    9. 9)
      • 9. Pacella, M., Grieco, A., Blaco, M.: ‘On the use of self-organizing map for text clustering in engineering change process analysis: a case study’, Comput. Intel. Neurosci., 2016, ID 5139574, pp. 111.
    10. 10)
    11. 11)
    12. 12)
      • 12. Temel, T., Karlik, B.: ‘An improved odor recognition system using learning vector quantization with a new discriminant analysis’, Neural Netw. World, 2007, 17, (4), pp. 287294.
    13. 13)
      • 13. Temel, T.: ‘System and circuit design for biologically-inspired intelligent learning’ (IGI Global, PA, USA, 2010).
    14. 14)
    15. 15)
    16. 16)
    17. 17)
    18. 18)
    19. 19)
    20. 20)

Related content

This is a required field
Please enter a valid email address