Leveraging Auxiliary Knowledge for Web Service Clustering

View Fulltext

Author(s): Gang Tian ^{1, 2} ; Jian Wang ¹ ; Keqing He ¹ ; Cheng'ai Sun ²
- Affiliations: 1: State Key Laboratory of Software Engineering, School of Computer, Wuhan University, Wuhan 430072, China ;
  2: College of Information and Science Engineering, Shandong University of Science and Technology, Qingdao 266590, China
Source: Volume 25, Issue 5, September 2016, p. 858 – 865
DOI: 10.1049/cje.2016.06.008 , Print ISSN 1022-4653, Online ISSN 2075-5597

By grouping Web services that share similar functionalities, Web service clustering can greatly enhance Web service discovery and selection. Most existing clustering techniques are designed to handle long text documents. However, the descriptions of most publicly available Web services are in the form of short text, which impairs the quality of service clustering due to the sparseness of useful information. Towards this issue, we propose a new service clustering approach based on transfer learning from auxiliary long text data obtained from Wikipedia. To handle the inconsistencies in semantics and topics between service descriptions and auxiliary data, we introduce a novel topic model – Tag aided dual Author topical model (TD-ATM), which jointly learns two sets of topics on the two data sets and automatically couples the topic parameters to avoid the potential inconsistencies between these two data sets. Experimental results show the proposed approach outperforms several existing Web service clustering approaches.

Leveraging Auxiliary Knowledge for Web Service Clustering

Related content