
Identify content quality in online social networks

IET Communications

The flood of low-quality user-generated content (UGC) in online social networks (OSNs) threatens web knowledge management systems. Several domain-specific systems have recently been developed to address this problem, for example by predicting the correct answer in question-answering (QA) communities or recognising reliable comments in product review forums. A major drawback of most of these efforts is the lack of a general framework applicable to all OSNs. In this study, the authors start by analysing the effects of distinguishing features on UGC quality in different types of OSNs. Extensive statistical analysis reveals diverse patterns of human information-sharing activity in dissimilar OSNs. This discovery is employed as prior knowledge in the classification framework, which decomposes the original highly imbalanced problem into several balanced sub-problems. Ensemble classifiers are applied to samples from clusters generated by incompact features. Experiments show that the proposed framework is both effective and efficient for several OSNs. The contributions of this study are two-fold: (i) modelling posting activity in different types of OSNs; (ii) proposing a novel classification framework to identify UGC quality.
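The idea of decomposing a highly imbalanced problem into balanced sub-problems and combining the resulting classifiers can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the features, the clustering step, or the base classifier, so the sketch substitutes a random partition of the majority class for the clustering and a nearest-centroid rule for the base learner. All names (`train_balanced_ensemble`, `predict`) and the toy data are hypothetical.

```python
import random
from statistics import mean


def centroid(rows):
    """Component-wise mean of a list of equal-length feature vectors."""
    return [mean(col) for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def train_balanced_ensemble(minority, majority, seed=0):
    """Split the majority class into chunks of roughly minority size.

    Each chunk is paired with the full minority class, giving one balanced
    sub-problem per chunk. The "classifier" for a sub-problem is just the
    pair (minority centroid, chunk centroid). Assumption: a random partition
    stands in for the feature-based clustering mentioned in the abstract.
    """
    rng = random.Random(seed)
    shuffled = majority[:]
    rng.shuffle(shuffled)
    k = max(1, len(shuffled) // len(minority))  # number of sub-problems
    chunks = [shuffled[i::k] for i in range(k)]
    pos_centroid = centroid(minority)
    return [(pos_centroid, centroid(chunk)) for chunk in chunks]


def predict(ensemble, x):
    """Majority vote over the sub-problem classifiers (1 = minority class)."""
    votes = sum(
        1 for pos_c, neg_c in ensemble if sq_dist(x, pos_c) < sq_dist(x, neg_c)
    )
    return 1 if 2 * votes > len(ensemble) else 0


# Toy data: 5 minority (e.g. high-quality) posts vs 50 majority posts.
minority = [[5 + 0.1 * i, 5 - 0.1 * i] for i in range(5)]
majority = [[0.05 * i, 0.03 * i] for i in range(50)]

ensemble = train_balanced_ensemble(minority, majority)
print(len(ensemble), predict(ensemble, [5.0, 5.0]), predict(ensemble, [0.5, 0.5]))
```

Because every sub-problem sees the same number of minority and majority samples, no single base learner is dominated by the majority class; the vote then aggregates their decisions.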

http://iet.metastore.ingenta.com/content/journals/10.1049/iet-com.2011.0202