Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.
Digital Object Identifier (DOI) : 10.14569/IJACSA.2013.040930
Article Published in International Journal of Advanced Computer Science and Applications(IJACSA), Volume 4 Issue 9, 2013.
Abstract: Spam is one of the main problems of the WWW. Many studies exist about characterising and detecting several types of Spam (mainly Web Spam, Email Spam, Forum/Blob Spam and Social Networking Spam). Nevertheless, to the best of our knowledge, there are no studies about the detection of Spam in Linkedin. In this article, we propose a method for detecting Spammers and Spam nets in the Linkedin social network. As there are no public or private Linkedin datasets in the state of the art, we have manually built a dataset of real Linkedin users, classifying them as Spammers or legitimate users. The proposed method for detecting Linkedin Spammers consists of a set of new heuristics and their combinations using a kNN classifier. Moreover, we proposed a method for detecting Spam nets (fake companies) in Linkedin, based on the idea that the profiles of these companies share content similarities. We have found that the proposed methods were very effective. We achieved an F-Measure of 0.971 and an AUC close to 1 in the detection of Spammer profiles, and in the detection of Spam nets, we have obtained an F-Measure of 1.
V´ictor M. Prieto, Manuel A´ lvarez and Fidel Cacheda, “Detecting Linkedin Spammers and its Spam Nets” International Journal of Advanced Computer Science and Applications(IJACSA), 4(9), 2013. http://dx.doi.org/10.14569/IJACSA.2013.040930