Distributed classification for image spam detection

Amiza Amir, Bala Srinivasan, Asad I. Khan

    Research output: Contribution to journalArticleResearchpeer-review

    6 Citations (Scopus)

    Abstract

    Spam appears in various forms and the current trend in spamming is moving towards multimedia spam objects. Image spam is a new type of spam attacks which attempts to bypass the spam filters that mostly text-based. Spamming attacks the users in many ways and these are usually countered by having a server to filter the spammers. This paper provides a fully-distributed pattern recognition system within P2P networks using the distributed associative memory tree (DASMET) algorithm to detect spam which is cost-efficient and not prone to a single point of failure, unlike the server-based systems. This algorithm is scalable for large and frequently updated data sets, and specifically designed for data sets that consist of similar occurring patterns.We have evaluated our system against centralised state-of-the-art algorithms (NN, k-NN, naive Bayes, BPNN and RBFN) and distributed P2P-based algorithms (Ivote-DPV, ensemble k-NN, ensemble naive Bayes, and P2P-GN). The experimental results show that our method is highly accurate with a 98 to 99% accuracy rate, and incurs a small number of messages—in the best-case, it requires only two messages per recall test. In summary, our experimental results show that the DAS-MET performs best with a relatively small amount of resources for the spam detection compared to other distributed methods.

    Original languageEnglish
    Pages (from-to)13249-13278
    Number of pages30
    JournalMultimedia Tools and Applications
    Volume77
    Issue number11
    DOIs
    Publication statusPublished - Jun 2018

    Keywords

    • Distributed classification
    • Distributed data mining
    • Distributed pattern recognition
    • Image spam
    • P2P classification
    • P2P data mining
    • Spam detection

    Cite this