Please use this identifier to cite or link to this item: http://www.dspace.espol.edu.ec/handle/123456789/8588
Title: Implementación y evaluación de un detector masivo de web spam (Implementation and evaluation of a massive web spam detector)
Authors: González, Jesús
Bastidas, Washington
Abad, Cristina
Keywords: MAPREDUCE
SUPPORT VECTOR MACHINES
WEB SPAM
CLOUD COMPUTING
Issue Date: 8-Jan-2010
Abstract: This work presents a mechanism for detecting Web spam at massive scale, using a distributed architecture based on the MapReduce paradigm for parallel processing and Support Vector Machines (SVM) as the learning algorithm for classification. Web spam, that is, the unjustified assignment of relevance to pages on the Web, has become a widely studied topic because the parties involved, search engines on the one hand and the users who request information from them on the other, can be helped or harmed by how the problem is handled. Our solution offers an alternative for detecting Web spam pages that combines the MapReduce programming model, implemented with Hadoop, with a cascade SVM model. It runs on Amazon Web Services, which offer a practical and inexpensive way to process large quantities of information in the cloud.
URI: http://www.dspace.espol.edu.ec/handle/123456789/8588
Appears in Collections: Artículos de Tesis de Grado - FIEC

Files in This Item:
File: Implementación y evaluación de un detector masivo de Web Spam.pdf
Size: 557.79 kB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.