Web Document Classification Using Naïve Bayes

Adetunji, A. B. and Oguntoye, J. P. and Fenwa, O. D. and Akande, N. O. (2018) Web Document Classification Using Naïve Bayes. Journal of Advances in Mathematics and Computer Science, 29 (6). pp. 1-11. ISSN 24569968

[thumbnail of Oguntoye2962017JAMCS34128.pdf] Text
Oguntoye2962017JAMCS34128.pdf - Published Version

Download (389kB)

Abstract

World Wide Web has become a huge collection of documents and the amount of documents available is increasing on a daily basis. How to correctly classify the vast documents into a particular category and locate any document of interest easily has become a challenge researchers have been trying to solve for decades and different researchers have attempted different algorithms using different platform to achieve this aim. In this paper, a University web site was used as a case study and a machine learning workbench called WEKA (Waikato Environment for Knowledge Analysis) which provides a general-purpose environment for automatic classification, regression, clustering and feature selection was used as a machine learning platform. Running Naïve Bayes with 10-fold cross validation on the selected web data gives a 77% correctly classified instances in zero second with relative absolute error of 68.9937%. This shows the ability of Naïve Bayes algorithm to accurately classify vast amount of web document in a short time.

Item Type: Article
Subjects: Article Archives > Mathematical Science
Depositing User: Unnamed user with email support@articlearchives.org
Date Deposited: 10 May 2023 06:28
Last Modified: 18 Oct 2024 04:43
URI: http://archive.paparesearch.co.in/id/eprint/1154

Actions (login required)

View Item
View Item