UNDERSTANDING WEB TRAFFIC ACTIVITIES USING WEB MINING TECHNIQUES
Web Usage Mining is a computational process of discovering patterns in large data sets involving methods using the artificial intelligence, machine learning, statistical analysis and database systems with the goal to extract valuable information from accessing server logs of World Wide Web data repositories and transform it into an understandable structure for further understanding and use. Main focus of this paper will be centered on exploring methods that expedites the log mining process and present the result of log mining process through data visualization and compare data-mining algorithms. For the comparison between classification techniques, precision, recall and ROC area are the correct measures that are used to compare algorithms. Based on this study it shows that Naïve Bayes and Bayes Network are proven to be the best algorithms for that.
Srivastava, J., Cooley, R., Deshpande, M., Tan, P.N. (2002): Web usage mining: Discovery and applications of usage patterns from web data. SIGKDD Explorations 1, pp. 12-23.
Jose M. Domenech, Javier Lorenzo (2007). A Tool for Web Usage Mining. 8th International Conference on Intelligent Data Engineering and Automated Learning (IDEAL'07), Birmingham, UK. DOI: https://doi.org/10.1007/978-3-540-77226-2_70
Anand S. Lalani (2003). Data Mining of Web Access Logs. A minor thesis, School of Computer Science and Information Technology, Faculty of Applied Science, Royal Melbourne Institute of Technology, Melbourne, Victoria, Australia.
James Pitkow (1997). In search of reliable usage data on the www. In Proc. of the Sixth International WWW Conference, pp. 1-13. DOI: https://doi.org/10.1016/S0169-7552(97)00021-4
Teressa T. Chikohora (2014): A Study of the Factors Considered when Choosing an Appropriate Data Mining Algorithm. International Journal of Soft Computing and Engineering (IJSCE) , Volume-4, Issue-3
Berson, A., Smith, S., Thearling, K. (ed.) (1999). Building Data Mining Applications for CRM. NewYork: McGraw-Hill.
Aniket Dash (2010) Web Usage Mining: An Implementation. A minor thesis, National Institute of Technology, Rourkela, India.
Robert Cooley, BamshadMobasher, and Jaideep Srivastava (1999). Data Preparation for Mining World Wide Web. Browsing Patterns Knowledge and Information Systems Vol. 1 Issue 1, pp 5- 32. DOI: https://doi.org/10.1007/BF03325089
Spilipoulou M., Mobasher B, Berendt B. (2003) “A framework for the Evaluation of Session Reconstruction Heuristics in Web Usage Analysis,” INFORMS Journal on Computing spring. DOI: https://doi.org/10.1287/ijoc.126.96.36.19945
ChitraNasa, Suman (2012). Evaluation of Different Classification Techniques for WEB Data. International Journal of Computer Applications (0975–8887), Volume 52.
Copyright (c) 2017 Ng Qi Yau, Wan Zainon
This work is licensed under a Creative Commons Attribution 4.0 International License.
License and Copyright Agreement
In submitting the manuscript to the journal, the authors certify that:
- They are authorized by their co-authors to enter into these arrangements.
- The work described has not been formally published before, except in the form of an abstract or as part of a published lecture, review, thesis, or overlay journal.
- That it is not under consideration for publication elsewhere.
- That its release has been approved by all the author(s) and by the responsible authorities – tacitly or explicitly – of the institutes where the work has been carried out.
- They secure the right to reproduce any material that has already been published or copyrighted elsewhere.
- They agree to the following license and copyright agreement.
Authors who publish with International Journal of Engineering Technologies and Management Research agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors can enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or edit it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) before and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
For More info, please visit CopyRight Section