A Novel Prefetching Technique through Frequent Sequential Patterns from Web Usage Data
Keywords:
Frequent sequential patterns (fsp), mining algorithm, WUDAbstract
Frequent sequential patterns (fsp) from web usage data (wud) are very important for analyzing and understanding users behavior to improve the quality of services offered by the world wide web(www). Web prefetching is one of the techniques for reducing the web latency there by improve the web retrieval process. This technique makes use of prefetching rules that are derived from fsps. In this paper, we explore the different fsp mining algorithms such as spm, fp growth, and spade for extraction of fsps from wud of an academic website for a period that varies from weekly to quarterly. Performance analysis on all of these fsp algorithms has been made against the number of fsps they generate with a given minimum support. Experimental results shows that spade fsp mining algorithm perform better compared to spm and fp growth algorithms. Based on the fsps, we propose a novel prefetching technique that generate prefetching rules from the fsps and prefetch the web pages so as to reduce the users’ perceived latency.
References
. Jinlin Chen, (2010) “An UpDown Directed Acyclic Graph Approach for Sequential Pattern Mining”, IEEE Transactions on Knowledge and Data Engineering, Vol. 22, No. 7, pp. 913-928.
. Ding-An Chiang, Cheng-Tzu Wang, Shao-Ping Chen. & Chun-Chi Chen, (2009) “The Cyclic Model Analysis on Sequential Patterns”, IEEE Transactions on Knowledge and Data Engineering, Vol. 21, No. 11, pp. 1617 – 1628.
. Yu Hirate. & Hayato Yamana, (2006) “Generalized Sequential Pattern Mining with Item Intervals”, Journal of Computers, Vol. 1, No. 3, pp. 51-60.
. Mohammed J. Zaki, (2001) “Spade: An Efficient Algorithm For Mining Frequent sequences”, Machine Learning, Vol. 42, pp. 31-60.
. Zhenglu Yang. & Masaru Kitsuregawa, (2005) “LAPIN-SPAM: An Improved Algorithm for MiningSequential Pattern”, IEEE International
Conference on Data Engineering Workshops, pp. 1222-1226.
. Zhenglu Yang, Yitong Wang. & Masaru Kitsuregawa, (2007) “LAPIN: Effective Sequential Pattern Mining Algorithms by Last Position Induction for Dense Databases”, International Conference on Database systems for advanced applications, pp. 1020-1023.
. Jianyong Wang, Jiawei Han. & Chun Li, (2007) “Frequent Closed Sequence Mining without Candidate Maintenance”, IEEE Transactions on Knowledge and Data Engineering, Vol.19, No. 8, pp. 1042-1056.
. Kuo-Yu Huang, Chia-Hui Chang, Jiun-Hung Tung. & Cheng-Tao Ho, (2006) “COBRA: Closed Sequential Pattern Mining Using Bi-phase Reduction Approach”, International Conference on Data Warehousing and Knowledge Discovery, pp. 280-291.
. Jian Pei, Jiawei Han, Behzad Mortazavi-Asl, Jianyong Wang, Helen Pinto, Qiming Chen, Umeshwar Dayal. & Mei-Chun Hsu, (2004) “Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach”, IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No. 11, pp. 1424-1440.
.Xiaorong Cheng. & Hong Liu, (2009) "Personalized Services Research Based On Web Data Mining Technology", IEEE International Symposium on Computational Intelligence and Design, pp. 177-180.
.Ford Lumban Gaol, (2010) “Exploring The Pattern of Habits of Users Using Web Log Sequential Pattern”, IEEE International Conference on Advances in Computing, Control and Telecommunication Technologies, pp. 161-163.
.Jian Pei, Jiawei Han, Behzad Mortazavi-asl. & Hua Zhu, (2000) “Mining Access Patterns Efficiently from Web Logs”, Pacific-Asia Conference on Knowledge Discovery and Data Mining Current Issues and New Applications, pp. 396-407.
.Tan Xiaoqiu, Yao Min. & Zhang Jianke, (2006) "Mining Maximal Frequent Access Sequences Based on Improved WAP-tree", IEEE International Conference on Intelligent Systems Design and Applications, pp. 616-620.
.Sen Yang, Jiankui Guo. & Yangyong Zhu, (2007) “An Efficient Algorithm for Web Access Pattern Mining”, Fourth International Conference on Fuzzy Systems and Knowledge Discovery, pp. 726 – 729.
.Lizhi Liu. & Jun Liu, (2010) “Mining Web Log Sequential Patterns with Layer Coded Breadth-First Linked WAP-Tree”, IEEE International Conference on Information Science and Management Engineering, pp. 28-31.
.V. Mohan, S. Vijayalakshmi . & S. Suresh Raja, (2009) “Mining Constraint-based Multidimensional Frequent Sequential Pattern in Web Logs”, European Journal of Scientific Research, Vol. 36, No. 3, pp. 480-490.
.Hai-yan Wu, Jing-jun Zhu. & Xin-yu Zhang, (2009) "The Explore of the Web-based Learning Environment based on Web Sequential Pattern Mining", IEEE International Conference on Computational Intelligence and Software Engineering, pp. 1-6.
.Bhupendra Verma, Karunesh Gupta, Shivani Panchal. & Rajesh Nigam, (2010) “Single Level Algorithm: An Improved Approach for Extracting User Navigational Patterns to Technology”, International Conference on Computer & Communication Technology, pp. 436-441.
.Olfa Nasraoui, Maha Soliman, Esin Saka, Antonio Badia. & Richard Germain, (2008) "A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites", IEEE Transactions on Knowledge and Data Engineering, Vol. 20, No. 2, pp. 202-215.
.Arthur Pitman. & Markus Zanker, (2010) “Insights From Applying Sequential Pattern Mining To ECommerce Click Stream Data”, IEEE International Conference on Data Mining Workshops, pp.967- 975.
.Florent Masseglia, Maguelonne Teisseire. & Pascal Poncelet, (2002) "Real Time Web Usage Mining with a Distributed Navigation Analysis", International Workshop on Research Issues in Data Engineering, pp. 169-174.
.Baoyao Zhou, Siu Cheung Hui. & Kuiyu Chang, (2004) “An Intelligent Recommender System using Sequential Web Access Patterns”, IEEE Conference on Cybernetics and Intelligent Systems, pp. 393-398.
.Show-Jane Yen, Yue-Shi Lee. & Min-Chi Hsieh, (2005) “An Efficient Incremental Algorithm for Mining Web Traversal Patterns”, IEEE
International Conference on e-Business Engineering, pp. 274-281.
.Zhennan Zhang, Xu Qian. & Yu Zhao, (2008) "Galois Lattice for Web Sequential Patterns Mining", IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application, pp. 102-106.
.Dr. Suresh Jain, Ratnesh Kumar Jain. & Dr. R. S. Kasana, (2009) “Efficient Web Log Mining using Doubly Linked Tree”, International Journal of Computer Science and Information Security, Vol. 3, No. 1, pp. 1-5.
.Dhirendra Kumar Jha, Anil Rajput, Manmohan Singh. & Archana Tomar, (2010) "An Efficient Model for Information Gain of Sequential Pattern from Web Logs based on Dynamic Weight Constraint", IEEE International Conference on Computer Information Systems and Industrial Management Applications, pp. 518-523.
.Xiaogang Wang, Yan Bai. & Yue Li, (2010) "An Information Retrieval Method Based on Sequential Access Patterns", IEEE Asia-Pacific Conference on Wearable Computing Systems, pp. 247-250.
.Kanak Saxena. & Rahul Shukla, (2010) "Significant Interval and Frequent Pattern Discovery in Web Log Data", IJCSI International Journal of Computer Science Issues, Vol. 7, No. 3, pp. 29-36.
.Diamanto Oikonomopoulou, Maria Rigou, Spiros Sirmakessis. & Athanasios Tsakalidis, (2004) “FullWeb Prediction based on Web Usage Mining and Site Topology”, IEEE/WIC/ACM International Conference on Web Intelligence, pp. 716-719.
.Rajimol A. & Raju G, (2011) “Mining Maximal Web Access Patterns- A New Approach”, International Journal of Machine Intelligence, Vol. 3, No. 4, pp. 346-348.
. J. Han, J.Pei, and Y. Yin,” Mining Frequent Patterns without Candidate Generation,” Proc. 2000 ACM-SIGMOD Int’l Conf. Management of Data (SIGMOD ’00),pp.1-12, May 2000.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2015 COMPUSOFT: An International Journal of Advanced Computer Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.
©2023. COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY by COMPUSOFT PUBLICATION is licensed under a Creative Commons Attribution 4.0 International License. Based on a work at COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY. Permissions beyond the scope of this license may be available at Creative Commons Attribution 4.0 International Public License.