Semantic Focused Web Crawler for Service Discovery Using Data Mining Technique
Keywords:
Web Mining, Web Crawler, Focused Crawler, World Wide Web (WWW)Abstract
Data mining is the process of extraction of hidden predictive information from the huge databases. It is a new technology with great latent to help companies focus on the most important information in their data warehouses. Web mining is a data mining techniques which automatically discover information from web documents. The amount of data and its dynamicity makes it impossible to crawl the World Wide Web (WWW) completely. It’s a challenge in front of crawlers to crawl only the relevant pages from this information explosion. Thus a focused crawler solves this issue of relevancy by focusing on web pages for some given topic or a set of topics. Nowadays finding meaningful information among the billions of information resources on the World Wide Web is a difficult task due to growing popularity of the Internet. This paper basically focuses on study of the various techniques of data mining for finding the relevant information from World Wide Web using web crawler.
References
Lu LIU, Tao PENG “Clustering-based topical Web crawling using CFu-tree guided by linkcontext” in Higher Education Press and Springer-Verlag Berlin Heidelberg 2014
Hai Dong, Farookh Khadeer Hussain, and Elizabeth Chang “Ontology-Learning-Based Focused Crawling for Online Service Advertising Information Discovery and Classification” in Springer-Verlag Berlin Heidelberg 2012
Rodolfo Zunino, Roberto Surlinelli “An AnalystAdaptive Approach to Focused Crawlers” in 2013 IEEE/ACM International Conference on
Advances in Social Networks Analysis and Mining
Hai Dong, Farookh Khadeer Hussain “SelfAdaptive Semantic Focused Crawler for Mining Services Information Discovery” in IEEE Transactions On Industrial Informatics, Vol. 10, No. 2, May 2014
Hardik P. Trivedi, Gaurav N. Daxini, Jignesh A. Oswal, Vinay D. Gor, Swati Mali “An Approach to Design Personalized Focused Crawler” in International Journal of Computer Science and Engineering Volume-2, Issue-3 E-ISSN: 2347-2693
Bireshwar Ganguly, Devashri Raich “Performance Optimization of Focused Web Crawling Using Content Block Segmentation’’ in 978-1-4799-2102-7/14 $31.00 © 2014 IEEE DOI 10.1109/ICESC.2014.69
Hai Dong, Farookh Khadeer Hussain, Elizabeth Chang “A Transport Service Ontology based Focused Crawler” in 2008 IEEE
R.Eswaramoorthy, M.Jayanthi “A Survey on Detection of Mining Service Information Discovery Using SASF Crawler” in International
Journal of Innovative Research in Computer and Communication Engineering Vol. 2, Issue 10, October 2014
Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual Workshop on Computational Learning Theory, ACM: Pennsylvania, United States, 1992; 144-152.
http://www.csie.ntu.edu.tw/~cjlin/libsvm/
Osmar R. Zaïane, “Introduction to Data Mining” in CMPUT690 Principles of Knowledge Discovery in Databases
Trupti V. Udapure, Ravindra D. Kale, Rajesh C. Dharmik, “Study of Web Crawler and its Different Types” in IOSR Journal of Computer
Engineering (IOSR-JCE)
http://cacm.acm.org/blogs/blog-cacm/153780- data-mining-the-web-via-crawling/fulltext
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2015 COMPUSOFT: An International Journal of Advanced Computer Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.
©2023. COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY by COMPUSOFT PUBLICATION is licensed under a Creative Commons Attribution 4.0 International License. Based on a work at COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY. Permissions beyond the scope of this license may be available at Creative Commons Attribution 4.0 International Public License.