Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/65416
Title: A study of web mining tools for query optimization
Researcher: Kumar, Sourabh
Guide(s): Mansotra, Vibhakar
Keywords: Computer Science and Information Technology
web mining tools
query optimization
University: University of Jammu
Completed Date: 2015
Abstract: The World Wide Web or simply the web may be seen as a huge collection of documents freely produced and published by a very large number of people, without any solid editorial control. This is probably the most democratic and anarchic widespread mean for anyone to express feelings, comments, convictions and ideas, independently of ethnics, sex, religion or any other characteristic of human societies. The web constitutes a comprehensive, dynamic, up-to-date repository of information regarding most of the areas of human knowledge and support an increasingly important part of commercial, artistic, scientific and personal transactions, which gives rise to a very strong interest from individuals, as well as from institutions, at a universal scale. However, the web also exhibits some characteristics that are adverse to the process of collecting information from it in order to satisfy specific needs some of the characteristics are, the large volume of data it contains, its dynamic nature, constituted by unstructured or semi-structured data, content and format heterogeneity and irregular data quality. End-users also introduce some additional difficulties in the retrieval process. Information needs are often imprecisely defined, generating a semantic gap between user needs and their specification. The satisfaction of a specific information need on the web is supported by search engines and other tools aimed at helping users to gather information from the web. While information retrieval (IR) has been an active field of research for decades, for much of its history it has had a very strong bias towards English as the language of choice for research and evaluation purposes. Whatever they may have been, over the years, many of the motivations for an almost exclusive focus on English as the language of choice in IR have lost their validity. The Internet is no longer monolingual, as the non- English content is growing rapidly. Asia is the largest and the most culturally and linguistically diverse continent.
Pagination: 207p.
URI: http://hdl.handle.net/10603/65416
Appears in Departments:Department of Computer Science & IT

Files in This Item:
File Description SizeFormat 
01_abstract.pdfAttached File588.39 kBAdobe PDFView/Open
02_certificate.pdf804.3 kBAdobe PDFView/Open
03_contents.pdf523.07 kBAdobe PDFView/Open
04_list of tables.pdf285.59 kBAdobe PDFView/Open
05_list of figures.pdf281.23 kBAdobe PDFView/Open
06_chapter_1.pdf511.58 kBAdobe PDFView/Open
07_chapter_2.pdf602.39 kBAdobe PDFView/Open
08_chapter_3.pdf594.41 kBAdobe PDFView/Open
09_chapter_4.pdf1.04 MBAdobe PDFView/Open
10_chapter_5.pdf887.16 kBAdobe PDFView/Open
11_chapter_6.pdf778.23 kBAdobe PDFView/Open
12_chapter_7.pdf234.38 kBAdobe PDFView/Open
13_references.pdf382.23 kBAdobe PDFView/Open


Items in Shodhganga are protected by copyright, with all rights reserved, unless otherwise indicated.