Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/202826
Title: | Improved Strategies for Session Identification and Frequent Pattern Generation in Web Usage Mining |
Researcher: | Kavitha D |
Guide(s): | Kalpana B |
Keywords: | Associative rule mining Frequent pattern generation Session Identification |
University: | Avinashilingam Deemed University For Women |
Completed Date: | 03/08/2017 |
Abstract: | The heterogeneous nature of the web combined with the rapid diffusion of web newlinebased applications has made web browsing an intricate activity for users. This has given newlinerise to an urgent need for developing systems capable of assisting and guiding users newlineduring their navigational activity in the web. Web Usage Mining (WUM) refers to the newlineapplication of data mining techniques for the automatic discovery of meaningful usage newlinepatterns characterizing the browsing behavior of users, starting from access data newlinecollected through the interactions of users with websites. The preprocessing, pattern newlinediscovery, and pattern analysis are the three main phases of web usage mining. In order newlineto implement functionalities the discovered patterns may be conveniently exploited to newlineoffer useful assistance to users. newlineWith the increase of internet usage and the steady growth of users, the www has newlinebecome a vast repository of data. The users access to web sites are stored in web newlineserver logs. However, the web log data do not present an exact picture of the users newlineaccesses to the web site. Preprocessing of the web log data is a crucial prerequisite that newlinemust be performed prior to applying data mining algorithms. To find useful patterns, newlinerequests (or log entries) need to be grouped into usage sessions. Session identification newlineof web log and discovering patterns from web log is a difficult task, since each user newlinemaintains multiple sessions for the specific duration. To solve this problem automatic newlinesession identification is performed based on the timeout method, in which the session is newlinedifferentiated based on the time interval with predefined threshold value. But, it is difficult newlineto set the time threshold for each session identification process. In recent years, several newlinework have found on dynamic log session identification among them n-gram models newlineproduces higher log session identification results. But the major issue of the n-gram newlinemodel is that it assumes the entire database query to be static, so dynamic query type is newlinenot applicable. |
Pagination: | 158 p. |
URI: | http://hdl.handle.net/10603/202826 |
Appears in Departments: | Department of Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 91.26 kB | Adobe PDF | View/Open |
02_certificate.pdf | 94.31 kB | Adobe PDF | View/Open | |
03_acknowledgement.pdf | 97.3 kB | Adobe PDF | View/Open | |
04_contents.pdf | 107.77 kB | Adobe PDF | View/Open | |
05_list of tables,figures & abbreviations.pdf | 123.35 kB | Adobe PDF | View/Open | |
07_chapter 1.pdf | 386.48 kB | Adobe PDF | View/Open | |
08_chapter 2.pdf | 274.28 kB | Adobe PDF | View/Open | |
09_chapter 3.pdf | 437.35 kB | Adobe PDF | View/Open | |
10_chapter 4.pdf | 413.94 kB | Adobe PDF | View/Open | |
11_chapter 5.pdf | 553.14 kB | Adobe PDF | View/Open | |
12_chapter 6.pdf | 117.13 kB | Adobe PDF | View/Open | |
13_references.pdf | 162.54 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: