Study on Web Mining Algorithm based on Usage Mining
ABSTRACT
Web usage mining is an application of data mining technology to mining the data of the Web server log file. It can discover the browsing patterns of user and some kind of correlations between the web pages. Web usage mining provides the support for the Web site design, providing personalization server and other business making decision, etc. Web mining applies the data mining, the artificial intelligence and the chart technology and so on to the Web data and traces users¿ visiting characteristics, and then extracts the users¿ using pattern. This article will study on Web Mining Algorithm based on Usage Mining. And it also produces the design mentality of the electronic commerce website application algorithm. This algorithm is simple, effective and easy to realize, it is suitable to the Web usage mining demand of construct a low cost B2C website.
Similar to traditional data mining, three important Web mining operations include clustering, association, and sequential analysis. Typical clustering operations in Web mining involve finding natural groupings of Web resources or Web users. Researchers have pointed out some important differences between clustering in conventional applications and clustering in Web mining. For example, the clusters and associations in Web mining do not necessarily have crisp boundaries. Moreover, due to a variety of reasons inherent in Web browsing and Web logging, the likelihood of bad or incomplete data is higher. As a result, researchers have studied the possibility of using fuzzy sets in Web mining clustering applications. The paper describes how rough set theory can also be used to develop clustering schemes for Web mining. The unsupervised classification described in the paper uses properties of rough sets along with genetic algorithms to represent clusters as interval sets. The paper also describes the design of an experiment including data collection and the clustering process. The experiment is used to create interval set representations of groups of Web visitors
Web data mining is a new important research field in data mining. In this paper, the conception and characteristic of data mining based on Web are introduced the process and the general methods of data mining based on Web are expatiated. At present many websites are built with HTML, which is difficult to achieve real effective and accurate web mining. The appearance of XML has brought convenience for it. Based on the research of web mining, XML is used to transform semi-structured data to well structured data, and a model of web mining system which has basic data mining function and faces multi-data on the Web is built. At the same time, the problem in data mining is analyzed and studied. An example is put forward to prove the solution.
Web mining is a cross point of database, information retrieval and artificial intelligence. Web content mining (WCM), Web structure mining (WSM) and Web usage mining (WUM) buildup the whole Web mining. The research issues, techniques and development efforts are presented in this paper.
Existing Web usage mining (WUM) tools do not indicate which data mining algorithms are used or provide effective graphical visualizations of the results obtained. WUM techniques can be used to determine typical navigation patterns in an organizational Web site. The process of combining WUM and information visualization techniques in order to discover useful information about Web usage patterns is called visual Web mining. The goal of this paper is to discuss the development of a visual Web mining prototype, called WebPatterns, which allows the user to effectively visualize Web usage patterns
با سلام و تشکر از مدیریت محترم سایت که مقالات خوبی را قرار داده اند
داده کاوی یکی از موضوعات بین رشته ای است که امروزه در بسیاری از سیستم های اطلاعاتی استفاده می شود و ارتباط تنگاتنگی با هوش مصنوعی و آمار دارد. در کشور ما هم 4-5سالی هست که حتی انجمنی تخصصی در این راستا ایجاد شده است و هرساله کنفرانسی را نیز برگزار می کند.
به آدرس ذیل: .: پنجمین کنفرانس داده کاوی ایران :.
درواقع در داده کاوی به دنبال یافتن دانش از دل انبوهی از داده های موجود هستیم که بتوانیم با این یافته ها مشکلات موجود را مرتفع کرده و به عنوان تصمیم یار مدیران از آنها بهره گرفت.
با آرزوی توفیق