Abstract: Internet technologies are growing rapidly, and Web page recommendation is improving along with them. The aim of a Web page recommender system is to predict the page or pages that will be visited next from a given page of a website. Data preprocessing is a basic and essential part of Web page recommendation: it consists of cleaning and structuring data to prepare it for pattern extraction. In this paper, we discuss Web page recommendation and the role that data preprocessing plays in it.
Keywords: Recommender system, Web server logs, Web mining, Web usage mining, data preprocessing.
1. INTRODUCTION
The rapid and unpredictable growth of information on the World Wide Web, together with the progress of innovative electronic devices, has made Web information increasingly important in everyone’s life. In today’s era, as we
Problems like these relate to Web usage. Hence, Web log data needs to be cleaned and structured, which is the data preprocessing part of Web Usage Mining [3]. Data preprocessing plays a vital role because log data are redundant and often irrelevant [4]. Data preprocessing is therefore a basic and essential part of Web page recommendation. This paper is structured as follows: Section 2 reviews Web page recommendation. Section 3 explains the categorization of recommendation systems and Web mining, and discusses how data preprocessing is related to Web page recommendation. Section 4 describes data preprocessing and its steps. Section 5 provides a comparative analysis of the data preprocessing techniques in use; and finally, Section 6 gives the
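The cleaning of Web log data mentioned above can be sketched in Python. This is only an illustrative sketch, not the procedure of the cited works: it assumes the server writes Common Log Format lines (the text does not name a log format), and the regular expression, the `STATIC_SUFFIXES` list, and the function names `parse_line`, `is_relevant`, and `clean_log` are hypothetical. It drops unparseable lines, requests for embedded static resources, robots.txt fetches, and unsuccessful requests, which are typical examples of redundant or irrelevant log entries.

```python
import re
from typing import Iterable, List, Optional

# Assumption: Common Log Format, e.g.
# host ident user [time] "METHOD url PROTOCOL" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)

# Hypothetical list of suffixes treated as embedded resources, not page views.
STATIC_SUFFIXES = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")


def parse_line(line: str) -> Optional[dict]:
    """Return the fields of one log line, or None if it does not parse."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None


def is_relevant(entry: dict) -> bool:
    """Keep only successful GET requests for actual pages."""
    path = entry["url"].split("?", 1)[0].lower()
    if path.endswith(STATIC_SUFFIXES):
        return False  # image, stylesheet, or script fetched with a page
    if path == "/robots.txt":
        return False  # typically requested by crawlers, not by users
    return entry["method"] == "GET" and entry["status"].startswith("2")


def clean_log(lines: Iterable[str]) -> List[dict]:
    """Parse raw log lines and discard malformed or irrelevant entries."""
    cleaned = []
    for line in lines:
        entry = parse_line(line)
        if entry is not None and is_relevant(entry):
            cleaned.append(entry)
    return cleaned
```

Calling `clean_log` on an iterable of raw log lines returns a list of dictionaries, one per retained request. Any further structuring of the data would operate on that list.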
One of the most lucrative businesses on the Internet is marketing. Companies have come up with ingenious ways to generate revenue with highly targeted advertising. Each company has its own method of identifying its consumers, some more complicated than others. For example, a website geared to new mothers would carry advertisements for baby diapers or formula. This type of targeted advertising is understood and accepted. The consumer benefits by seeing advertisements that match their interests, and the vendor has a higher likelihood of making a sale. The Internet has introduced novel ways to track consumer habits and interests, thereby creating smarter advertising. Microsoft’s browser, Internet Explorer, uses “cookies” that allow user habits to be tracked. Cookies are pieces of text stored by a user’s web browser; they are sent back and forth every time the user accesses a web page. These can be tracked to follow web surfers’ actions. Cookies are used to store...
Various web-based companies have developed techniques to record their customers’ data, enabling them to provide an enhanced web experience. One such method, called “cookies,” is used by Microsoft’s web browser, Internet Explorer, and traces the user’s habits. Cookies are pieces of text stored by the web browser that are sent back and forth every time the user accesses a web page. These can be tracked to follow web surfers’ actions. Cookies are also used to store the user’s passwords, which makes banking sites and email accounts easier to use. Another technique, used by popular search engines, is to personalize the search results. Search engines such as Google sell top placements in the search results to advertisers and are paid only when users click on them. Therefore, Google tries to produce the most relevant search results for its users with a feature called web history. Web history h...
Wallace, Jonathon. (1997). Labelling, rating and filtering systems on the Internet. [Online]. Available: http://www.spectacle.org/cda/rate.html. [1997, Sep. 02].
Over the past few decades, the generation and availability of information in cyberspace has increased enormously. There is a pressing need for solutions that filter the relevant data out of disorganised collections so that users can select the data most suitable for them. Many strategies have been developed to assist in selecting relevant information for the user. Internet applications make searching convenient by incorporating recommender systems, which help to filter unwanted information, predict the needs and preferences of users (Long, Zhang, & Hu, 2011), and provide suggestions to the users. Compared to other fields of information systems, recommender systems are a relatively new field, as they were initially part of information retrieval and management sciences.
Numerous studies have pointed out that while almost all Fortune 500 companies have invested heavily in web analytics, they still struggle to make meaningful business decisions from it. Most people complain that there are terabytes of data, gigabytes of reports, and megabytes of Excel and PowerPoint files, yet no actionable insights and no real awareness of what is going on amid the clutter of clickstream data.
In today’s fast-paced world of technology, search engines have become a vastly popular part of people’s daily routines. A search engine is an information retrieval system that allows someone to search the...
There is a debate over the benefits and the potential informational privacy issues of web-data mining. There is a large amount of valuable data on the web, and it can be retrieved easily by using a search engine. Applying web-data mining techniques to this data yields a large number of benefits. Web-data mining techniques are appealing to businesses for several reasons [1]. For example, if a company wants to expand its bu...
The Internet has become a key ingredient of a strenuous and busy lifestyle. It has become the central hub for communication, exploration, connecting with people, and official purposes. As a result, the growth of the Internet has led to a plethora of new developments, such as decreased margins for companies as consumers turn more and more to the Internet to buy goods and demand the best prices.
Data mining is a field that combines numerous other fields, such as database research, artificial intelligence, and statistics. Data mining involves looking for patterns in vast amounts of data as part of the knowledge discovery process. Huang, Cao, and Srivastava (2011) contains numerous papers dedicated solely to discussing the advancements that have been made in the field of data mining and knowledge discovery. Many researchers have thoroughly surveyed what has been done in data mining and the possibilities that may soon be implemented in practice. The research covers not only the history and the reasons that led to various advancements but also detailed models of the proposed solutions to deficiencies in existing systems.
Information Retrieval (IR) is concerned with representing, storing, organising, and retrieving information. The information should be easy to access, because users are more interested in information that is easy to reach. Information retrieval is the practice of searching for documents, for information within documents, and for metadata about documents, as well as searching relational databases and the World Wide Web. According to Shing Ping Tucker (2008), e-commerce is a rapidly growing segment of the Internet.
Today, the topic of data mining attracts much interest in government, business, and research circles. With the growth of computer use within these areas has also come a greater desire to let computers do the work that used to be done by humans. The problem, nowadays, is that the data that needs to be analyzed has become too large and cumbersome for one person, or even teams of people, to envision tackling without help from computers. These computers are no longer mere crunchers of numbers; they now find the patterns that humans used to find. From this growth has arisen a vast body of knowledge concerned with this process of data analysis. As with much other information, the Internet is employed to make available the ever-growing body of information on this topic. Many general sources of information [a,b,c] are now online. These are updated and expanded on an almost constant basis. The use of the Internet to disseminate and collect information is itself a consideration in this field. The amount of information is expanding at such a rate that old methods of information disposal, such as paper journals and b...
Thuraisingham, Bhavani. (2003). Web Data Mining and Applications in Business Intelligence and Counter-Terrorism. Taylor & Francis. http://www.myilibrary.com?id=6372.
Today, our society has access to mankind’s collective knowledge with the internet. Constantly updated, the internet keeps everyone in the loop. If there is a traffic jam, Google Maps will notify you. If there is a new movie release, Fandango will ask to reserve tickets for you. If there is a limited-time sale, Amazon will email you. Information constantly bombards us. The internet moves fast, and we must try to keep up to stay in
The Internet has become a major tool for communication and access to information for over two and a half billion people (Wright 121). The Internet has become an unavoidable reality that is enveloping our planet in a web of information, although this process is being shaped by our actions and choices, which ultimately drive us together (Deibert 11). China now has over 538 million netizens, the world’s largest online community (Feng & Guo 335).
The Internet’s influence has spread throughout our lives. According to a 2009 US Census survey, 74% of Americans use the Internet and have access within their household, a number that has increased every year since 1990 and will surely grow in the future. In this survey, respondents revealed that they did various activities on the Internet, including social media (Facebook and Twitter), researching and reading news articles, watching YouTube videos, shopping, and much more, all of which can be done with a computer or an Internet-enabled phone. This ease of use and convenience casts a shadow upon the future of printed and broadcast information. The Web’s instant and vast knowledge bank has changed ...