Preview
Preview

Text Clustering Essay

No Works Cited
Length: 862 words (2.5 double-spaced pages)
Rating: Yellow      
Open Document
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

The idea of text clustering long preceded the computer age: “Clustering is one of the most primitive mental activities of humans, used to handle the huge amount of information they receive every day” (Theodoridis and Koutroubas, 2003: 398). The act of indexing long used in libraries is an obvious example. Manual clustering was the only type of document clustering possible prior to the computer age. This circumstance may have influenced much clustering work that relied only on immediate intuitive knowledge of the world without making use of quantitative numerical methods. In other words, text clustering was usually performed in subjective ways that relied heavily on the perception, knowledge, and judgment of the researcher. With more and easier accessibility to electronic digital data in different disciplines and the power of computing data processing on one hand and the need for maintaining objectivity standards on the other, it has become ever more likely that such procedures must involve computational automated methods (Arabie et al., 1996) where human intuition and traditional organization methods are replaced by mathematical and computational techniques (Golub, 2006; Golub, 2005). In this, recent years have witnessed a flourishing of the development of automated statistical clustering and classification systems for systematizing the inherent subjectivity in traditional text classification applications. It is this need for automated objective methodology that motivates our clustering of Hardy’s novels and short stores.
 Clustering vs. classification
The two terms clustering and classification are extensively used throughout this thesis. The question that rises at this point is: are they synonymous or is there a distinction...


... middle of paper ...


...ion is that clustering is an “unsupervised” activity while classification is a supervised one. In clustering, there is no one who assigns documents to classes but it is only the distribution and makeup of the data that will determine cluster membership (Manning et al., 2008).
To illustrate the argument, let us consider the following example. Having a set of 1000 documents on the history of English literature, these can be both clustered and classified. In performing a clustering task, documents are just clustered into distinct groups where similar or related documents are grouped together. In classification, on the other hand, predefined sets are given first. These can be Old English literature, Shakespearean literature, Augustan Literature, Romantic Literature, and Victorian Literature. Then documents are placed or classified under these predefined categories.



Click the button above to view the complete essay, speech, term paper, or research paper








This essay is 100% guaranteed.


Title Length Color Rating  
Recent Trends in Document Clustering with Evolutionary-Based Algorithms Essay examples - Document clustering is the process of organizing a particular electronic corpus of documents into subgroups of similar text features. Previously, a number of statistical algorithms had been applied to perform clustering to the data including the text documents. There are recent endeavors to enhance the performance of the clustering with the optimization based algorithms such as the evolutionary algorithms. Thus, document clustering with evolutionary algorithms became an emerging topic that gained more attention in the recent years....   [tags: clustering, inspection, research]
:: 80 Works Cited
742 words
(2.1 pages)
Better Essays [preview]
Essay Text Classification Systems - Currently, there are many classification systems. Broadly speaking, these systems fall into two main categories. These are binary and multiclass systems. Binary classification systems are only concerned with classifying documents into two main categories or groups. Classification systems of this kind are used to distinguish between just two classes of objects. As Maranis and Bebenko (2009) explain, these systems provide Yes/No answer to the question: Does this document belong to class X. In this, such systems can be useful in classifying emails where they are classified whether spam or not, or commercial transactions where they are determined to be fraudulent or not....   [tags: Text Analysis] 1054 words
(3 pages)
Strong Essays [preview]
Computer-Assisted Text Analysis Essay - Computational approaches are largely used in the variety of text applications such as feature selection and classification tasks because of their efficiency of dealing with huge amount of data. The discussion is concerned, however, with the applications of computational approaches to only literary texts in general and Hardy’s texts in particular. To my knowledge, there is no computer-aided thematic classification of the works of Thomas Hardy. The only study that approached Hardy’s works in terms of clustering techniques is Hoover’s (2002)....   [tags: Text Analysis] 870 words
(2.5 pages)
Better Essays [preview]
Essay on Exploring Affinity Propagation - ... Various mechanism-learning investigators have found that unlabeled data, when rummage-sale in conjunction with a small amount of categorized data, can yield substantial development in learning accuracy. A) Irrelevant Based Feature Selection A feature selection algorithm may be evaluated from both the efficiency and effectiveness arguments. Although the effectiveness concerns the time requisite to find a subsection of features, the efficiency is associated to the excellence of the subsection of features....   [tags: clustering data, algorithms, statistics] 1014 words
(2.9 pages)
Research Papers [preview]
Essay about The Effects of Words Clustering on Memory - As world spin and time pass by, we learn a lot of new things every day. Some we may remember immediately but some are impossible to do it. Thus we as a human try a lot of ways to make every day life easier. Word clustering is one of it. Clustering is meant “similar” or “same”. The way we use it is, when we a given a thing to do, we will categorize same thing that have similar or same pattern or any common pattern. This word clustering seems to give a different impact on every people. We have two types of memory, long term memory and short term memory....   [tags: Words Clustering] 2253 words
(6.4 pages)
Strong Essays [preview]
Clustering in Financial Services: A Literrature Review Essay - The theory of cluster has became one of theory that considered important in the regional economic development theoryin the recent worlds. It suggest that the co-ocation or geographic proximity results for firms that do clustering will be crucial to increase the economics scope of the firms, mainly due to lower input cost that are resulted from agglomeration economies and facilitates knowledge spillovers which can increase the firms’ productivity (Wolman and Hincapie, 2010, p1). Therefore, this can creates more competitive firms that do the clustering, and also produce significant growth for the firms....   [tags: markusen, clustering, financial services]
:: 12 Works Cited
2165 words
(6.2 pages)
Term Papers [preview]
Essay on The Persuasive Text - The purpose of a persuasive text is to change or alter the viewpoint of the reader for it to agree with the author’s perspective. The intention of this specific text is to persuade the reader to help end poverty today by joining ‘Make Poverty History’ and it uses persuasive language and techniques to do this – this essay will explain the effect on the reader and will focus on analysing persuasive language. Pronouns are an effective persuasive language technique because they address the reader directly....   [tags: persuasive text] 835 words
(2.4 pages)
Better Essays [preview]
The Visual Text of a Herbal Essences´ Advertisement Essay - In the October 21, 2013 issue of In Touch magazine, Herbal Essences’ hair products has an eye catching, bright advertisement to catch a readers attention. As readers browse through the magazine, the advertisement serves the purpose to stop and catch the reader’s attention. The dark haired, dark skinned model, Nicole Scherzinger, is used to show-off the products. Scherzinger’s hair is an image a reader can’t help but notice. Long, think, and full a volume is what attracts the reader, especially females because they would love for their hair to look like the hairstyle shown on the advertisement....   [tags: advertisement, hair, model, text] 643 words
(1.8 pages)
Better Essays [preview]
We Never Talk Anymore: The Problem with Text Messaging Essay - Text messaging has become a norm in our generation, as technology rapidly advances and gives way to more efficient forms of communication in a fast-paced world; and many are skeptical about the influence this new form of interaction is having on our society, especially with our younger generation. David Crystal, a professor at the University of Wales, writes “2b or Not 2b?” in support of text messaging. He insists, despite those who underestimate or negate the beneficial influence text messaging has on language proficiency, that “there is increasing evidence that [texting] helps rather than hinders literacy” and that the fairly recent form of communication has actually been around for a whil...   [tags: text messaging, texting, communication] 1354 words
(3.9 pages)
Strong Essays [preview]
Essay about text comparison - I chose to compare the Martini chapter, which I will refer to as “Martini,” to “Human Anatomy” by Kent Van De Graaff, which I will refer to as “Graaff.” The chapter being compared in both texts is the reproductive system. Graaff decided to separate the male and female reproductive systems into two chapters, which didn’t help or hurt the content. Both texts provided very good information, and both had their good and not so good aspects. The opening pages of both texts look very similar and provide a lot of the same material....   [tags: essays research papers] 897 words
(2.6 pages)
Strong Essays [preview]