Deskewing Using Binary and Grayscale Images

1032 Words3 Pages

Document Image Analysis has today become an increasingly important domain due to the de-

sire to reduce the amount of paper documents and archives. Optical Character Recognition

(OCR) systems and document structure analyzers are the essential tools to achieve this task.

It often appears that the document to be recognized is not correctly placed on the flat-bed

scanner, especially when the document comes from a book or a magazine. This results in a

skewed digitalized image which is a real problem for the document analysis, understanding,

character segmentation and recognition. Deskewing the input image is then a crucial step in the

document understanding. In this report, we propose a deskewing method based on the Hough

transform and filtering algorithms.

Moreover, the study of characters features is necessary if a most faithful reconstitution of the

document is expected. This implies the study of characters color, average boldness and skele-

ton. Furthermore, most Optical Character Recognition systems are sensitive to the quality and

size of the given characters. Skew angle correction and binarization steps damage the charac-

ters, especially for little characters. In order to maximize the efficiency of characters recognition,

these are upscaled and smoothed by using pixel art scaling algorithm.

This report is composed of 2 sections. Firstly, the different proposed algorithms to estimate a

document skew angle and secondly the characters features study.

In this section, we propose a simple method to estimate the skew angle of a document based

on the combination of the Sobel edge detection filter, a filtering algorithm and the Hough trans-

form. This method has been designed for a quick detection of li...

... middle of paper ...

...too many pixels are deleted during the

filtering step, which then won’t leave enough information for the Hough transform.

The tolerance of this filtering algorithm depends on the window size and the threshold value.

The more the difference between the window size and the threshold value is important, the

more tolerant the algorithm is.

1.2.2 Grayscale images filtering algorithm

This grayscale filtering algorithm has been designed to do the same type of filtering than the

previous algorithm, but on grayscale images. The major difficulty, compared to binary images,

lies in the distinction between the objects and the background. In binary images we only deal

with true or false values, while in grayscale images the values range goes from 0 to 255.

For the remaining of this paragraph we assume that the text has a darker color than the back-

ground.

Why is Hubel and Wiesel's Description of the Classical Receptive Field Inadequate for an Understanding of Visual Perception?
2170 Words | 5 Pages
The ultimate goal for a system of visual perception is representing visual scenes. It is generally assumed that this requires an initial ‘break-down’ of complex visual stimuli into some kind of “discrete subunits” (De Valois & De Valois, 1980, p.316) which can then be passed on and further processed by the brain. The task thus arises of identifying these subunits as well as the means by which the visual system interprets and processes sensory input. An approach to visual scene analysis that prevailed for many years was that of individual cortical cells being ‘feature detectors’ with particular response-criteria. Though not self-proclaimed, Hubel and Wiesel’s theory of a hierarchical visual system employs a form of such feature detectors. I will here discuss: the origins of the feature detection theory; Hubel and Wiesel’s hierarchical theory of visual perception; criticism of the hierarchical nature of the theory; an alternative theory of receptive-field cells as spatial frequency detectors; and the possibility of reconciling these two theories with reference to parallel processing.
Read More
Examples Of Direct Characterization In The Crucible
505 Words | 2 Pages
Characterization is the process by which the author reveal the personality of a character. Characterization can be created in two different ways: direct and indirect characterization. Direct characterization is when the author tells the readers what a character is like. Indirect characterization is based on clues from the story, the reader decides what a character is like. Indirect characterization can come from what the character says/does, what the character thinks, what others say about the character, and the character’s physical appearance.
Read More
Ted Chiang's Story Of Your Life
785 Words | 2 Pages
In Ted Chiang's Story of Your life, the author tells the story of Dr. Banks largely focused on the communication between Dr. Banks and other characters. As Dr. Banks communicates with the rest of the characters, the author takes this time for characterization. Characterization is the concept of creating a character for a narrative. Characterization can be presented by descriptions, through their actions, speech, thoughts and interactions with other characters. Overall, characterization and communication are tied together because characterization can include how the character communicates with others. In Ted Chiang's Story of Your Life, indirect characterization was used to illustrate the broader theme of communication between Dr. Banks and the heptapods, Gary, and her daughter.
Read More
Autism On Family Paper
1936 Words | 4 Pages
Accuracy: This paper demonstrates much accuracy, this is proven through the subtitles, statistics and in text citations for
Read More
Essay On Segmentation
531 Words | 2 Pages
Retinal vessel segmentation is important for the diagnosis of numerous eye diseases and plays an important role in automatic retinal disease screening systems. Automatic segmentation of retinal vessels and characterization of morphological attributes such as width, length, tortuosity, branching pattern and angle are utilized for the diagnosis of different cardiovascular and ophthalmologic diseases. Manual segmentation of retinal blood vessels is a long and tedious task which also requires training and skill. It is commonly accepted by the medical community that automatic quantification of retinal vessels is the first step in the development of a computer-assisted diagnostic system for ophthalmic disorders. A large number of algorithms for retinal vasculature segmentation have been proposed. The algorithms can be classified as pattern recognition techniques, matched filtering, vessel tracking, mathematical morphology, multiscale approaches, and model based approaches. The first paper on retinal blood vessel segmentation appeared in 1989 by Chaudhuri et al. [21]...
Read More
Characterization in The Sailor Who Fell From Grace with the Sea and Wonderful Fool
1546 Words | 4 Pages
The literary technique of characterization is often used to create and delineate a human character in a work of literature. When forming a character, writers can use many different methods of characterization. However, there is one method of characterization that speaks volumes about the character and requires no more than a single word - the character's personal name. In many cases, a personal name describes the character by associating him with a certain type of people or with a well known historical figure. Therefore, since the reader learns the character's name first, a personal name is a primary method of characterization; it creates an image in the reader's mind that corresponds with the name of the character. Once this image has been created, all subsequent actions and beliefs of the character are somehow in accordance with this image; otherwise, the character does not seem logical and the reader is not be able to relate to the work. In the novels The Sailor Who Fell From Grace with the Sea, by Yukio Mishima, and Wonderful Fool, by Shusako Endo, each author gives one of his characters a personal name that guides the character's actions and beliefs.
Read More
Telecommuting In The Workplace Case Study
2195 Words | 5 Pages
In this section, the results of the research are presented. For each task carried out, the most important information obtained is presented.
Read More
Witness for the Prosecution: A Character Analysis of Mr. Mayherne
1028 Words | 3 Pages
The categories associated with the means of means of characterization are considered to be explicit vs. implicit characterization, auto- vs. alterocharacterization and figural and narratorial as the foci of characterization. The use of certain means of characterization depends upon the preference of the author: his style, intentions and choice of focus. The characters are characterized by 1) what they say themselves, 2) what they do, 3) what the narrator says about them and 4) what other characters say about them. One should not, however, take for granted what is said by other characters since they might not be reliable, especially if one notices certain inconsistencies. This essay focuses on a story called Witness for the Prosecution written by the famous writer of detective stories, Agatha Christie.
Read More
The Book Thief: Movie Analysis
969 Words | 2 Pages
Characterization in a novel is an incredibly important tool for the author, as it sets up what the character will be like for the rest of the novel. Thus, characterization can never
Read More
Analysis: A Computational Approach To Edge Detection
1079 Words | 3 Pages
Gaussian filter is exclusively used for this purpose as the mask is simple. The standard convolution method is performed once the mask is calculated. Since the convolution mask is usually much smaller than the actual image, the mask slides over the image , manipulating the pixels in the image. The large width Gaussian masks are not preferred as detector's sensitivity to noise is low and moreover, the localization error in the detected edges also increases with increase in Gaussian mask width.
Read More
Iterative Least Square For GPS Navigation
1028 Words | 3 Pages
The relation of the filter to the system is illustrated in the block diagram of figure 16. The basic steps of the computational procedure for the discrete-time Kalman estimator are as follows:
Read More
Grey Relationsical Analysis: The Process Of Grey Relational Analysis
1027 Words | 3 Pages
Grey Relational generating is normalization process for performance attributes. Equation (12) is used to normalize beneficial attributes (the higher value the better option). Equation (13) is used to normalize non-beneficial attributes (the lower value the better option). Equation (14) is used to normalize attributes where the closer to the desired value (x_j*) the better option.
Read More
Importance Of Digital Image Forensics
1319 Words | 3 Pages
...ting the disparity map was based on belief propagation and mean shift segmentation [19]. The disparity map and the reference image (JI_L) are segmented into some objects. The objects and the average disparity of these objects are denoted by O_(JI_L)^i & d_(JI_L)^i, respectively, i = 1,2,…,m. If d_(JI_L)^i is in [D_b,D_f ], O_(JI_L)^i is regarded as the main content, O_(JI_L)^i∈O_maipart.If d_(JI_L)^iis not in [D_b,D_f ], O_(JI_L)^i is regarded as the background, O_(JI_L)^i∈O_background. That is,
Read More
Computer Vision Essay
685 Words | 2 Pages
In this report, the concepts of Speeded Up Robust Features algorithm, hierarchical K-means clustering, Term Frequency- Inverse Document Frequency weights and Random Sample Consensus are reviewed first and then the algorithm implemented in this project is discussed. In the experimental results, the accuracy of the algorithm is shown for images with noisy background, different scale size and inclined images. Last section of this report, concludes the proposed approach and refers to the future extensions of this project.
Read More
The Importance Of Image Matching
1159 Words | 3 Pages
By searching correct feature point and setting bidirectional threshold value,the matching process can be quickly and precisely implemented with optimistic result. The resemblance of two images is defined as the overall similarity between two families of image features[1]. Same proportion image matching algorithm using bi-directional threshold image matching technique is used. Small window of pixels in a reference image (template) is compared with equally sized windows of pixels in other (target) images. In FBM, instead of matching all pixels in an image, only selected points with certain features are to be matched. Area based matching provide low speed. feature based matching algorithm is faster in comparison to the area based matching technique. feature based matching time complexity depend on number of feature to be selected as well as right or wrong threshold. If the number of feature are high then sometimes it takes more computational time in comparison to area based feature. The number of features extracted from an image depends largely on the contents of an image. If there are high variations then features computed are high. This reduces time efficiency to
Read More

Open Document

Wait a second!

Deskewing Using Binary and Grayscale Images

Report this document

Please chosse a reason

You'll be redirected

Why is Hubel and Wiesel's Description of the Classical Receptive Field Inadequate for an Understanding of Visual Perception?

Examples Of Direct Characterization In The Crucible

Ted Chiang's Story Of Your Life

Autism On Family Paper

Essay On Segmentation

Characterization in The Sailor Who Fell From Grace with the Sea and Wonderful Fool

Telecommuting In The Workplace Case Study

Witness for the Prosecution: A Character Analysis of Mr. Mayherne

The Book Thief: Movie Analysis

Analysis: A Computational Approach To Edge Detection

Iterative Least Square For GPS Navigation

Grey Relationsical Analysis: The Process Of Grey Relational Analysis

Importance Of Digital Image Forensics

Computer Vision Essay

The Importance Of Image Matching

More about Deskewing Using Binary and Grayscale Images