Sem Soni

CHAPTER 1
INTRODUCTION
In the running world, there is growing demand for the software systems to recognize characters
in computer system when information is scanned through paper documents as we know that we
have number of newspapers and books which are in printed format related to different subjects.
These days there is a huge demand in “storing the information available in these paper
documents in to a computer storage disk and then later reusing this information by searching
process”. One simple way to store information in these paper documents in to computer system
is to first scan the documents and then store them as IMAGES. But to reuse this information it is
very difficult to read the individual contents and searching the contents form these documents
line-by-line and word-by-word. The reason for this difficulty is the font characteristics of the
characters in paper documents are different to font of the characters in computer system. As a
result, computer is unable to recognize the characters while reading them. This concept of storing
the contents of paper documents in computer storage place and then reading and searching the
content is called DOCUMENT PROCESSING. Sometimes in this document processing we need
to process the information that is related to languages other than the English in the world.For this
document processing we need a software system called CHARACTER RECOGNITION
SYSTEM. This process is also called DOCUMENT IMAGE ANALYSIS (DIA). Thus our need
is to develop character recognition software system to perform Document Image Analysis which
transforms documents in paper format to electronic format. For this process there are various
techniques in the world. Among all those techniques we have chosen Optical Character
Recognition as main fundamental technique to recognize characters. The conversion of paper
documents in to electronic format is an on-going task in many of the organizations particularly
in Research and Development (R&D) area, in large business enterprises, in government
institutions, so on. From our problem statement we can introduce the necessity of Optical
Character Recognition in mobile electronic devices such as cell phones, digital cameras to
acquire images and recognize them as a part of face recognition and validation. To effectively
use Optical Character Recognition for character recognition in-order to perform Document
Image Analysis (DIA), we are using the information in Grid format. This system is thus effective
and useful in Virtual Digital Library’s design and construction
1.1 PURPOSE
The main purpose of Optical Character Recognition (OCR) system based on a grid infrastructure
is to perform Document Image Analysis, document processing of electronic document formats
converted from paper formats more effectively and efficiently. This improves the accuracy of
recognizing the characters during document processing compared to various existing available
character recognition methods. Here OCR technique derives the meaning of the characters, their
font properties from their bit-mapped images.
 The primary objective is to speed up the process of character recognition in document

processing. As a result the system can process huge number of documents with-in less
time and hence saves the time.
 Since our character recognition is based on a grid infrastructure, it aims to recognize

multiple heterogeneous characters that belong to different universal languages with
different font properties and alignments.
1.2 PROJECT SCOPE
The scope of our product Optical Character Recognition on a grid infrastructure is to provide an
efficient and enhanced software tool for the users to perform Document Image Analysis,
document processing by reading and recognizing the characters in research, academic,
governmental and business organizations that are having large pool of documented, scanned
images. Irrespective of the size of documents and the type of characters in documents, the
product is recognizing them, searching them and processing them faster according to the needs
of the environment.
1.3 EXISTING SYSTEM
In the running world there is a growing demand for the users to convert the printed documents in
to electronic documents for maintaining the security of their data. Hence the basic OCR system
was invented to convert the data available on papers in to computer process able documents, So
that the documents can be editable and reusable. The existing system/the previous system of
OCR on a grid infrastructure is just OCR without grid functionality. That is the existing system
deals with the homogeneous character recognition or character recognition of single languages.
1.4 DRAWBACK OF EXISTING SYSTEM
The drawback in the early OCR systems is that they only have the capability to convert and
recognize only the documents of English or a specific language only. That is, the older OCR
system is uni-lingual.
1.5 PROPOSED SYSTEM
Our proposed system is OCR on a grid infrastructure which is a character recognition system that
supports recognition of the characters of multiple languages. This feature is what we call grid
infrastructure which eliminates the problem of heterogeneous character recognition and supports
multiple functionalities to be performed on the document. The multiple functionalities include
editing and searching too where as the existing system supports only editing of the document. In
this context, Grid infrastructure means the infrastructure that supports group of specific set of
languages. Thus OCR on a grid infrastructure is multi-lingual.
16 BENEFIT OF PROPOSED SYSTEM
The benefit of proposed system that overcomes the drawback of the existing system is that it
supports multiple functionalities such as editing and searching. It also adds benefit by providing
heterogeneous characters recognition.
1.7 ARCHITECTURE OF THE PROPOSED SYSTEM
The Architecture of the optical character recognition system on a grid infrastructure consists of
the three main components. They are:-
 Scanner
 OCR Hardware or Software
 Output Interface
Figure.1: OCR Architecture

1.8 INTENDED AUDIENCE AND READING SUGGESTIONS
In this section, we identify the audience who are interested with the product and are involved in
the implementation of the product either directly or indirectly. As from our research, the OCR
system is mainly useful in R&D at various scientific organizations, in governmental institutes
and in large business organizations, we identify the following as various interested audience in
implementing OCR system:-
 The scientists, the research scholars and the research fellows in telecommunication
institutions are interested in using OCR system for processing the word document that
contains base paper for their research
 The Librarian to manage the information contents of the older books in building virtual
digital library requires use of OCR system.
 Various sites that vendor e-books have a huge requirement of this OCR system in- order
to scan all the books in to electronic format and thus make money. The Amazon book
world is largely using this concept to build their digital libraries.
Now we present the reading suggestions for the users or clients through which the user can better
understand the various phases of the product. These suggestions may be effective and useful for
the beginners of the product rather than the regular users such as research scholars, librarians and
administrators of various web-sites. With these suggestions, the user need not waste his time in
scrolling the documents up and down, browsing through the web, visiting libraries in search of
different books.
The following are the various reading suggestions that the user can follow in-order to completely
understand about our product and to save time:-
 It would help you if you start with Wikipedia.com. It lets you know the basic concept of
every keyword you require. First learn from it what is OCR? And how does it work based
on a Grid infrastructure?
 Now you can proceed your further reading with the introduction of our product we
provided in our documentation. From these two steps you completely get an in- depth
idea of the use of our product and several processes involved in it.
 The more you need is the implementation of the product. For this you can visit
FreeOCR.com where you can view how the sample OCR works and you can try it.
CHAPTER 2
FEASIBILITY STUDY
A feasibility study is a high-level capsule version of the entire System analysis and Design
Process. The study begins by classifying the problem definition. Feasibility is to determine if it’s
worth doing. Once an acceptance problem definition has been generated, the analyst develops a
logical model of the system. A search for alternatives is analyzed carefully. There are 3 parts in
feasibility study.
2.1 TECHNICAL FEASIBILITY
Evaluating the technical feasibility is the trickiest part of a feasibility study. This is because, at
this point in time, not too many detailed design of the system, making it difficult to access issues
like performance, costs on (on account of the kind of technology to be deployed) etc. A number
of issues have to be considered while doing a technical analysis. Understand the different
technologies involved in the proposed system before commencing the project we have to be very
clear about what are the technologies that are to be required for the development of the new
system. Find out whether the organization currently possesses the required technologies. Is the
required technology available with the organization?.
2.2 OPERATIONAL FEASIBILITY
Proposed project is beneficial only if it can be turned into information systems that will meet the
organizations operating requirements. Simply stated, this test of feasibility asks if the system will
work when it is developed and installed. Are there major barriers to Implementation? Here are
questions that will help test the operational feasibility of a project:
 Is there sufficient support for the project from management from users? If the current
system is well liked and used to the extent that persons will not be able to see reasons for
change, there may be resistance.
 Are the current business methods acceptable to the user? If they are not, Users may
welcome a change that will bring about a more operational and useful systems.
 Have the user been involved in the planning and development of the project?
 Early involvement reduces the chances of resistance to the system and in general and
increases the likelihood of successful project.
Since the proposed system was to help reduce the hardships encountered. In the existing manual
system, the new system was considered to be operational feasible.
2.3 ECONOMIC FEASIBILITY
Economic feasibility attempts to weigh the costs of developing and implementing a new system,
against the benefits that would accrue from having the new system in place. This feasibility study
gives the top management the economic justification for the new system. A simple economic
analysis which gives the actual comparison of costs and benefits are much more meaningful in
this case. In addition, this proves to be a useful point of reference to compare actual costs as the
project progresses. There could be various types of intangible benefits on account of automation.
These could include increased customer satisfaction, improvement in product quality better
decision making timeliness of information, expediting activities, improved accuracy of
operations, better documentation and record keeping, faster retrieval of information, better
employee morale.
2.4 TRAINING
Training is a very important process of working with a neural network. As seen from neural
networks, there are two forms of training that can be employed with a neural network. They are
namely Supervised training provides the neural network with training sets and the anticipated
output. Unsupervised training supplies the neural network with training sets, but there is no
anticipated output provided.
2.5 INTRODUCING KOHONEN NEURAL NETWORK
The Kohonen neural network differs considerably from the feed-forward back propagation
neural network. The Kohonen neural network differs both in how it is trained and how it recalls a
pattern. The Kohonen neural network does not use any sort of activation function. Further, the
Kohonen neural network does not use any sort of a bias weight. Output from the Kohonen neural
network does not consist of the output of several neurons. When a pattern is presented to a
Kohonen network one of the output neurons is selected as a "winner". This "winning" neuron is
the output from the Kohonen network. Often these "winning" neurons represent groups in the
data that is presented to the Kohonen network. For example, in an OCR program that uses 26
output neurons, the 26 output neurons map the input patterns into the 26 letters of the Latin
alphabet.
The most significant difference between the Kohonen neural network and the feed
forward back propagation neural network is that the Kohonen network trained in an unsupervised
mode. This means that the Kohonen network is presented with data, but the correct output that
corresponds to that data is not specified. Using the Kohonen network this data can be classified
into groups. We will begin our review of the Kohonen network by examining the training
process. It is also important to understand the limitations of the Kohonen neural network. Neural
networks with only two layers can only be applied to linearly separable problems. This is the
case with the Kohonen neural network. Kohonen neural networks are used because they are a
relatively simple network to construct that can be trained very rapidly.
A "feed forward" neural network is similar to the types of neural networks that we are ready
examined. Just like many other neural network types the feed forward neural network begins
with an input layer. This input layer must be connected to a hidden layer. This hidden layer can
then be connected to another hidden layer or directly to the output layer. There can be any
number of hidden layers so long as at least one hidden layer is provided. In common use most
neural networks will have only one hidden layer. It is very rare for a neural network to have
more than two hidden layers. We will now examine, in detail, and the structure of a "feed
forward neural network".
The Structure of a Feed Forward Neural Network
A "feed forward" neural network differs from the neural networks previously examined.
Figure 2.1 shows a typical feed forward neural network with a single hidden layer
The Input Layer
The input layer to the neural network is the conduct through which the external environment
presents a pattern to the neural network. Once a pattern is presented to the input layer of the
neural network the output layer will produce another pattern. In essence this is all the neural
network does. The input layer should represent the condition for which we are training the neural
network for. Every input neuron should represent some independent variable that has an
influence over the output of the neural network. It is important to remember that the inputs to the
neural network are floating point numbers. These values are expressed as the primitive Java data
type "double". This is not to say that you can only process numeric data with the neural network.
If you wish to process a form of data that is non-numeric you must develop a process that
normalizes this data to a numeric representation.
The Output Layer
The output layer of the neural network is what actually presents a pattern to the external
environment. Whatever patter is presented by the output layer can be directly traced back to the
input layer. The number of a output neurons should directly related to the type of work that the
neural network is to perform. To consider the number of neurons to use in your output layer you
must consider the intended use of the neural network. If the neural network is to be used to
classify items into groups, then it is often preferable to have one output neurons for each groups
that the item is to be assigned into. If the neural network is to perform noise reduction on a signal
then it is likely that the number of input neurons will match the number of output neurons. In this
sort of neural network you would one day he would want the patterns to leave the neural network
in the same format as they entered. For a specific example of how to choose the numbers of input
and output neurons consider a program that is used for optical character recognition, or OCR. To
determine the number of neurons used for the OCR example we will first consider the input
layer. The number of input neurons that we will use is the number of pixels that might represent
any given character. Characters processed by this program are normalized to universal size that
is represented by a 5x7 grid. A 5x7 grid contains a total of 35 pixels. The optical character
recognition program therefore has 35 input neurons. The number of output neurons used by the
OCR program will vary depending on how many characters the program has been trained for.
The default training file that is provided with the optical character recognition program is trained
to recognize 26 characters. As a result using this file the neural network would have 26 output
neurons.
CHAPTER 3
PROBLEM STATEMENT
The problem here is for the software systems to recognize characters in computer system when
information is scanned through paper documents as we know that we have number of
newspapers and books which are in printed format related to different subjects. Whenever we
scan the documents through the scanner, the documents are stored as images such as jpeg, gif
etc., in the computer system. These images cannot be read or edited by the user. But to reuse this
information it is very difficult to read the individual contents and searching the contents form
these documents line-by-line and word-by-word. These days there is a huge demand in “storing
the information available in these paper documents in to a computer storage disk and then later
editing or reusing this information by searching process”.
3.1 MODULES AND THEIR FUNCTIONALITIES
Our software system Optical Character Recognition on a grid infrastructure can be divided into
five modules based on its functionality.The modules classified are as follows:-
 Document Processing Module
 System Training Module.
 Document Recognition Module.
 Document Editing Module and
 Document Searching Module.
3.1.1 DOCUMENT PROCESSING MODULE
This module is accessed by administrator whose role in our application is a librarian.This module
perform certain activities such as scanning documents, storing them as images, recognizing
characters in images to transfer them into word format. During the recognition process, this
module uses the OCR methodology in support of grid infrastructure datastructure. The module
supports the following services:-
 Scanning printed documents.
 Storing the documents as snapshots or images.
 Processing those image-based documents.

 Converting these image-based documents into e-documents(also called structured
documents).
 Recognizing the characters in documents.
 Generating grid infrastructure datastructure.
3.1.2 SYSTEM TRAINING MODULE
This module can be accessed by both the administrator and the end-user. Before converting the
printed documents in to editable and searchable documents, the first and the mandatory step is
providing training to the system. Here training in the sense the font followed in the scanned
document should be identified by the user. Then the user types all the characters that are required
for recognition from the scanned document as an image file. This image file should be provided
as an input during the training process. The user then clicks the train button provided in the
recognition module. Then the training gets completed. Thus the system gets familiar with the
new font. This module supports:-
 Training the system with the pre-defined fonts.
 Training the system with the new fonts that are not present in the system and that cannot
be identified by the system.
3.1.3 DOCUMENT RECOGNITION MODULE
This module can be accessed by both the administrator and the end-user. Once the printed
documents are converted into structured documents, any user can recognize the characters
present in the document. That means the user can recognize the characters of any language he
chooses which makes OCR more flexible. This flexibility is due to the adaptation of grid
infrastructure. This is the module where the main functionality of OCR is tested. Under this
module, there are two types of recognition. They are handwritten recognition and scanned
document recognition. In handwritten recognition, the handwriting of the user in any language is
trained to the system only for the first time. From there on-wards, the system recognizes the
characters or words written by the user. Thus handwritten document recognition recognizes the
human handwriting. In scanned document recognition, the system is first trained with the font
characters in the document in the training module itself. Now in the recognition module, the
system takes the scanned documents image as an input file, first crops the image and then
extracts/recognizes the characters from the document and makes these documents editable and
searchable. Thus the scanned document recognition recognizes the chracters from the scanned
document image and makes the document editable and searchable.
Hence the document recogniiton module on a whole supports the following services:-
 Converts the document into specific format
 Recognizes the characters
 Heterogeneous character Recognition
3.1.4 DOCUMENT EDITING MODULE
This module can be accessed by both the administrator and the end-user during document editing
to implement the character recogniiton process. Once the scanned documents are stored, they
reside in computer memory. This data resides in the form of an image that is just viewable in an
image viewer. Hence, the document is first coverted into a form such that it is editable. The
desired form of the document may be MS-Word,Text,… as specified by the user.
The objective of this module is to let the user perform :-
 Addition of specific content to the documents
 Deletion of certain content from documents
 Any other modification of documents.
3.1.5 DOCUMENT SEARCHING MODULE
This module can be accessed by both the administrator and the end-user during the search of the
user required document to implement the character recogniiton process on it. The user requests
the system to search for a particular document. Then the system finds the documents based on
OCR methodology and returns the result.
CHAPTER 4
DATA FLOW DIAGRAM
The DFD is also called as bubble chart. A data-flow diagram (DFD) is a graphical representation
of the "flow" of data through an information system. DFD’s can also be used for the
visualization of data processing. The flow of data in our system can be described in the form of
dataflow diagram as follows:-
Firstly, if the user is administrator he can initialize the following actions:-
 Document processing
 Document search
 Document editing.
All the above actions come under 2cases.They are described as follows:-
 If the printed document is a new document that is not yet read into the system, then the
document processing phase reads the scanned document as an image only and then
produces the document image stored in computer memory as a result.
 Now the document processing phase has the document at its hand and can read the
document at any point of time. Later the document processing phase proceeds with
recognizing the document using OCR methodology and the grid infrastructures. Thus it
produces the documents with the recognized characters as final output which can be later
searched and edited by the end-user or administrator.
 If the printed document is already scanned in and is held in system memory, then the
document processing phase proceeds with document recognition using OCR
methodology and grid infrastructure. And thus it finally produces the document with
recognized documents as output.
If the user using the OCR system is the end-user, then he can perform the following actions:-
1. Document Searching:- The documents which are recognized can be searched by the user
whenever required by requesting from the system database.
2. Document Editing:- The recognized documents can be edited by adding the specific
content to the document, deleting specific content from the document and modifying the
document.
4.1 UML DIAGRAMS
UML combines best techniques from data modeling (entity relationship diagrams), business
modeling (work flows), object modeling, and component modeling. It can be used with all
processes, throughout the software development life cycle, and across different implementation
technologies. UML has 14 types of diagrams divided into two categories. Seven diagram types
represent structural information, and the other seven represent general types of behavior,
including four that represent different aspects of interactions. Some of these diagrams we
provided to describe the design and implementation of our OCR system can be categorized
hierarchically as below:-
 Use case diagram
 Class diagram
 Sequence diagram
 Collaboration diagram
 Activity diagram
 Component diagram
 Deployment diagram
4.1.1 USE-CASE DIAGRAMS
Our software system can be used to support library environment to create a Digital Library
where several paper documents are converted into electronic-form for accessing by the users. For
this purpose the printed documents must be recognized before they are converted into electronic-
form. The resulting electronic-documents are accessed by the users like faculty and students for
reading and editing. Now according to this information, the following are the different actors
involved in implementing our OCR system:-
 If we consider for virtual digital library, the Administrator can be the Librarian and the
End-users can be Students or/and Faculty.
 The following are the list of use diagrams that altogether form the complete or the overall
use-case diagram. They are listed below:-
1) Use-case diagram for document processing
2) Use-case diagram for neural network training

3) Use-case diagram for document recognition
4) Use-case diagram for document editing
5) Use-case diagram for document searching
In each of the use-case diagrams below we clearly explained about that particular use- case
functionality. In this we provided a description about the
 Use-case name
 Details about the use-case
 Actors using this use-case
 The flow of events carried out by the use-case
 The conditions that occur in this use-case
4.2 CLASS DIAGRAMS
The class diagram is the main building block in object oriented modeling. The classes in a class
diagram represent both the main objects and or interactions in the application and the objects to
be programmed. The class diagram of our OCR system consists of 9classes. They are
 MainScreen
 Editor
 HelpFrame
 Document
 HEntry
 Entry
 TrainingSet
 KohonenNetwork
 PrintedFrame.
Among all these classes the MainScreen is the main class that represents all the major functions
carried out by our OCR system. The MainScreen class has an association with five classes viz.,
Editor, HelpFrame, Document, TrainingSet, PrintedFrame. And the TrainingSet class in-turn has
an association with the HEntry and the KohonenNetwork classes. The PrintedFrame has an
association with the Entry and KohonenNetwork classes.
Figure 4.2.1:Class Diagram

4.2.1 SEQUENCE DIAGRAMS
Sequence diagrams are sometimes called Event-trace diagrams, event scenarios, and timing
diagrams. A sequence diagram shows, as parallel vertical lines (lifelines), different processes or
objects that live simultaneously, and, as horizontal arrows, the messages exchanged between
them, in the order in which they occur. This allows the specification of simple runtime scenarios
in a graphical manner. In sequence diagram, the class objects that are used to describe the
interaction between various classes vary from one function to another function. There are five
sequence diagrams short-listed below for presenting the sequence of actions performed by each
of the five modules. The key class object involved in all of these module functions is
MainScreen class which controls the interaction among various class objects.
Sequence Diagram for Document Processing
 Objects
• Administrator-
• “a” MainScreen-
• “m” Document-
• “d” –
• SystemMemory –
• “s”-
 Links
• Administrator object to MainScreen object

• MainScreen object to Document object.
• Document object to SystemMemory object.
• SystemMemory object to Administrator object.
• Scans
• Stores documents
• Stores
• Returns the processed documents
a:Administraror m:MainSreen d:Document
5.Stores
Figure 4.2.2: Sequence Diagram for Processing
Sequence Diagram for System Training

 Objects
 Administrator –
 “a” System –
 “s” TrainingSet –
 “t”
 Links
 Administrator object to System object
 System object to TrainingSet object
 TrainingSet object to System object
 System object to Administrator object
 Messages
 Specifies the font characters
 Stores it as an image
 Trains the system with new font
 System recognizes new font and returns for user
Figure 4.2.3:Sequence Diagram for Training

Sequence Diagram for Document Recognition
 Objects
 Administrator –
 “a” MainScreen –
 “m” SystemMemory –
 “s” TrainingSet –
 “t”
 Links
 Administrator object to MainScreen object
 MainScreen object to SystemMemory object
 SystemMemory object to MainScreen object
 MainScreen object to TrainingSet object
 TrainingSet object to MainScreen object
 MainScreen object to Administrator object
 Messages
 Recognize document
 Store processed document
 Read file image
 Recognize using ocr
 Send processed document
 Recognize the characters
Figure 4.2.4:Sequence Diagram for Recognition
Sequence Diagram for Document Editing

1. Object
Administrator
“a” MainScreen –
“m” Document –
“d” SystemMemory –
“s”
2. Links
1. Administrator object to MainScreen object.
2. MainScreen object to Document object.
3. MainScreen object to Document object
5. Document object to SystemMemory object.
6. SystemMemory object to Administrator object.
3. Messages
1. Edit document
2. Adding document
3. Adds
4. Deleting document
5. Deletes
6. Modifying document
7. Modifies
8. Stores the edited documents
9. Administrator accesses the edited documents
s:SystemMemory
1.Edit document
2.Adding document
4.Deleting content
6.Modifing content
8.Stores the edited documents
9.Administrator accesses the edited documents
Figure 4.2.5:Sequence Diagram for Editing

Sequence Diagram for Document Searching
Objects
Administrator –
“a” MainScreen –
“m” Document –
“d”
Links
1. Administrator object to MainScreen object
3. Document object to Administrator object
Messages
1. Specifies the word
2. Searches the word
3. Searches
4. Returns the location of the word
4.2.5 ACTIVITY DIAGRAMS
The purpose of activity diagram is to provide a view of flows and what is going on inside a use
case or among several classes. Activity diagram can also be used to represent a class’s method
implementation. A token represents an operation. An activity is shown as a round box containing
the name of the operation. An outgoing solid arrow attached to the end of activity symbol
indicates a transition triggered by the completion.
In the diagram of our OCR system, the server role is played by admin called Librarian. There can
be N number of clients who can access the digital library data content at a time. The clients here
may be either the students or the faculty or the both.
Figure 4.2.5:Activity Diagram For Processing
[ Document does not exist ]
[ Document exists ]
user
Figure 4.2.6:Activity Diagram for document Retrieval

Edit
documents
Add document [ user choses add ] [ user choses delete ] Delete document
content content
[ user choses modify ]
Modify
document
Store
documents
Figure4.2.7:Activity Diagram for Document Storage

4.2.8 COMPONENT DIAGRAM
The crucial component in our component diagram that plays a major role in implementing the
OCR system is the GUI component. All other components that is Document processing and
recognition, Document editing and Document Searching depends on it. They are as follows:-
GUI Component that is used to design GUI screens for interacting with the end-user and
administrator.
From the GUI component other components functionalities are carried out. The functionalities
include Document processing and recognition, Document editing and Document Searching
Figure 4.2.8:Component Diagram

4.2.9 DEPLOYMENT DIAGRAM
A deployment diagram serves to model the physical deployment of artifacts on deployment

targets. Deployment diagrams show "the allocation of Artifacts to Nodes according to the
Deployments defined between them.”.
In the deployment diagram of our OCR system, the server role is played by admin called
Librarian. There can be N number of clients who can access the digital library data content at a
time. The clients here may be either the students or the faculty or the both.
• The actions performed by the Administrator are document processing, searching and
editing where as the actions performed by the end-user are only document searching and editing.
Figure4.2.9:Deployment Diagram
CHAPTER 5
PLATFORM/TOOLS USED
5.1 SOFTWARE REQUIREMENTS SPECIFICATION

 Operating System : Windows-XP
 Programming Language : Core Java
 User Interface : Swings
5.2 HARDWARE REQUIREMENTS SPECIFICATION
 Processor : Pentium IV processor or higher
 RAM : Minimum of 512 MB RAM
 Memory : 500 MB or higher
CHAPTER 6
CONCLUSION AND FUTURE SCOPE
What does the future hold for OCR? Given enough entrepreneurial designers and sufficient
research and development dollars, OCR can become a powerful tool for future data entry
applications. However, the limited availability of funds in a capital-short environment could
restrict the growth of this technology. But, given the proper impetus and encouragement, a lot of
benefits can be provided by the OCR system. They are
 The automated entry of data by OCR is one of the most attractive, labor reducing
technology
 The recognition of new font characters by the system is very easy and quick.
 We can edit the information of the documents more conveniently and we can reuse the
edited information as and when required.
 The extension to software other than editing and searching is topic for future work
 The Grid infrastructure used in the implementation of Optical Character Recognition
system can be efficiently used to speed up the translation of image based documents into
structured documents that are currently easy to discover, search and process.
FUTURE ENHANCEMENTS
 The Optical Character Recognition software can be enhanced in the future in different
kinds of ways such as:
 Many applications exist where it would be desirable to read handwritten entries. Reading
handwriting is a very difficult task considering the diversities that exist in ordinary
penmanship. However, progress is being made
REFERENCES
[1] On Demand, HPE Haven. "OCR Document".
[2] On Demand, HPE Haven. "undefined".
[3] Schantz, Herbert F. (1982). The history of OCR, optical character recognition.
[Manchester Center, Vt.]: Recognition Technologies Users Association.
[4] Albe, E. E. F. (July 1, 1914). "On a Type-Reading Optophone". Proceedings of the Royal
Society A Mathematical, Physical and Engineering Sciences.
[5] Extracting text from images using OCR on Android". June 27, 2015.
[6] OCR on Google Glass". October 23, 2014.
[7] The History of OCR". Data Processing Magazine. 12: 46. 1970.
[8] PrintToBraille Tool. "ocr-gui-frontend". MILE Lab, Dept of EE, IISc. Archived from the
original on December 25, 2014. Retrieved December 7, 2014.
[9] Using OCR and Entity Extraction for LinkedIn Company Lookup". July 22, 2014.
[10] How To Crack Captchas".. June 28, 2006. Retrieved June 16, 2013.
[11] Breaking a Visual CAPTCHA " December 10, 2002. Retrieved June 16,2013.
[12] John Resig (January 23, 2009). "John Resig – OCR and Neural Nets in JavaScript".
Ejohn.org. Retrieved June 16, 2013.
[13] The state of the art in online handwriting recognition". IEEE Transactions on Pattern
Analysis and Machine Intelligence. .
[14] "Optical Character Recognition (OCR) – How it works". Nicomsoft.com. Retrieved June
16, 2013.
[15] "Survey over image thresholding techniques and quantitative performance
evaluation" (PDF). Journal of Electronic Imaging

Sem Soni

Enviado por

Dados do documento

Título original

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

Sem Soni

Enviado por

Direitos autorais:

Formatos disponíveis

CHAPTER 1

 The primary objective is to speed up the process of character recognition in document

 Since our character recognition is based on a grid infrastructure, it aims to recognize

1.2 PROJECT SCOPE

1.3 EXISTING SYSTEM

1.4 DRAWBACK OF EXISTING SYSTEM

1.5 PROPOSED SYSTEM

16 BENEFIT OF PROPOSED SYSTEM

1.7 ARCHITECTURE OF THE PROPOSED SYSTEM

 OCR Hardware or Software

Figure.1: OCR Architecture

2.1 TECHNICAL FEASIBILITY

2.2 OPERATIONAL FEASIBILITY

2.5 INTRODUCING KOHONEN NEURAL NETWORK

The Structure of a Feed Forward Neural Network

The Input Layer

The Output Layer

3.1 MODULES AND THEIR FUNCTIONALITIES

 Document Processing Module

 System Training Module.

 Document Recognition Module.

 Document Editing Module and

 Document Searching Module.

3.1.1 DOCUMENT PROCESSING MODULE

 Scanning printed documents.

 Storing the documents as snapshots or images.

 Processing those image-based documents.

 Recognizing the characters in documents.

 Generating grid infrastructure datastructure.

3.1.2 SYSTEM TRAINING MODULE

 Training the system with the pre-defined fonts.

3.1.3 DOCUMENT RECOGNITION MODULE

 Converts the document into specific format

 Recognizes the characters

 Heterogeneous character Recognition

3.1.4 DOCUMENT EDITING MODULE

The objective of this module is to let the user perform :-

 Addition of specific content to the documents

 Deletion of certain content from documents

 Any other modification of documents.

3.1.5 DOCUMENT SEARCHING MODULE

DATA FLOW DIAGRAM

 Use case diagram

4.1.1 USE-CASE DIAGRAMS

1) Use-case diagram for document processing

2) Use-case diagram for neural network training

4) Use-case diagram for document editing

5) Use-case diagram for document searching

 Details about the use-case

 Actors using this use-case

 The flow of events carried out by the use-case

 The conditions that occur in this use-case

4.2 CLASS DIAGRAMS

Figure 4.2.1:Class Diagram

Sequence Diagram for Document Processing

• Administrator object to MainScreen object

Figure 4.2.2: Sequence Diagram for Processing

Sequence Diagram for System Training

Figure 4.2.3:Sequence Diagram for Training

Sequence Diagram for Document Editing

8.Stores the edited documents

9.Administrator accesses the edited documents

Figure 4.2.5:Sequence Diagram for Editing