Please use this identifier to cite or link to this item:
https://repository.iimb.ac.in/handle/2074/19922
Title: | Conversion of handwritten forms to excel sheet | Authors: | Nikitha, Desari Saha, Anusmita |
Keywords: | Data processing;Digital conversion;Character recognition;Digitization;Optical character recognition;OCR | Issue Date: | 2019 | Publisher: | Indian Institute of Management Bangalore | Series/Report no.: | PGP_CCS_P19_050 | Abstract: | In this project initially, we have done some literature survey and have done extensive secondary research to get an overall idea from scratch. In this process, we found out different ways to process the data. Such as OCR can be done by using classifier or by using neural networks. In classifier, there are a different kind of classifiers such as supervised: K nearest neighbor, minimum distance to mean, SVM, bays classifier, unsupervised: k means clustering. We studied neural networks also for optical character recognition as it is suitable for big data set and image recognition. We got in-depth knowledge about classifiers. On the other hand, we also had researched neural networks. There are different inbuilt software in different packages, which can also be used for optical character recognition. We studied thoroughly about all the tools and software which can be used for optical character recognition, which helped us to make the project success. For optical character recognition, we need to train a model which can identify the letters with high accuracy. Hence we collected the handwritten data. We collected these data from different people with different handwriting using a pen and paper. Only regular handwritten pages were not sufficient as people have cursive handwriting also, and it is difficult to identify those letters and word. Hence, we collected cursive handwriting data to made the model better. On the other hand, rather than collecting the data, lakhs of data can be generated by augmentation. Where you give a letter input and with distortion, it changes the orientation and curves shape of the letter and in this way numbers of data can be generated. For neural network, usually massive amount of data is required for processing | URI: | https://repository.iimb.ac.in/handle/2074/19922 |
Appears in Collections: | 2019 |
Files in This Item:
File | Size | Format | |
---|---|---|---|
PGP_CCS_P19_050.pdf | 5.52 MB | Adobe PDF | View/Open Request a copy |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.