Project Bhashitha - mobile based optical character recognition and text-to-speech system

D. S.S. De Zoysa, J. M. Sampath, E. M.P. De Seram, D. M.I.D. Dissanayake, L. Wijerathna, S. Thelijjagoda

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › Research

3 Citations (Scopus)

Abstract

In the modern era, when computers play a vital role in people's day-to-day activities, visually impaired people still face numerous problems when accessing printed text with existing technologies. This gives rise to the need for improved devices that could ease the tasks blind people have to go through. With the digitization of books, industry and research labs have made many excellent attempts at building robust document analysis systems, but these are only for those who are able to see. 'Bhashitha' is an Android-based mobile application that combines OCR and TTS for the Sinhala, Tamil and English languages in a single product, resolving problems in existing systems. To use the proposed system, the user acquires a printed document as an optical image using the camera of the mobile phone. Image skew, caused by the angle at which the document is viewed, drastically reduces OCR accuracy. Therefore, after image skew detection, the optical image is passed to the OCR engine, which converts it into character streams representing the letters of the recognized words. Finally, the converted text output is accessed by the TTS system, which converts the textual content into voice output. Additionally, the system includes an audio assist system that helps differently abled users navigate through the pages of the application. This is an easier, more portable and faster solution compared to the existing systems made for the visually impaired.
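The abstract does not say which skew detection method the system uses. As a minimal sketch of the general idea only, the Python below estimates skew with a projection profile search, a common textbook technique swapped in here for illustration. The function names `estimate_skew_degrees` and `synthetic_page`, the angle range, the step size and the synthetic test page are all assumptions, not details from the paper.

```python
import math


def estimate_skew_degrees(pixels, max_angle=15.0, step=0.5):
    """Estimate text skew from foreground (ink) pixel coordinates.

    pixels: iterable of (x, y) ink positions in image coordinates.
    Returns the candidate angle, in degrees, at which the text lines
    collapse into the sharpest horizontal projection profile.
    Illustrative only: not the method described in the paper.
    """
    pixels = list(pixels)
    best_angle, best_score = 0.0, -1.0
    steps = int(round(2 * max_angle / step))
    for i in range(steps + 1):
        angle = -max_angle + i * step
        t = math.radians(angle)
        sin_t, cos_t = math.sin(t), math.cos(t)
        rows = {}
        for x, y in pixels:
            # Row index of the pixel after undoing a skew of `angle`.
            row = int(round(y * cos_t - x * sin_t))
            rows[row] = rows.get(row, 0) + 1
        # Well-aligned text concentrates ink in few rows, which
        # maximizes the sum of squared row counts.
        score = sum(count * count for count in rows.values())
        if score > best_score:
            best_angle, best_score = angle, score
    return best_angle


def synthetic_page(skew_degrees, lines=8, width=600, spacing=20):
    """Build a hypothetical test page: thin text lines tilted by a known angle."""
    slope = math.tan(math.radians(skew_degrees))
    return [
        (x, int(round(k * spacing + x * slope)))
        for k in range(1, lines + 1)
        for x in range(width)
    ]


# Usage: the estimate should land close to the 4 degree skew built into the page.
page = synthetic_page(4.0)
print(estimate_skew_degrees(page))
```

Once an angle has been estimated, the image would be rotated by the opposite amount before it is handed to the OCR engine. A production implementation would more likely use an image-processing library on the binarized camera frame than pure Python loops.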

Original language: English
Title of host publication: The 13th International Conference on Computer Science & Education, ICCSE 2018
Editors: Wang Qing, Zhou Wei
Place of Publication: Piscataway NJ USA
Publisher: IEEE, Institute of Electrical and Electronics Engineers
Pages: 623-628
Number of pages: 6
ISBN (Electronic): 9781538654958, 9781538654941
ISBN (Print): 9781538654941
Publication status: Published - 2018
Externally published: Yes
Event: International Conference on Computer Science and Education 2018 - Colombo, Sri Lanka
Duration: 8 Aug 2018 to 11 Aug 2018
Conference number: 13th
https://ieeexplore.ieee.org/xpl/conhome/8456809/proceeding (Proceedings)

Publication series

Name: 13th International Conference on Computer Science and Education, ICCSE 2018
Publisher: IEEE, Institute of Electrical and Electronics Engineers
ISSN (Electronic): 2473-9464

Conference

Conference: International Conference on Computer Science and Education 2018
Abbreviated title: ICCSE 2018
Country/Territory: Sri Lanka
City: Colombo
Period: 8/08/18 to 11/08/18

Keywords

  • Image Processing
  • Image Skew Detection and Correction
  • Optical Character Recognition (OCR)
  • Sinhala
  • Text-to-Speech (TTS)
  • Visually Disabled Users
