Using data mining for digital ink recognition: Dividing text and shapes in sketched diagrams

Rachel Blagojevic, Beryl Plimmer, John Grundy, Yong Wang

Research output: Contribution to journalArticleResearchpeer-review

17 Citations (Scopus)


The low accuracy rates of textshape dividers for digital ink diagrams are hindering their use in real world applications. While recognition of handwriting is well advanced and there have been many recognition approaches proposed for hand drawn sketches, there has been less attention on the division of text and drawing ink. Feature based recognition is a common approach for textshape division. However, the choice of features and algorithms are critical to the success of the recognition. We propose the use of data mining techniques to build more accurate textshape dividers. A comparative study is used to systematically identify the algorithms best suited for the specific problem. We have generated dividers using data mining with diagrams from three domains and a comprehensive ink feature library. The extensive evaluation on diagrams from six different domains has shown that our resulting dividers, using LADTree and LogitBoost, are significantly more accurate than three existing dividers.

Original languageEnglish
Pages (from-to)976-991
Number of pages16
JournalComputers and Graphics
Issue number5
Publication statusPublished - Oct 2011
Externally publishedYes


  • Pen-based interfaces
  • Recognition algorithms
  • Sketch recognition
  • Sketch tools

Cite this