Difference between revisions of "Tristan-cs488-tasks"

From Earlham CS Department
Jump to navigation Jump to search
(surprising discovery)
Line 3: Line 3:
  
 
* '''File structuring'''
 
* '''File structuring'''
**  
+
** just doing it, will restructure later.
 
*<span style="text-decoration: line-through;">Image importing
 
*<span style="text-decoration: line-through;">Image importing
 
**Read in jpg image</span>
 
**Read in jpg image</span>
Line 10: Line 10:
 
** Algorithm for figuring out correct threshold value. Take a mean of the page color and adjust for that value. Usually 110±10?
 
** Algorithm for figuring out correct threshold value. Take a mean of the page color and adjust for that value. Usually 110±10?
 
** Draw box around where dark is detected, extract that selection.  
 
** Draw box around where dark is detected, extract that selection.  
 +
 
* '''Line segmentation'''
 
* '''Line segmentation'''
 
** Decided it was not necessary to find beginning and end of line
 
** Decided it was not necessary to find beginning and end of line
Line 17: Line 18:
 
*** create selection between line1 and line2
 
*** create selection between line1 and line2
 
*** push to dynamic array as line[0]
 
*** push to dynamic array as line[0]
 +
 
* '''Word segmentation'''
 
* '''Word segmentation'''
 
** Outline algorithm similar to line segmentation
 
** Outline algorithm similar to line segmentation
 +
 
* '''Character segmentation'''
 
* '''Character segmentation'''
 
**openCV historgram test
 
**openCV historgram test
 +
 +
11-26-12: I tried an openCV example that makes and highlights bounding box and bounding polygon regions, it gave me this [[Media: Surprising.png | image]]. Suddenly I'm on the recognition step??
 +
 
* '''Character recognition'''
 
* '''Character recognition'''
 
** Figure out a basic neural network
 
** Figure out a basic neural network
 
*** look atOpenCV neural network type
 
*** look atOpenCV neural network type
 
*** I've found some NN tutorials, the implementation of one is what's bothering me.
 
*** I've found some NN tutorials, the implementation of one is what's bothering me.

Revision as of 23:13, 26 November 2012

Set of discrete tasks for Schrift (the name of the proposed handwriting recognizer).
These notes and more are are kept on paper.

  • File structuring
    • just doing it, will restructure later.
  • Image importing
    • Read in jpg image
  • Text Scan
    • OpenCV image import, test threshold filter
    • Algorithm for figuring out correct threshold value. Take a mean of the page color and adjust for that value. Usually 110±10?
    • Draw box around where dark is detected, extract that selection.
  • Line segmentation
    • Decided it was not necessary to find beginning and end of line
    • General Algorithm
      • Scan from top line by line
      • Stop when line of nothing
      • create selection between line1 and line2
      • push to dynamic array as line[0]
  • Word segmentation
    • Outline algorithm similar to line segmentation
  • Character segmentation
    • openCV historgram test

11-26-12: I tried an openCV example that makes and highlights bounding box and bounding polygon regions, it gave me this image. Suddenly I'm on the recognition step??

  • Character recognition
    • Figure out a basic neural network
      • look atOpenCV neural network type
      • I've found some NN tutorials, the implementation of one is what's bothering me.