Difference between revisions of "Tristan-cs488-tasks"
(surprising discovery) |
|||
Line 1: | Line 1: | ||
− | Set of discrete tasks for '''''Schrift''''' (the name of the proposed handwriting recognizer). | + | Set of discrete tasks for '''''Schrift''''' (the name of the proposed handwriting recognizer). These notes and more are are kept on paper. Strikethrough tasks are done or no longer necessary, hopefully dated entries will make it clear which. This page might get a little messy. |
− | These notes and more are are kept on paper. | ||
* '''File structuring''' | * '''File structuring''' | ||
Line 9: | Line 8: | ||
**<span style="text-decoration: line-through;">OpenCV image import, test threshold filter</span> | **<span style="text-decoration: line-through;">OpenCV image import, test threshold filter</span> | ||
** Algorithm for figuring out correct threshold value. Take a mean of the page color and adjust for that value. Usually 110±10? | ** Algorithm for figuring out correct threshold value. Take a mean of the page color and adjust for that value. Usually 110±10? | ||
− | ** Draw box around where dark is detected, extract that selection. | + | ** <span style="text-decoration: line-through;">Draw box around where dark is detected, extract that selection. </span> |
− | * '''Line segmentation''' | + | |
+ | *<span style="text-decoration: line-through;"> '''Line segmentation''' | ||
** Decided it was not necessary to find beginning and end of line | ** Decided it was not necessary to find beginning and end of line | ||
** General Algorithm | ** General Algorithm | ||
Line 17: | Line 17: | ||
*** Stop when line of nothing | *** Stop when line of nothing | ||
*** create selection between line1 and line2 | *** create selection between line1 and line2 | ||
− | *** push to dynamic array as line[0] | + | *** push to dynamic array as line[0]</span> |
− | + | * <span style="text-decoration: line-through;">'''Word segmentation''' | |
− | * '''Word segmentation''' | + | ** Outline algorithm similar to line segmentation</span> |
− | ** Outline algorithm similar to line segmentation | + | * <span style="text-decoration: line-through;">'''Character segmentation''' |
− | + | **openCV historgram test </span> | |
− | * '''Character segmentation''' | ||
− | **openCV historgram test | ||
− | 11-26-12: I tried an openCV example that makes and highlights bounding box and bounding polygon regions, it gave me this [[Media: Surprising.png | image]]. Suddenly I'm on the recognition step?? | + | *'''11-26-12''': I tried an openCV example that makes and highlights bounding box and bounding polygon regions, it gave me this [[Media: Surprising.png | image]]. Suddenly I'm on the recognition step?? |
+ | *'''11-27-12''': The vector that the contours are in are stored in an unusual order, if I sort them by the rectangles bottom left coordinate, a count of lines could be established and I could get the letters into their correct order. Once in order I can detect spaces easily by looking at changes between characters in the x direction. line and word ''segmentation'' are basically kaput, instead I have line and word ''detection''. | ||
* '''Character recognition''' | * '''Character recognition''' | ||
** Figure out a basic neural network | ** Figure out a basic neural network | ||
*** look atOpenCV neural network type | *** look atOpenCV neural network type | ||
− | *** I've found some NN tutorials, the implementation of one is what's bothering me. | + | *** <span style="text-decoration: line-through;">I've found some NN tutorials, the implementation of one is what's bothering me.</span> |
Revision as of 00:17, 28 November 2012
Set of discrete tasks for Schrift (the name of the proposed handwriting recognizer). These notes and more are kept on paper. Strikethrough tasks are done or no longer necessary; hopefully the dated entries will make it clear which. This page might get a little messy.
- File structuring
- just doing it, will restructure later.
- Image importing
- Read in jpg image
- Text Scan
- OpenCV image import, test threshold filter
- Algorithm for finding the correct threshold value: take the mean of the page color and adjust the threshold relative to that value. Usually 110±10?
- Draw box around where dark is detected, extract that selection.
- Line segmentation
- Decided it was not necessary to find beginning and end of line
- General Algorithm
- Scan from top line by line
- Stop when line of nothing
- create selection between line1 and line2
- push to dynamic array as line[0]
- Word segmentation
- Outline algorithm similar to line segmentation
- Character segmentation
- OpenCV histogram test
- 11-26-12: I tried an OpenCV example that makes and highlights bounding box and bounding polygon regions; it gave me this image. Suddenly I'm on the recognition step??
- 11-27-12: The vector that holds the contours stores them in an unusual order. If I sort them by each rectangle's bottom-left coordinate, a count of lines could be established and I could get the letters into their correct order. Once they are in order, I can detect spaces easily by looking at changes between characters in the x direction. Line and word segmentation are basically kaput; instead I have line and word detection.
- Character recognition
- Figure out a basic neural network
- look at OpenCV neural network type
- I've found some NN tutorials; implementing one is what's bothering me.
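The Text Scan note about deriving the threshold from the mean page color could look roughly like the sketch below. It uses plain Python lists in place of an OpenCV image so it runs on its own. The names `estimate_threshold` and `binarize`, and the `ink_fraction` knob, are assumptions made for illustration and are not from the notes; the right adjustment would have to be tuned against real scans.

```python
from statistics import mean


def estimate_threshold(gray, ink_fraction=0.6):
    """Guess a binarization threshold from the mean page brightness.

    gray is a list of rows of 0-255 intensities. ink_fraction is a
    hypothetical tuning knob, not a value taken from the notes.
    """
    page_mean = mean(v for row in gray for v in row)
    return page_mean * ink_fraction


def binarize(gray, threshold):
    """Return 1 where a pixel is darker than the threshold (ink), else 0."""
    return [[1 if v < threshold else 0 for v in row] for row in gray]


# Tiny made-up "page": light paper with two dark pixels.
page = [
    [200, 200, 200, 200],
    [200,  30,  40, 200],
    [200, 200, 200, 200],
]
t = estimate_threshold(page)
ink = binarize(page, t)
```

Scaling the mean, rather than using a fixed constant, is one way to let the threshold follow darker or lighter paper; a fixed offset from the mean would be an equally plausible reading of the note.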
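The "General Algorithm" under Line segmentation (scan from the top row by row, stop at a row of nothing, select the band in between, push it onto a dynamic array) can be sketched as follows. This is a minimal illustration on a binary image stored as nested lists, where 1 means a dark pixel; `segment_lines` is a hypothetical name, and the notes do not say how row ranges were actually represented.

```python
def segment_lines(ink):
    """Return (top, bottom) row ranges, bottom exclusive, one per text line.

    ink is a binary image as a list of rows, where 1 marks a dark pixel.
    """
    lines = []
    start = None
    for y, row in enumerate(ink):
        has_ink = any(row)
        if has_ink and start is None:
            start = y                  # first inked row of a new line
        elif not has_ink and start is not None:
            lines.append((start, y))   # a blank row closes the line
            start = None
    if start is not None:              # text runs to the bottom edge
        lines.append((start, len(ink)))
    return lines


# Two made-up text lines separated by one blank row.
ink = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 0],
]
lines = segment_lines(ink)
```

Each tuple can then be used to slice the rows of one text line out of the image. The same scan applied to columns within a line would give the "similar" word segmentation the notes mention.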
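The 11-27-12 idea of sorting contour bounding boxes to recover lines, letter order, and spaces might be sketched like this. It assumes each box is an `(x, y, w, h)` tuple with a top-left origin, so the bottom edge is `y + h`. The function name `reading_order` and the `space_gap` threshold are assumptions for illustration; the grouping rule (a box starts a new line when its top lies at or below the previous box's bottom) is one simple choice, not necessarily what the project ended up using.

```python
def reading_order(boxes, space_gap=8):
    """Group (x, y, w, h) boxes into lines, then words, in reading order.

    space_gap is a hypothetical pixel gap above which two neighbouring
    characters are treated as belonging to different words.
    """
    lines = []
    line_bottom = None
    # Sort by bottom edge first, then by left edge.
    for box in sorted(boxes, key=lambda b: (b[1] + b[3], b[0])):
        x, y, w, h = box
        if line_bottom is None or y >= line_bottom:
            lines.append([])           # box sits below the current line
        line_bottom = y + h            # bottoms are non-decreasing here
        lines[-1].append(box)

    result = []
    for line in lines:
        line.sort(key=lambda b: b[0])  # left to right within the line
        words = [[line[0]]]
        for prev, cur in zip(line, line[1:]):
            gap = cur[0] - (prev[0] + prev[2])
            if gap > space_gap:
                words.append([])       # wide gap: start a new word
            words[-1].append(cur)
        result.append(words)
    return result


# Made-up boxes in scrambled order: two lines, first line has two words.
boxes = [
    (17, 40, 10, 12),
    (0, 5, 10, 12),
    (40, 6, 10, 11),
    (12, 4, 10, 13),
    (5, 41, 10, 11),
]
order = reading_order(boxes)
```

The result is a list of lines, each a list of words, each a list of character boxes, which matches the "line and word detection" described in the entry.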