I'm Working On Optical Character Recognition.In our work we need automatically generate some rendered word image and we need each characters location(boundary) in rendered word image. this meta data about rendered image is called ground truth. How can I do that?

有帮助吗?

解决方案

I found a rendering c api called Pango wich has a function named pango_layout_Iter_get_char_extent() that can be used for that.

https://developer.gnome.org/pango/stable/pango-Layout-Objects.html#pango-layout-iter-get-char-extents

许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top