我正在写一个文本识别程序,我在排序等值线时遇到了问题。程序对于一行文本工作得很好,但是当涉及到整个文本块时,我的程序在80%的时间内不能检测到文本行。什么是真正有效的方法来提取一行文本,然后提取所有其他行(一次提取一行)?
我想要实现的目标:
发布于 2018-06-10 03:47:59
要实现这一点,需要执行一系列步骤:
上绘制边界框
更新
具体实现如下:
x = 'C:/Users/Desktop/text.jpg'
img = cv2.imread(x)
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
#--- performing Otsu threshold ---
ret,thresh1 = cv2.threshold(gray, 0, 255,cv2.THRESH_OTSU|cv2.THRESH_BINARY_INV)
cv2.imshow('thresh1', thresh1)
#--- choosing the right kernel
#--- kernel size of 3 rows (to join dots above letters 'i' and 'j')
#--- and 10 columns to join neighboring letters in words and neighboring words
rect_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (15, 3))
dilation = cv2.dilate(thresh1, rect_kernel, iterations = 1)
cv2.imshow('dilation', dilation)
#---Finding contours ---
_, contours, hierarchy = cv2.findContours(dilation, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)
im2 = img.copy()
for cnt in contours:
x, y, w, h = cv2.boundingRect(cnt)
cv2.rectangle(im2, (x, y), (x + w, y + h), (0, 255, 0), 2)
cv2.imshow('final', im2)
https://stackoverflow.com/questions/50777688
复制相似问题