TensorFlow와 OpenCV를 사용하여 웹캠에 비춘 손글씨 숫자 인식하기

TensorFlow와 OpenCV를 사용하여 웹캠에 비춘 손글씨 숫자 인식하기Deep Learning & Machine Learning/강좌&예제 코드2019. 10. 1. 22:58@webnautes

Table of Contents

Tensorflow와 OpenCV를 사용하여 웹캠에 비춘 손글씨 숫자를 인식시켜보았습니다.

최초 작성 2019. 10. 1

CNN을 사용하여 인식 정확도가 좋아졌습니다.

01.py

손글씨 숫자를 인식을 위해 뉴럴 네트워크를 학습시키는 코드입니다.

실행결과 가중치를 파일로 저장합니다.

import tensorflow as tf

mnist = tf.keras.datasets.mnist

(x_train, y_train),(x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(),
tf.keras.layers.Dense(512, activation=tf.nn.relu),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation=tf.nn.softmax)
])

model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

model.fit(x_train, y_train, epochs=5)
model.evaluate(x_test, y_test)

model.save_weights('mnist_checkpoint')

02.py

학습된 뉴럴 네트워크를 사용하여 웹캠에 비춘 손글씨 숫자를 인식하는 코드입니다.

import tensorflow as tf
import cv2
import numpy as np
import math

def process(img_input):

gray = cv2.cvtColor(img_input, cv2.COLOR_BGR2GRAY)

gray = cv2.resize(gray, (28, 28), interpolation=cv2.INTER_AREA)

(thresh, img_binary) = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV | cv2.THRESH_OTSU)

h,w = img_binary.shape

ratio = 100/h
new_h = 100
new_w = w * ratio

img_empty = np.zeros((110,110), dtype=img_binary.dtype)
img_binary = cv2.resize(img_binary, (int(new_w), int(new_h)), interpolation=cv2.INTER_AREA)
img_empty[:img_binary.shape[0], :img_binary.shape[1]] = img_binary

img_binary = img_empty

cnts = cv2.findContours(img_binary.copy(), cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)

# 컨투어의 무게중심 좌표를 구합니다.
M = cv2.moments(cnts[0][0])
center_x = (M["m10"] / M["m00"])
center_y = (M["m01"] / M["m00"])

# 무게 중심이 이미지 중심으로 오도록 이동시킵니다.
height,width = img_binary.shape[:2]
shiftx = width/2-center_x
shifty = height/2-center_y

Translation_Matrix = np.float32([[1, 0, shiftx],[0, 1, shifty]])
img_binary = cv2.warpAffine(img_binary, Translation_Matrix, (width,height))

img_binary = cv2.resize(img_binary, (28, 28), interpolation=cv2.INTER_AREA)
flatten = img_binary.flatten() / 255.0

return flatten

model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(),
tf.keras.layers.Dense(512, activation=tf.nn.relu),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation=tf.nn.softmax)
])

model.load_weights('mnist_checkpoint')

cap = cv2.VideoCapture(0)
width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))

while(True):

ret, img_color = cap.read()

if ret == False:
break;

img_input = img_color.copy()
cv2.rectangle(img_color, (250, 150), (width-250, height-150), (0, 0, 255), 3)
cv2.imshow('bgr', img_color)

img_roi = img_input[150:height-150, 250:width-250]

key = cv2.waitKey(1)

if key == 27:
break
elif key == 32:
flatten = process(img_roi)

predictions = model.predict(flatten[np.newaxis,:])

with tf.compat.v1.Session() as sess:
print(tf.argmax(predictions, 1).eval())

cv2.imshow('img_roi', img_roi)
cv2.waitKey(0)

cap.release()
cv2.destroyAllWindows()

저작자표시 비영리 동일조건

'Deep Learning & Machine Learning > 강좌&예제 코드' 카테고리의 다른 글

텐서플로우 2.0 강좌 - 케라스를 사용하여 손글씨 숫자 분류를 위한 신경망 만들기 (0)	2019.10.29
Android를 위한 TensorFlow Lite 예제 (MNIST 손글씨 숫자 인식) (28)	2019.10.16
텐서플로우 2.0 강좌입니다. (4)	2019.09.01
텐서 플로우 2.0 강좌 1 - 텐서플로우 설치 (6)	2019.07.14
텐서플로우 기초 강좌 - 1. 간단한 수식 계산 (0)	2019.02.04

시간날때마다 틈틈이 이것저것 해보며 블로그에 글을 남깁니다.

블로그의 문서는 종종 최신 버전으로 업데이트됩니다.

여유 시간이 날때 진행하는 거라 언제 진행될지는 알 수 없습니다.

블로그 글과 유튜브 영상을 만드는 것은 전문가라서라기보단 공부한 내용을 함께 공유하는 게 좋아서입니다.

'Deep Learning & Machine Learning > 강좌&예제 코드' 카테고리의 다른 글

제가 쓴 책도 한번 검토해보세요 ^^

티스토리툴바