30秒轻松实现TensorFlow物体检测-侯体宗的博客

30秒轻松实现TensorFlow物体检测
技术 / 管理员发布于 7年前 364

Google发布了新的TensorFlow物体检测API，包含了预训练模型，一个发布模型的jupyter notebook，一些可用于使用自己数据集对模型进行重新训练的有用脚本。

使用该API可以快速的构建一些图片中物体检测的应用。这里我们一步一步来看如何使用预训练模型来检测图像中的物体。

首先我们载入一些会使用的库

import numpy as np import os import six.moves.urllib as urllib import sys import tarfile import tensorflow as tf import zipfile  from collections import defaultdict from io import StringIO from matplotlib import pyplot as plt from PIL import Image

接下来进行环境设置

%matplotlib inline sys.path.append("..")

物体检测载入

from utils import label_map_util  from utils import visualization_utils as vis_util

准备模型

变量任何使用export_inference_graph.py工具输出的模型可以在这里载入，只需简单改变PATH_TO_CKPT指向一个新的.pb文件。这里我们使用“移动网SSD”模型。

MODEL_NAME = 'ssd_mobilenet_v1_coco_11_06_2017' MODEL_FILE = MODEL_NAME + '.tar.gz' DOWNLOAD_BASE = 'http://download.tensorflow.org/models/object_detection/'  PATH_TO_CKPT = MODEL_NAME + '/frozen_inference_graph.pb'  PATH_TO_LABELS = os.path.join('data', 'mscoco_label_map.pbtxt')  NUM_CLASSES = 90

下载模型

opener = urllib.request.URLopener() opener.retrieve(DOWNLOAD_BASE + MODEL_FILE, MODEL_FILE) tar_file = tarfile.open(MODEL_FILE) for file in tar_file.getmembers():   file_name = os.path.basename(file.name)   if 'frozen_inference_graph.pb' in file_name:     tar_file.extract(file, os.getcwd())

将（frozen）TensorFlow模型载入内存

detection_graph = tf.Graph() with detection_graph.as_default():   od_graph_def = tf.GraphDef()   with tf.gfile.GFile(PATH_TO_CKPT, 'rb') as fid:     serialized_graph = fid.read()     od_graph_def.ParseFromString(serialized_graph)     tf.import_graph_def(od_graph_def, name='')

载入标签图

标签图将索引映射到类名称，当我们的卷积预测5时，我们知道它对应飞机。这里我们使用内置函数，但是任何返回将整数映射到恰当字符标签的字典都适用。

label_map = label_map_util.load_labelmap(PATH_TO_LABELS) categories = label_map_util.convert_label_map_to_categories(label_map, max_num_classes=NUM_CLASSES, use_display_name=True) category_index = label_map_util.create_category_index(categories)

辅助代码

def load_image_into_numpy_array(image):  (im_width, im_height) = image.size  return np.array(image.getdata()).reshape(    (im_height, im_width, 3)).astype(np.uint8)

检测

PATH_TO_TEST_IMAGES_DIR = 'test_images' TEST_IMAGE_PATHS = [ os.path.join(PATH_TO_TEST_IMAGES_DIR, 'image{}.jpg'.format(i)) for i in range(1, 3) ] IMAGE_SIZE = (12, 8) [python] view plain copywith detection_graph.as_default():   with tf.Session(graph=detection_graph) as sess:   for image_path in TEST_IMAGE_PATHS:    image = Image.open(image_path)    # 这个array在之后会被用来准备为图片加上框和标签    image_np = load_image_into_numpy_array(image)    # 扩展维度，应为模型期待: [1, None, None, 3]    image_np_expanded = np.expand_dims(image_np, axis=0)    image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')    # 每个框代表一个物体被侦测到.    boxes = detection_graph.get_tensor_by_name('detection_boxes:0')    # 每个分值代表侦测到物体的可信度.    scores = detection_graph.get_tensor_by_name('detection_scores:0')    classes = detection_graph.get_tensor_by_name('detection_classes:0')    num_detections = detection_graph.get_tensor_by_name('num_detections:0')    # 执行侦测任务.    (boxes, scores, classes, num_detections) = sess.run(      [boxes, scores, classes, num_detections],      feed_dict={image_tensor: image_np_expanded})    # 图形化.    vis_util.visualize_boxes_and_labels_on_image_array(      image_np,      np.squeeze(boxes),      np.squeeze(classes).astype(np.int32),      np.squeeze(scores),      category_index,      use_normalized_coordinates=True,      line_thickness=8)    plt.figure(figsize=IMAGE_SIZE)    plt.imshow(image_np)

在载入模型部分可以尝试不同的侦测模型以比较速度和准确度，将你想侦测的图片放入TEST_IMAGE_PATHS中运行即可。

以上就是本文的全部内容，希望对大家的学习有所帮助，也希望大家多多支持。

上一条：
tensorflow实现KNN识别MNIST
下一条：
tensorflow识别自己手写数字

0条评论 (评论内容有缓存机制,请悉知!)

最新最热

相关文章
智能合约Solidity学习CryptoZombie第三课:组建僵尸军队(高级Solidity理论)(0个评论)
智能合约Solidity学习CryptoZombie第二课:让你的僵尸猎食(0个评论)
智能合约Solidity学习CryptoZombie第一课:生成一只你的僵尸(0个评论)
gmail发邮件报错:534 5.7.9 Application-specific password required...解决方案(0个评论)
2024.07.09日OpenAI将终止对中国等国家和地区API服务(0个评论)

近期文章
智能合约Solidity学习CryptoZombie第三课:组建僵尸军队(高级Solidity理论)(0个评论)
智能合约Solidity学习CryptoZombie第二课:让你的僵尸猎食(0个评论)
智能合约Solidity学习CryptoZombie第一课:生成一只你的僵尸(0个评论)
在go中实现一个常用的先进先出的缓存淘汰算法示例代码(0个评论)
在go+gin中使用"github.com/skip2/go-qrcode"实现url转二维码功能(0个评论)
在go语言中使用api.geonames.org接口实现根据国际邮政编码获取地址信息功能(1个评论)
在go语言中使用github.com/signintech/gopdf实现生成pdf分页文件功能(0个评论)
gmail发邮件报错:534 5.7.9 Application-specific password required...解决方案(0个评论)
欧盟关于强迫劳动的规定的官方举报渠道及官方举报网站(0个评论)
在go语言中使用github.com/signintech/gopdf实现生成pdf文件功能(0个评论)

近期评论
122 在
学历：一种延缓就业设计，生活需求下的权衡之选中评论工作几年后，报名考研了，到现在还没认真学习备考，迷茫中。作为一名北漂互联网打工人..
123 在
Clash for Windows作者删库跑路了，github已404中评论按理说只要你在国内，所有的流量进出都在监控范围内，不管你怎么隐藏也没用，想搞你分..
原梓番博客在
在Laravel框架中使用模型Model分表最简单的方法中评论好久好久都没看友情链接申请了，今天刚看，已经添加。..
博主在
佛跳墙vpn软件不会用?上不了网?佛跳墙vpn常见问题以及解决办法中评论 @1111老铁这个不行了，可以看看近期评论的其他文章..
1111 在
佛跳墙vpn软件不会用?上不了网?佛跳墙vpn常见问题以及解决办法中评论网站不能打开，博主百忙中能否发个APP下载链接，佛跳墙或极光..

Top