目录
-
相关背景
-
从传统方法到R-CNN
-
从R-CNN到SPP
-
Fast R-CNN
-
Faster R-CNN
-
YOLO
-
SSD
-
总结
-
参考文献
-
推荐链接
相关背景
-
14年以来的目标检测方法(以R-CNN框架为基础或对其改进)
data:image/s3,"s3://crabby-images/5a131/5a1316d8d2a8806c811e9df1efe77ff9e6cf4a52" alt=""
data:image/s3,"s3://crabby-images/4b95d/4b95dd32edca56443af65d881d63224f00639ac7" alt=""
data:image/s3,"s3://crabby-images/b5812/b5812145db151d54b499c0efc717952cfc8cb82b" alt=""
data:image/s3,"s3://crabby-images/3ad4b/3ad4bfdbe5ad57b078166f4350f12b9d488764bb" alt=""
从传统方法到R-CNN
-
R-CNN的三大步骤:得到候选区域,用cnn提取特征,训练分类器(后两步放在一个网络中,用softmax做分类器也可以)
data:image/s3,"s3://crabby-images/ce7f6/ce7f6bd749cfc822e4e700b04c05cba652edb7b5" alt=""
从R-CNN到SPP
data:image/s3,"s3://crabby-images/72fea/72fea553db3e74d5ce584b2c816f11b0e483fa5a" alt=""
-
SPP的两大优势:可变输入大小 + 各patch块之间卷积计算是共享的
data:image/s3,"s3://crabby-images/3439b/3439b20e2bd00983848057aa4d812aab51a1ee0f" alt=""
-
SPP的缺陷:multi-stage,训练和测试都比较慢
data:image/s3,"s3://crabby-images/8dab0/8dab0b964b27d8e9c9c1f044e818333b20b632a1" alt=""
Fast R-CNN
-
Fast R-CNN通过ROI pooling(一层的SPP),multi-task等改进大大提高速度
data:image/s3,"s3://crabby-images/88ff1/88ff1ec738942e7d5c4b50bcb591768ff1c41599" alt=""
data:image/s3,"s3://crabby-images/ff2fb/ff2fbe2826a69d72d070ae5d7879fc62cda4310f" alt=""
Faster R-CNN
-
Faster R-CNN对于Fast R-CNN的改进在于把region proposal的步骤换成一个CNN网络(RPN)
data:image/s3,"s3://crabby-images/82752/827526e38f452aa6f87b6221445a4a8f8bb38c35" alt=""
-
Faster R-CNN的两个base model: ZF,VGG16 (base model的中间conv输出即为要输入到RPN的那个feature map)
data:image/s3,"s3://crabby-images/99ca9/99ca9c701e01bf738c9cdba932f9e05406ce2a44" alt=""
data:image/s3,"s3://crabby-images/8e03b/8e03b9508d0e45c1ea9cf0b7a71a5519cee50649" alt=""
-
Faster R-CNN的锚点anchor box
data:image/s3,"s3://crabby-images/5e527/5e52720323452928f0ed4c3d06f4289e3a48b27a" alt=""
data:image/s3,"s3://crabby-images/d86e3/d86e391f6c0b9facb518462da91b6b872a2f1ee3" alt=""
data:image/s3,"s3://crabby-images/16bd5/16bd5c5c5ef127bd7658550a77dbe864c257a693" alt=""
data:image/s3,"s3://crabby-images/0bd11/0bd118c3f5bde20f457db689d5866bc0156c7613" alt=""
YOLO
data:image/s3,"s3://crabby-images/ba560/ba5604c3fa1a2b86c30b97d70ff56a690cd0c40d" alt=""
data:image/s3,"s3://crabby-images/25249/25249c9feb72c4b13cfd826379293a097271f676" alt=""
data:image/s3,"s3://crabby-images/07750/07750789f94c55cd84fafe54fb5ae4e7cf3147e1" alt=""
data:image/s3,"s3://crabby-images/81012/810121cd340c085144c58e95b5a8a209a528977a" alt=""
data:image/s3,"s3://crabby-images/9e981/9e981fca7e28bdd1e8cecaada5fbfa64ba785779" alt=""
data:image/s3,"s3://crabby-images/49ee3/49ee38c66ecf75111f48a6da6b651ffbf0d23cfa" alt=""
SSD
data:image/s3,"s3://crabby-images/8bf9a/8bf9a9f62ad8458279de8c68cd7a44415116c7d7" alt=""
data:image/s3,"s3://crabby-images/b3d21/b3d21e6bdabcfa0f4e7721fb3008db178f4711db" alt=""
data:image/s3,"s3://crabby-images/3f8b9/3f8b9dccf8b4413db992391e8bf1859d299cc6ad" alt=""
data:image/s3,"s3://crabby-images/a414c/a414cf406835310ea77786713aceb5b0b95f091d" alt=""
-
SSD的default box与faster r-cnn的anchor box的对比
data:image/s3,"s3://crabby-images/4050a/4050a2f9ecaaeb45976284424e75bd4a355cdaaa" alt=""
-
SSD的训练样本与groundTruth的匹配策略 + 损失函数
data:image/s3,"s3://crabby-images/42988/429880502895243c0915cfe700466a1ef96b64e5" alt=""
data:image/s3,"s3://crabby-images/b0fc3/b0fc333e7a923baf2d19b86af28f5cfe77915379" alt=""
总结
-
从R-CNN → SPP → Fast R-CNN → Faster R-CNN → YOLO → SSD整体在准确率和速度上都在提高
data:image/s3,"s3://crabby-images/ce7a5/ce7a5b424a7eb2381afaf3918a7dddeca4c01476" alt=""
参考文献
-
-
- Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR. (2014)
-
SPP
-
- He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: ECCV. (2014)
-
Fast R-CNN
-
- Girshick, R.: Fast R-CNN. In: ICCV. (2015)
-
Faster R-CNN
-
- Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks. In: NIPS. (2015)
-
YOLO
-
- Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: Unified, real-time object detection. In: CVPR. (2016)
-
SSD
-
- W. Liu, D. Anguelov, D. Erhan, C. Szegedy, and S. Reed. SSD: Single shot multibox detector. arXiv:1512.02325v2, 2015
推荐链接
-
Object detection methods (codes)
-
所有目标检测方法的中文总结(博客)
-
Faster RCNN的论文阅读
-
YOLO的论文阅读
-
R-FCN的论文阅读
-
SSD的论文阅读
本文链接:http://task.lmcjl.com/news/12152.html