Object detection - class review

Last updated on:6 months ago

Localization and detection can be used image classification, classification with localization and multiple object detection.

Output of object detection network

If

Landmark detection

Select the number of landmarks.

129 output units = $64 \times 2 + 1$

Analyse the pose of the person

Object detection

Sliding windows detection

You then repeat it, but now use a larger window.

You can reduce the number of windows you need to pass through the confinite by using a more coarse stride, a bigger stride, a larger step size. The sliding detector can be implemented convolutionally, or much more efficient.

Turning FC layer into convolutional layers

Convolutional implementation of sliding windows

Problem: the position of the bounding boxes are not too accurate.

Bounding boxes predictions
output accurate bounding boxes.

Reference

[1] Deeplearning.ai, Convolutional Neural Networks