Vehicle Detection

The goal of this project is to write a software pipeline to detect vehicles in a video.

Here are links to the labeled data for vehicle and non-vehicle examples to train your classifier. These example images come from a combination of the GTI vehicle image database, the KITTI vision benchmark suite, and examples extracted from the project video itself. You are also welcome and encouraged to take advantage of the recently released Udacity labeled dataset to augment your training data.

Steps of this project

Perform Histogram of Oriented Gradients (HOG) feature extraction on a labeled training set of images and train a linear SVM classifier.
Implement a sliding-window technique and use your trained classifier to search for vehicles in images.
Run your pipeline on a video stream and create a heat map of recurring detections frame by frame to reject outliers and follow detected vehicles.
Estimate a bounding box for vehicles detected.

Histogram of Oriented Gradients (HOG)

Here is an example of one of each of the vehicle and non-vehicle classes:

I explored different color spaces and different skimage.feature.hog() parameters (orientations, pixels_per_cell, and cells_per_block). I grabbed random images from each of the two classes and displayed them to get a feel for what the skimage.feature.hog() output looks like.

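from skimage.feature import hog

# With vis=True, hog() returns the feature vector and a visualization image.
# Current scikit-image spells the keyword `visualize` (formerly `visualise`).
features, hog_img = hog(img, orientations=orient,
                        pixels_per_cell=(pix_per_cell, pix_per_cell),
                        cells_per_block=(cell_per_block, cell_per_block),
                        visualize=vis, feature_vector=feature_vec)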

Here is an example using the YCrCb color space and HOG parameters of orientations=9, pixels_per_cell=(8, 8) and cells_per_block=(2, 2):

I tried various combinations of parameters and found that orientations=11, pixels_per_cell=8, and cells_per_block=2 worked best.
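To get a feel for how these parameters set the size of the feature vector, the sketch below computes the HOG feature length per channel. The 64x64 patch size and the helper name `hog_feature_length` are assumptions made for illustration; the formula assumes square images and blocks that slide one cell at a time.

```python
def hog_feature_length(img_size, pix_per_cell, cell_per_block, orient):
    """Length of a flattened HOG vector for one square, single-channel image."""
    cells = img_size // pix_per_cell        # cells along one side
    blocks = cells - cell_per_block + 1     # block positions along one side
    return blocks * blocks * cell_per_block * cell_per_block * orient

# Assumed 64x64 training patch with the parameters chosen above
per_channel = hog_feature_length(64, 8, 2, 11)
all_channels = 3 * per_channel  # HOG on all three color channels
print(per_channel, all_channels)  # 2156 6468
```

More orientations or smaller cells grow the vector quickly, which is worth keeping in mind when weighing accuracy against search speed.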

SVM classifier

I trained a linear SVM using HOG features, spatially binned color features, and color histograms. To optimize the classifier, I used grid-search cross-validation to tune the hyperparameter C and chose C=0.01.

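from sklearn import svm
from sklearn.model_selection import GridSearchCV

# Search over C for a linear-kernel SVM with 4-fold cross-validation
parameters = {'kernel': ['linear'], 'C': [0.01, 0.1, 1, 10]}
svc = svm.SVC()
clf = GridSearchCV(svc, parameters, cv=4, n_jobs=8)
clf.fit(X_train, y_train)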
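The spatial and color histogram features that accompany HOG can be sketched with NumPy alone. This is a minimal illustration rather than the notebook's code: the function names, the 16x16 spatial size, the 32-bin histogram, and the block-mean downsampling (standing in for an image resize) are all assumptions.

```python
import numpy as np

def bin_spatial(img, size=16):
    """Downsample an HxWxC image to size x size by block averaging, then flatten.

    Assumes H and W are multiples of `size` (e.g. a 64x64 patch).
    """
    h, w, c = img.shape
    blocks = img.reshape(size, h // size, size, w // size, c)
    return blocks.mean(axis=(1, 3)).ravel()

def color_hist(img, nbins=32, bins_range=(0, 256)):
    """Concatenate per-channel intensity histograms into one vector."""
    hists = [np.histogram(img[:, :, ch], bins=nbins, range=bins_range)[0]
             for ch in range(img.shape[2])]
    return np.concatenate(hists)

# Hypothetical 64x64 three-channel patch with random pixel values
rng = np.random.default_rng(0)
patch = rng.integers(0, 256, size=(64, 64, 3)).astype(np.uint8)

color_features = np.concatenate([bin_spatial(patch), color_hist(patch)])
print(color_features.shape)  # (864,): 16*16*3 spatial values + 3*32 histogram bins
```

In a full pipeline these values would be concatenated with the HOG vector to form one row of X_train.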

Sliding Window Search

I use a sliding window at three scales: 1.2, 1.5, and 2. The window moves across the image one cell per step. To keep the vehicle detection reliable, I use this smallest step size and search the image space at multiple scales.
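A rough sketch of how window positions could be generated for each scale follows. It is not the notebook's implementation: the function name, the 64-pixel base window, the 8-pixel cell, the 1280x720 frame, and the search region are assumptions chosen for illustration.

```python
def sliding_windows(x_range, y_range, scale, base_window=64,
                    pix_per_cell=8, cells_per_step=1):
    """Return window corners ((x1, y1), (x2, y2)) in original image pixels.

    The window and step are defined on a 64-pixel base patch and multiplied
    by `scale`, so larger scales give larger windows and coarser steps.
    """
    window = int(base_window * scale)
    step = max(1, int(pix_per_cell * cells_per_step * scale))
    boxes = []
    for y in range(y_range[0], y_range[1] - window + 1, step):
        for x in range(x_range[0], x_range[1] - window + 1, step):
            boxes.append(((x, y), (x + window, y + window)))
    return boxes

# Hypothetical search region: full width, rows 400 to 656 of a 1280x720 frame
for scale in (1.2, 1.5, 2):
    boxes = sliding_windows((0, 1280), (400, 656), scale)
    print(scale, len(boxes))
```

Each box would then be cropped, resized to the training patch size, and passed to the classifier.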

Inference

Several window scales are required, and adding more search windows is very expensive. However, a given window scale is only suitable for a certain depth range, so constraining each search window to its own depth range saves a lot of computation. The sliding-window search is run twice with three scales (1.2, 1.5, 2). Ultimately I searched on three scales using YCrCb 3-channel HOG features plus spatially binned color and histograms of color in the feature vector, which provided a nice result. Here are some example images:
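The saving from tying each scale to a depth range can be estimated by counting windows. In the sketch below the vertical bands, the frame width, and the helper `count_windows` are hypothetical; only the three scales come from the text above.

```python
def count_windows(width, y_start, y_stop, scale, base_window=64, pix_per_cell=8):
    """Number of one-cell-step windows of a given scale inside a horizontal band."""
    window = int(base_window * scale)
    step = int(pix_per_cell * scale)
    if y_stop - y_start < window or width < window:
        return 0
    nx = (width - window) // step + 1
    ny = (y_stop - y_start - window) // step + 1
    return nx * ny

WIDTH = 1280  # assumed frame width

# Hypothetical bands: small windows near the horizon, large ones near the camera
bands = {1.2: (400, 520), 1.5: (400, 580), 2: (400, 656)}

unconstrained = sum(count_windows(WIDTH, 400, 656, s) for s in bands)
constrained = sum(count_windows(WIDTH, y0, y1, s) for s, (y0, y1) in bands.items())
print(unconstrained, constrained)
```

With these made-up bands the constrained search evaluates fewer than half as many windows, and the smallest scale accounts for most of the reduction.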

Video Implementation

Here's a link to my video result

I recorded the positions of positive detections in each frame of the video and accumulated the detections found over the previous several frames. To do this, I multiply the previous step's heatmap by a decay rate and add the current heatmap to produce a smoothed heatmap. I then threshold the smoothed heatmap to identify vehicle positions.

def add_heat(heatmap, bbox_list):
    '''Add heat (+1) to every pixel inside each bounding box'''
    for box in bbox_list:
        # Add += 1 for all pixels inside each bbox
        # Assuming each "box" takes the form ((x1, y1), (x2, y2))
        heatmap[box[0][1]:box[1][1], box[0][0]:box[1][0]] += 1

    # Return updated heatmap
    return heatmap

def apply_threshold(heatmap, threshold):
    output = np.copy(heatmap)
    output[output <= threshold] = 0
    return output

heatmap = np.zeros_like(img[:,:,0]).astype(np.float64)  # np.float was removed from NumPy
heatmap = add_heat(heatmap, bbox_list)
filtered_heatmap = apply_threshold(heatmap, 1)
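The snippet above handles a single frame. The decay-based smoothing across frames could look roughly like the following sketch, in which the class name, the decay rate of 0.8, the frame size, and the example boxes are assumptions rather than values from the project.

```python
import numpy as np

class HeatmapSmoother:
    """Running heatmap: previous map times a decay rate plus the current map."""

    def __init__(self, shape, decay=0.8):
        self.decay = decay
        self.smoothed = np.zeros(shape, dtype=np.float64)

    def update(self, bbox_list):
        # Build this frame's heatmap, then blend it with the decayed history
        current = np.zeros_like(self.smoothed)
        for (x1, y1), (x2, y2) in bbox_list:
            current[y1:y2, x1:x2] += 1
        self.smoothed = self.decay * self.smoothed + current
        return self.smoothed

smoother = HeatmapSmoother((720, 1280), decay=0.8)
# Hypothetical detections: a car seen in three frames, a false positive in one
frames = [
    [((800, 400), (900, 500))],
    [((805, 402), (905, 502)), ((100, 420), (164, 484))],
    [((810, 404), (910, 504))],
]
for boxes in frames:
    heat = smoother.update(boxes)

vehicle_mask = heat > 1  # keeps the same pixels as apply_threshold(heatmap, 1)
print(vehicle_mask[450, 850], vehicle_mask[450, 130])  # True False
```

The recurring detection builds up heat past the threshold, while the one-off detection decays below it in the next frame.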

I use scipy.ndimage.measurements.label() to identify individual blobs in the heatmap and assume each blob corresponds to a vehicle. I then construct a bounding box to cover the area of each detected blob.

from scipy.ndimage import label  # the scipy.ndimage.measurements namespace is deprecated

car_labels = label(filtered_heatmap)
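label() returns a pair: an array in which each blob's pixels carry that blob's integer id, and the number of blobs found. One way to turn that pair into bounding boxes is sketched below; the helper name `boxes_from_labels` is hypothetical, and a small hand-built array stands in for real label() output so the example runs with NumPy alone.

```python
import numpy as np

def boxes_from_labels(labels):
    """Turn the (labeled_array, num_features) pair returned by label() into boxes."""
    labeled, n_blobs = labels
    boxes = []
    for blob_id in range(1, n_blobs + 1):
        # Row and column indices of every pixel belonging to this blob
        ys, xs = np.nonzero(labeled == blob_id)
        boxes.append(((int(xs.min()), int(ys.min())),
                      (int(xs.max()), int(ys.max()))))
    return boxes

# Hand-built stand-in for label() output: two blobs on a small grid
labeled = np.zeros((6, 8), dtype=int)
labeled[1:3, 1:4] = 1
labeled[4:6, 5:8] = 2
print(boxes_from_labels((labeled, 2)))
# [((1, 1), (3, 2)), ((5, 4), (7, 5))]
```

In the pipeline the call would be boxes_from_labels(car_labels), and each box would be drawn onto the frame.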

Here's an example result showing the heatmap from a series of video frames, the result of scipy.ndimage.measurements.label(), and the bounding boxes overlaid on the last frame of video:

Here are two frames and their corresponding heatmaps:

Here is the output of scipy.ndimage.measurements.label() on the integrated heatmap from all six frames, along with the resulting bounding boxes drawn onto the last frame:

Discussion

I found that using multiple frames produces better results. Detecting vehicles far away in the scene requires a small window scale, but this compromises the performance of the system. One improvement would be to determine the search scale from the scene depth at which the bounding box is located. Vehicle detection usually fails at the border of the image, because the sliding-window method always leaves some pixels uncovered. The window size and scale should be optimized to reduce those leftover pixels.
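The leftover pixels at the border can be quantified with a little arithmetic. The helper below is a hypothetical illustration; the 1280-pixel width and the window and step sizes (a 64-pixel window and 8-pixel step multiplied by the scale) are assumptions.

```python
def uncovered_margin(extent, window, step):
    """Pixels left unsearched at the far edge when windows start at 0."""
    if extent < window:
        return extent
    return (extent - window) % step

# Assumed 1280-pixel-wide frame; window and step sizes for scales 1.5 and 2
print(uncovered_margin(1280, 96, 12))   # scale 1.5 -> 8
print(uncovered_margin(1280, 128, 16))  # scale 2   -> 0
```

A scale whose window and step divide the search extent evenly leaves no margin. Another option, not taken from the project, would be to add one extra window flush against the border.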
