Webcam Object Detection with Haar Cascade Classifiers and Running in Google Co-lab

This Python code allows you to capture images from your webcam and detect specific objects using pre-trained Haar Cascade classifiers. The detected objects are surrounded by bounding boxes and labeled with text. The code is designed to run in a Jupyter Notebook or Google Colab environment, with some adjustments required for Google Colab.

Motivating Article

S. Wattamwar, R. Mate, P. Rainchwar, S. Mantri and G. Sorate, "Optimal Face Recognition System using Haar Classifier," 2021 International Conference on Smart Generation Computing, Communication and Networking (SMART GENCON), Pune, India, 2021, pp. 1-7, doi: 10.1109/SMARTGENCON51891.2021.9645879. https://ieeexplore.ieee.org/abstract/document/9645879

Haarcascades https://github.com/opencv/opencv/tree/master/data/haarcascades

Google Colab: Access Webcam for Images and Video https://colab.research.google.com/drive/1QnC7lV7oVFk5OZCm75fqbLAfD9qBy9bw

Cascade Classification https://colab.research.google.com/github/computationalcore/introduction-to-opencv/blob/master/notebooks/4-Cascade_classification.ipynb

Features

Real-time video stream from the webcam
Object detection using Haar Cascade classifiers
Support for multiple object types: faces, smiles, and eyes
Drawing of bounding boxes around detected objects
Text labels on the bounding boxes indicating the object type
Capture and save images with bounding boxes and labels
Customizable file names based on the detected object type

Results

Captured Face

Captured Eyes

Captured Smile

Prerequisites

Python 3.x
OpenCV (with pre-trained Haar Cascade classifiers)
NumPy
PIL (Python Imaging Library)
IPython.display (for Jupyter Notebook or Google Colab)
google.colab.output (for Google Colab)

Running on Google Colab

To run this code on Google Colab, some adjustments are required:

Upload the necessary Haar Cascade classifier files (haarcascade_frontalface_default.xml, haarcascade_smile.xml, and haarcascade_eye.xml) to your Google Colab environment.
Import the required libraries from IPython.display and google.colab.output:

from IPython.display import Image, Javascript, display
from google.colab.output import eval_js

Use the eval_js function from google.colab.output to execute the JavaScript code that captures the video stream and handles the user interface.
Use the display function from IPython.display to display the captured images in the Colab notebook.

Usage

Run the Python script in a Jupyter Notebook or Google Colab environment.
A menu will prompt you to select the detection type:
- f: Face detection
- s: Smile detection
- e: Eye detection
Enter your choice (e.g., f for face detection).
The webcam video stream will be displayed with bounding boxes and labels around the detected objects.
Click the "Capture Photo" button to capture the current video frame.
The captured image will be saved with a filename reflecting the selected detection type (e.g., captured_face.jpg for face detection).
The saved image will be displayed in the notebook or Colab environment.
After capturing the photo, the script will clean up the OpenCV resources and remove the preview window.

Haar Cascade Classifiers

This code uses pre-trained Haar Cascade classifiers provided by OpenCV to detect specific objects in the video stream. These classifiers are machine learning models trained to recognize patterns and features associated with different object types.

The following Haar Cascade classifiers are supported:

haarcascade_frontalface_default.xml: For detecting frontal faces
haarcascade_smile.xml: For detecting smiles
haarcascade_eye.xml: For detecting eyes

The code automatically loads the appropriate classifier based on the selected detection type and performs object detection on each video frame.

Inference with Pre-trained Models

The Haar Cascade classifiers used in this code are pre-trained models provided by OpenCV. These models have been trained on large datasets of images to learn the visual patterns and features associated with specific object types, such as faces, smiles, and eyes.

When the code runs, it loads the pre-trained Haar Cascade classifier model and uses the detectMultiScale function from OpenCV to perform object detection on the video frames. The function returns the coordinates of the bounding boxes where the objects are detected.

The code then draws the bounding boxes and labels on the video frames and the captured images, allowing you to visualize the detected objects.

Contributing

Contributions are welcome! If you find any issues or have suggestions for improvements, please open an issue or submit a pull request.

##Disclaimer This repository is intended for educational and research purposes.

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Webcam Object Detection with Haar Cascade Classifiers and Running in Google Co-lab

Motivating Article

Related Work

Features

Results

Captured Face

Captured Eyes

Captured Smile

Prerequisites

Running on Google Colab

Usage

Haar Cascade Classifiers

Inference with Pre-trained Models

Contributing

License

Files

README.md

Latest commit

History

README.md

File metadata and controls

Webcam Object Detection with Haar Cascade Classifiers and Running in Google Co-lab

Motivating Article

Related Work

Features

Results

Captured Face

Captured Eyes

Captured Smile

Prerequisites

Running on Google Colab

Usage

Haar Cascade Classifiers

Inference with Pre-trained Models

Contributing

License