COMP 4102 Final Project (Winter 2020)
Majd Al Khany - 100977501
Layne Koftinow-Mikan - 101013563
The analog clock reader consists of a set of algorithms whose input is an image of an analog clock and whose output is a string representation of the time shown on the clock. The code for this project was written in Python and makes use of the OpenCV and Tesseract computer vision libraries. One of the goals of this project was to create a robust algorithm behind a simple interface: while the input is a single image and the output is a single string, the program is designed to handle many different kinds of analog clock or watch images. This includes the ideal scenario, in which the clock has hour, minute, and second hands as well as numbers, and faces the camera directly. The algorithm also supports non-ideal cases in which the clock is misaligned, misoriented, or missing potential data points such as hands and numbers. This repository also contains a test script which determines the accuracy of the algorithm by comparing the expected results to the actual results and computing the difference.

Dependencies:

- OpenCV
- Tesseract and PyTesseract

Usage:

- Launch command: `python detect_clock.py <image.jpg>` (the image must be in the `images` folder)
- Test script launch command: `python test_detect_clock.py`

Input: clock image

The pipeline consists of the following steps; a short illustrative code sketch for each step appears after the list.

- Align the clock if it is sufficiently skewed (`align_clock.py`)
  - Detect edges using the Canny edge detector
  - Use the edges to detect the clock contour using Suzuki contour detection
  - Calculate the `boundingRect` and `minEnclosingCircle` of the contour
  - Calculate the transformation matrix using the `boundingRect` and `minEnclosingCircle` values
  - Warp the perspective using the transformation matrix
- Orient the clock if numbers cannot be detected (`orient_clock.py`)
  - Detect text (the numbers on the clock) in the input image using the EAST DNN text detector
  - Place bounding boxes around the detected text so the OCR engine can locate and interpret it
  - Pass the image and the data computed by the EAST text detector to the PyTesseract OCR engine, which reads the text in the image
  - If PyTesseract successfully reads the numbers on the clock, the image is already correctly oriented and is returned in its current state
  - Otherwise, use `imutils.rotate` to rotate the image in 10-degree increments, repeating the steps above until the numbers become readable, at which point the corrected image is returned
  - If the image is rotated 360 degrees or more and PyTesseract still cannot read the numbers, all rotations are discarded and the image is returned in its original orientation
- Center the clock by cropping the image (`isolate_clock.py`)
  - Convert the image to greyscale and apply a `medianBlur`
  - Apply the Hough transform to detect the clock's outer circumference
  - Crop the image around this circumference
- Detect the clock's hands (`detect_clock_hands.py`)
  - Detect edges using the Canny edge detector
  - Detect lines using the probabilistic Hough transform on these edges
  - Remove lines that cannot represent clock hands (i.e. those that do not pass near the center)
  - Merge similar lines into a single line, since most hands will have more than one detected line corresponding to them
  - Estimate which lines represent the hour, minute, and second hands using their length and thickness, resulting in a list of 2-3 lines (the second hand is optional)
- Calculate the time using the clock's hands (`calculate_time.py`)
  - Calculate the angle of each hand (e.g. 12 o'clock is 0 degrees, 3 o'clock is 90 degrees, etc.)
  - Use the angle of each hand to calculate its time value
  - Format the time values and return a single string

Output: string representing the time on the clock (hh:mm:ss)
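
A minimal sketch of the alignment step, assuming the largest detected contour is the clock face; the exact transform construction in `align_clock.py` may differ:

```python
import cv2
import numpy as np

def align_clock(image):
    """Deskew the clock face using its contour's bounding rectangle and
    minimum enclosing circle (illustrative sketch)."""
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(gray, 50, 150)

    # cv2.findContours implements Suzuki border following (OpenCV 4.x signature).
    contours, _ = cv2.findContours(edges, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return image
    clock = max(contours, key=cv2.contourArea)  # assume the largest contour is the clock

    x, y, w, h = cv2.boundingRect(clock)
    _, r = cv2.minEnclosingCircle(clock)

    # Map the skewed bounding box onto a square whose side is the enclosing
    # circle's diameter, stretching a tilted (elliptical) face back toward a circle.
    side = int(2 * r)
    src = np.float32([[x, y], [x + w, y], [x + w, y + h], [x, y + h]])
    dst = np.float32([[0, 0], [side, 0], [side, side], [0, side]])

    M = cv2.getPerspectiveTransform(src, dst)
    return cv2.warpPerspective(image, M, (side, side))
```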
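
A sketch of the orientation loop, with the EAST localization step omitted for brevity; the Tesseract configuration and the digit check below are illustrative assumptions, not the exact logic of `orient_clock.py`:

```python
import re
import imutils
import pytesseract

def orient_clock(image, step=10):
    """Rotate-and-retry sketch: the real orient_clock.py localizes text with
    the EAST detector first; here Tesseract alone is asked whether any
    plausible clock digit (1-12) is readable."""
    for angle in range(0, 360, step):
        candidate = imutils.rotate(image, angle)
        text = pytesseract.image_to_string(candidate, config="--psm 11")
        if re.search(r"\b(1[0-2]|[1-9])\b", text):
            return candidate  # numbers readable: orientation accepted
    # A full revolution without readable numbers: discard all rotations.
    return image
```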
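
A sketch of the isolation step using `cv2.HoughCircles`; the parameter values are illustrative rather than those used in `isolate_clock.py`:

```python
import cv2
import numpy as np

def isolate_clock(image):
    """Crop the image to the clock's outer circumference found by the Hough
    circle transform (illustrative parameters)."""
    gray = cv2.medianBlur(cv2.cvtColor(image, cv2.COLOR_BGR2GRAY), 5)
    circles = cv2.HoughCircles(gray, cv2.HOUGH_GRADIENT, dp=1,
                               minDist=gray.shape[0] // 2,
                               param1=100, param2=50,
                               minRadius=gray.shape[0] // 4, maxRadius=0)
    if circles is None:
        return image  # no circle found: fall back to the uncropped image
    x, y, r = np.round(circles[0, 0]).astype(int)  # strongest circle
    return image[max(y - r, 0):y + r, max(x - r, 0):x + r]
```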
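
A sketch of the hand-detection filtering idea; the helper name and thresholds are assumptions, and the line-merging and hour/minute/second classification steps are omitted:

```python
import cv2
import numpy as np

def detect_hand_lines(clock_img, center_tol=15):
    """Keep only probabilistic Hough lines that pass near the clock's centre,
    since every hand pivots around it (illustrative sketch)."""
    gray = cv2.cvtColor(clock_img, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(gray, 50, 150)
    lines = cv2.HoughLinesP(edges, 1, np.pi / 180, threshold=40,
                            minLineLength=clock_img.shape[0] // 6, maxLineGap=5)
    if lines is None:
        return []

    h, w = gray.shape
    cx, cy = w / 2, h / 2
    hands = []
    for x1, y1, x2, y2 in lines.reshape(-1, 4):
        # Perpendicular distance from the centre to the line through the segment.
        length = np.hypot(x2 - x1, y2 - y1)
        dist = abs((y2 - y1) * cx - (x2 - x1) * cy + x2 * y1 - y2 * x1) / max(length, 1e-6)
        if dist < center_tol:
            hands.append((x1, y1, x2, y2))
    return hands
```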
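
A sketch of the angle-to-time conversion, assuming angles are measured clockwise from 12 o'clock; `calculate_time.py` may round or correct the hour value differently:

```python
def time_from_angles(hour_deg, minute_deg, second_deg=None):
    """Convert hand angles (0 degrees at 12 o'clock, increasing clockwise)
    into an hh:mm:ss string (simplified rounding)."""
    hours = int(hour_deg // 30) or 12              # 30 degrees per hour; 0 maps to 12
    minutes = int(minute_deg // 6)                 # 6 degrees per minute
    seconds = int(second_deg // 6) if second_deg is not None else 0
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"

# Example: time_from_angles(95, 180, 270) -> "03:30:45"
```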

Example run, step by step:

Step | Screenshot |
---|---|
Input clock | ![]() |
Isolated clock | ![]() |
Edge detection | ![]() |
Line detection | ![]() |
Hand estimates | ![]() |
Program output | ![]() |
There is also a test script (`test_detect_clock.py`) that runs through all of the images in the `images` folder and computes their accuracy by comparing the actual result to the expected result. Most discrepancies are "off by one" errors, which are more pronounced for the hour hand (where one hour spans 3600 seconds) than for the minute hand (60 seconds). Another issue is inaccurate line detection that causes the algorithm to read the second hand as pointing in the opposite direction, leaving the seconds off by 30. The per-image error is measured in seconds, as sketched below.
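
For illustration, the comparison could be implemented along these lines (the helper name is an assumption; the actual test script may compute its metric differently):

```python
def time_difference_seconds(expected, actual):
    """Absolute difference between two 'hh:mm:ss' strings on a 12-hour dial."""
    def to_seconds(t):
        h, m, s = (int(part) for part in t.split(":"))
        return (h % 12) * 3600 + m * 60 + s

    diff = abs(to_seconds(expected) - to_seconds(actual))
    return min(diff, 12 * 3600 - diff)  # account for wrap-around at 12 o'clock

# Example: time_difference_seconds("11:59:50", "12:00:10") -> 20
```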

Executing the test script results in the following output: