See like a computer, digital image processing

Computer vision is a fast growing field in computer science, going hand in hand with AI. For a computer a picture speaks a thousand words, or literally millions of bits. The range of applications for CV include: interactive art, mines inspection, stitching maps on the web or through advanced robotics.

An image is a matrix of values. A black and white image is a matrix of gray-scale values (0-255, with 0 = black, and 255 = white). Each value represents a 'pixel', which is also the name chosen by Google to lead it's new range of iconic handphones in a push to tackle the iPhone's market share. Resolution is the number of pixels in a screen, this can be specified in length x width.

An image can be described as an array, or a list of lists. It can be defined by a function of x and y coordinates f(x,y). A video is a list containing a list of lists. It can be defined as a function with a time dimension f(x,y,t).

With an understanding of the fundamental concepts of pixels, images, and videos you can appreciate the world of digital image processing, and how computer's see, 'computer vision'.

OpenCV is great library of programming functions mainly aimed at real-time computer vision. You can find the Python Documentation here.

Other useful tools/references I have been exploring relating to digital image process or computer vision projects include:

Tags: cv