Have you ever wondered what Machine Learning can do for you? This post is a brief introduction to the field.
What is Machine Learning (ML)? It means training a computer to predict a value or to recognize objects, speech, letters, musical notes, and so on. This is called supervised learning because the output label is known for every training example.
Alternatively, you can let the algorithm find a set of clusters in the data, which could be useful for finding market segments, for example. This is called unsupervised learning because the output label is not defined.
Training is done using historical data. The input data (called features in ML) can be multi-dimensional: for example, a house price can be predicted from the house's size, its number of bedrooms, its facilities, and so on.
A matrix of pixels can also serve as input. Once the model has been trained correctly, its learned parameters (a matrix called theta) can be applied to new input, for example to recognize a digit in an image.
Based on some input parameters you can predict a value. This could be the house price mentioned before, where the input is the size, the number of bedrooms, and so on. It can also be used to predict a trend. For example, if a company’s sales have increased steadily every month for the past few years, fitting a line to the monthly figures would show the upward trend in sales. After creating the trend line, the company could use the slope of the line to forecast sales in future months.
Remember that a prediction may turn out to be wrong, but if the model is correct it gives a good indication of what to expect. You can then act on that indication, which might be a valuable thing to do.
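To make the trend line idea concrete, here is a minimal sketch of an ordinary least-squares fit in plain Python. The function names `fit_trend_line` and `forecast` and the monthly sales figures are made up for illustration; a real project would more likely use a library.

```python
from statistics import mean


def fit_trend_line(months, sales):
    """Least-squares fit of: sales = intercept + slope * month."""
    mean_x, mean_y = mean(months), mean(sales)
    # Spread of the months around their mean, and how sales move with them.
    sxx = sum((x - mean_x) ** 2 for x in months)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(months, sales))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def forecast(month, slope, intercept):
    """Read the predicted sales for a given month off the trend line."""
    return intercept + slope * month


# Hypothetical sales figures for six consecutive months.
months = [1, 2, 3, 4, 5, 6]
sales = [100, 110, 120, 130, 140, 150]

slope, intercept = fit_trend_line(months, sales)
next_month_sales = forecast(7, slope, intercept)
print(f"slope={slope:.1f}, forecast for month 7={next_month_sales:.1f}")
```

The slope is the expected increase in sales per month, which is exactly the number the company would use to look ahead.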
Assigning a label to input data is a powerful tool with many applications.
One of the most famous examples of classification is spam detection. Based on input data (the email body, the sender, etc.) a classification algorithm can detect whether an email is spam or not.
Another example is detecting a car’s license plate and then reading its letters. This might be useful when you are developing a traffic enforcement system, saving the whole traffic enforcement organization a lot of time (you would remove the need to enter the license plate number manually).
There are many other applications, such as detecting musical notes, classifying music, speech recognition, object recognition, and much more.
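As an illustration of the spam example, the sketch below trains a tiny naive Bayes classifier on word counts. Naive Bayes is just one possible choice of algorithm, and the function names and the four training emails are invented for this example.

```python
import math
from collections import Counter


def train_naive_bayes(emails):
    """emails: list of (text, label) pairs, where label is 'spam' or 'ham'."""
    word_counts = {"spam": Counter(), "ham": Counter()}
    label_counts = Counter()
    for text, label in emails:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocabulary = set(word_counts["spam"]) | set(word_counts["ham"])
    return word_counts, label_counts, vocabulary


def classify(text, model):
    """Return the label with the highest log-probability for this text."""
    word_counts, label_counts, vocabulary = model
    total_emails = sum(label_counts.values())
    scores = {}
    for label in label_counts:
        # Start from the prior: how common is this label overall?
        score = math.log(label_counts[label] / total_emails)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocabulary:
                continue  # ignore words never seen during training
            # Add-one (Laplace) smoothing avoids taking log(0).
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocabulary)))
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical training data.
training_emails = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lunch on monday", "ham"),
]
model = train_naive_bayes(training_emails)
print(classify("free money", model))
print(classify("monday meeting", model))
```

A real spam filter would train on many thousands of emails and use more features than single words, such as the sender.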
In some cases, you would like to identify segments or clusters in your data. This could be useful to find out what kind of segments your market has or to reduce the number of colors in an image. You can also find out how groups form in social networks.
Clustering also plays a role in search engines, which try to group similar objects into one cluster and keep dissimilar objects far apart. The results for a query are then taken from the objects clustered closest to the data being searched for.
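One common clustering algorithm is k-means, sketched below in plain Python. The starting centroids are passed in explicitly to keep the example reproducible, and the points are made-up two-dimensional data (think of two features per customer); the function names are assumptions for this sketch.

```python
def squared_distance(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign_clusters(points, centroids):
    """For each point, return the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda i: squared_distance(p, centroids[i]))
        for p in points
    ]


def update_centroids(points, assignments, centroids):
    """Move each centroid to the mean of the points assigned to it."""
    new_centroids = []
    for i, old in enumerate(centroids):
        members = [p for p, a in zip(points, assignments) if a == i]
        if not members:
            new_centroids.append(old)  # an empty cluster keeps its centroid
            continue
        new_centroids.append(tuple(sum(c) / len(members) for c in zip(*members)))
    return new_centroids


def k_means(points, initial_centroids, max_iterations=100):
    centroids = [tuple(c) for c in initial_centroids]
    assignments = assign_clusters(points, centroids)
    for _ in range(max_iterations):
        new_centroids = update_centroids(points, assignments, centroids)
        if new_centroids == centroids:
            break  # converged: the centroids stopped moving
        centroids = new_centroids
        assignments = assign_clusters(points, centroids)
    return centroids, assignments


# Hypothetical data with two visibly separate groups.
points = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
centroids, assignments = k_means(points, [(0.0, 0.0), (10.0, 10.0)])
print(assignments)
```

Note that no labels are supplied anywhere: the algorithm discovers the two groups from the data alone, which is what makes it unsupervised.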
An anomaly is something that deviates from what is standard, normal, or expected. What could that be in your business? In financial systems, this may be fraud, and in manufacturing systems, this may be a defect. In software, this could be suspicious behavior in the system, for example.
Once an anomaly detection algorithm has been implemented successfully, it can raise a flag or trigger automated actions. Could this be useful for you?
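A very simple way to detect anomalies is to learn what "normal" looks like from historical data and flag values that fall far outside it. The sketch below assumes the data is roughly bell-shaped and uses a threshold of three standard deviations; the function names, the threshold, and the transaction amounts are all illustrative assumptions.

```python
from statistics import mean, pstdev


def fit_normal_behavior(values):
    """Estimate the mean and standard deviation of normal historical data."""
    return mean(values), pstdev(values)


def is_anomaly(value, mu, sigma, threshold=3.0):
    """Flag a value that lies more than `threshold` standard deviations from the mean."""
    return abs(value - mu) > threshold * sigma


# Hypothetical transaction amounts considered normal.
normal_amounts = [48, 52, 50, 49, 51, 50, 47, 53]
mu, sigma = fit_normal_behavior(normal_amounts)

for amount in (51, 500):
    if is_anomaly(amount, mu, sigma):
        print(f"{amount}: raise a flag")
    else:
        print(f"{amount}: looks normal")
```

In practice the threshold is tuned on known examples, because setting it too low produces many false alarms and setting it too high misses real anomalies.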
Recently, I’ve seen some interesting Machine Learning applications. Some of them:
As you can see, Machine Learning is an interesting field. Have you thought about what it can do for you? Mark de Jong (at Vectos) recently completed a course on Machine Learning at Coursera. If you have some ideas, let’s have a coffee!