Short Course

Applied Machine Learning for Security Informatics

The use of statistical models in computer algorithms allows computers to make decisions and predictions, and to perform tasks that traditionally require human cognitive abilities. Machine learning is the interdisciplinary field at the intersection of statistics and computer science that develops such statistical models and interweaves them with computer algorithms. It underpins many modern technologies, such as speech recognition, Internet search, bioinformatics and computer vision. Amazon’s recommender system, Google’s driverless car and the most recent imaging systems for cancer diagnosis are all based on machine learning technology.

This course on machine learning explains, through real-world applications, how to build systems that learn and adapt. Topics covered include linear regression, logistic regression, deep neural networks and clustering, among others. The course is project-oriented, with emphasis on writing software implementations of learning algorithms and applying them to real-world security problems, including Fraud Detection, Phishing Detection, HTML Injection Classification, Clustering of Phishing Attackers, Malware Fingerprinting and Criminal Profiling.

Instructor: Dr. Alejandro Correa Bahnsen

Requirements

A good, easy-to-install option that supports Mac, Windows, and Linux, and that bundles all of these packages (and much more), is the Anaconda distribution.
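As a quick sanity check after installation, a short script along these lines can report which packages are not importable. This is only a sketch: the package list is an assumption (NumPy and pandas appear in the session titles, scikit-learn is a guess), and `ASSUMED_PACKAGES` and `missing_packages` are hypothetical names used for illustration.

```python
import importlib.util

# Assumed package list, not taken from the course page:
# display name -> top-level module name used in `import`.
ASSUMED_PACKAGES = {
    "numpy": "numpy",
    "pandas": "pandas",
    "scikit-learn": "sklearn",
}


def missing_packages(packages):
    """Return the display names of packages whose module cannot be found."""
    return [
        name
        for name, module in packages.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages(ASSUMED_PACKAGES)
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All assumed packages are importable.")
```

Adjust the dictionary to match the packages the course actually requires before relying on the result.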

Git: unfortunately outside the scope of this class, but please take a look at these tutorials.

Sessions

| Session | Notebook | Exercises |
| --- | --- | --- |
| 1 | Introduction to Machine Learning | |
| 2 | Introduction to Python | 01 - Python & Numpy |
| 3 | Pandas Data Frame | 03 - Baby names |
| 4 | Linear Regression | 04 - Bikes Rent |
| 5 | Logistic Regression | 05 - Intrusion Detection |
| 6 | Data Preparation and Evaluation | Intrusion Detection |
| 7 | Feature Selection | Intrusion Detection |
| 8 | Decision Trees | Fraud Detection |
| 9 | Ensemble Methods - Bagging | Bagging |
| 10 | Ensemble Methods - Boosting | |
| 11 | Support Vector Machines | Phishing |
| 12 | Deep Learning | |
| 13 | Model Deployment | |
| 14 | Kaggle Competition | |