predicting-march-madness - Machine learning tutorial to create an entry for the Kaggle March Mania contest

  •        16

Kaggle’s March Madness prediction competition is an accessible introduction to machine learning. If you happen to like college basketball, you’ll like that in this competition you can’t bust your bracket, since you make a prediction for every game. Plus this year there’s a big prize pool, and luck plays a big enough role that you can be a legit contender fairly easily. In 2016, my simple process using tidyverse functions in R placed in the top 10%. I refined it a bit for 2017 and finished in the top 25%.



Related Projects

Madness - Recursive Descent Into Madness

  •    Swift

Your parsers can produce your own model objects directly, making Madness ideal for experimenting with grammars, for example in a playground. See Madness.playground for some examples of parsing with Madness.

CG Madness / CG Portals

  •    C++

CG Madness is based on the classic game Marble Madness. It is running on OpenGL and uses current CG techniques. It also has an editor where you can create your own levels. If you have done so and you want to share the level, you can upload it.

Marble and Madness

  •    C++

Marble Madness OpenGL (Game amp; Editor)

Morse Code Madness

  •    VB

Morse Code Madness - Helps you learn Morse Code. Morse Code Trainer, Morse Code Translater, Morse Code QSO Practice, Morse Code Transceiver. Sharpen your Morse Code skills.

crowds - The Wisdom and/or Madness of the Crowds

  •    HTML

The Wisdom and/or Madness of the Crowds

datacamp-community-tutorials - Tutorials for DataCamp (

  •    Jupyter

Files related to articles written for the DataCamp Community. This machine learning tutorial for beginners was first published on the DataCamp Blog on the 25th of March 2015. It was recenlty updated on the DataCamp Community on the 11th of April 2017. You can find the article here.

deep-learning-book - Repository for "Introduction to Artificial Neural Networks and Deep Learning: A Practical Guide with Applications in Python"

  •    Jupyter

Repository for the book Introduction to Artificial Neural Networks and Deep Learning: A Practical Guide with Applications in Python. Deep learning is not just the talk of the town among tech folks. Deep learning allows us to tackle complex problems, training artificial neural networks to recognize complex patterns for image and speech recognition. In this book, we'll continue where we left off in Python Machine Learning and implement deep learning algorithms in PyTorch.

AIX360 - Interpretability and explainability of data and machine learning models

  •    Python

The AI Explainability 360 toolkit is an open-source library that supports interpretability and explainability of datasets and machine learning models. The AI Explainability 360 Python package includes a comprehensive set of algorithms that cover different dimensions of explanations along with proxy explainability metrics. The AI Explainability 360 interactive experience provides a gentle introduction to the concepts and capabilities by walking through an example use case for different consumer personas. The tutorials and example notebooks offer a deeper, data scientist-oriented introduction. The complete API is also available.

Conjecture - Scalable Machine Learning in Scalding

  •    Java

Conjecture is a framework for building machine learning models in Hadoop using the Scalding DSL. The goal of this project is to enable the development of statistical models as viable components in a wide range of product settings. Applications include classification and categorization, recommender systems, ranking, filtering, and regression (predicting real-valued numbers). Conjecture has been designed with a primary emphasis on flexibility and can handle a wide variety of inputs. Integration with Hadoop and scalding enable seamless handling of extremely large data volumes, and integration with established ETL processes. Predicted labels can either be consumed directly by the web stack using the dataset loader, or models can be deployed and consumed by live web code. Currently, binary classification (assigning one of two possible labels to input data points) is the most mature component of the Conjecture package.There are a few stages involved in training a machine learning model using Conjecture.

MLPB - Machine Learning Problem Bible | Problem Set Here >>

  •    R

I have this problem... I need to classify something as A, B or C using a combination of numeric and categorical features. If I could find a similar problem, maybe I could modify the solution to work for my needs. This is where MLPB steps in. Want to see machine learning problems with sparse data? Got it. Want to compare Scikit-learn’s RandomForestRegressor with R’s randomForest? Got it. Need an example of predicting a ranked target variable? Got it.

Reinforcement-Learning-Notebooks - A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python

  •    Jupyter

I wrote these notebooks in March 2017 while I took the COMP 767: Reinforcement Learning [5] class by Prof. Doina Precup at McGill, Montréal. I highly recommend you to go through the class notes and references of all the papers the intructors have posted on the website. These notebooks should be used while you read the book and go beyond the same with the referenced papers. I would suggest watching David Silver's videos and reading the book simultaneously. And when you are done with a few chapters, start implementing them. The algorithms follow a pattern and mostly are variants of each other. I have tried my best to explain each notebook's results and possible future directions.

Machine-Learning-with-R-datasets - Formatted datasets for Machine Learning With R by Brett Lantz


Machine Learning with R by Brett Lantz is a book that provides an introduction to machine learning using R. As far as I can tell, Packt Publishing does not make its datasets available online unless you buy the book and create a user account which can be a problem if you are checking the book out from the library or borrowing the book from a friend. All of these datasets are in the public domain but simply needed some cleaning up and recoding to match the format in the book.

Learning-Library-for-PHP - The rudimentary workings of a machine learning library in PHP.

  •    PHP

Some machine learning/artificial intelligence/natural language processing algorithms implemented in PHP. Note that in almost all cases, PHP as it stands today is the wrong tool for most machine learning jobs. This library provides a pedagogical introduction to these tools more than it is a recommendation that it is used for day-to-day development. Copyright (C) 2011-2015 Giuseppe Burtini and contributors as appropriate.

introduction_to_ml_with_python - Notebooks and code for the book "Introduction to Machine Learning with Python"

  •    Jupyter

This repository holds the code for the forthcoming book "Introduction to Machine Learning with Python" by Andreas Mueller and Sarah Guido. You can find details about the book on the O'Reilly website. The books requires the current development version of scikit-learn, that is 0.18-dev. Most of the book can also be used with previous versions of scikit-learn, though you need to adjust the import for everything from the model_selection module, mostly cross_val_score, train_test_split and GridSearchCV.

Theano-Tutorials - Bare bones introduction to machine learning from linear regression to convolutional neural networks using Theano

  •    Python

Bare bones introduction to machine learning from linear regression to convolutional neural networks using Theano. Dataset It's worth noting that this library assumes that the reader has access to the mnist dataset. This dataset is freely available and is accessible through Yann LeCun's personal website.

Learning Classifier System Trainer


NOT in a publishable state, but published otherwise it will be removed! Please wait until March 2010. Thanks. This project demonstrates an implementation of a learning classifier system, which allows students to modify the classifier behaviour. It is developed in C# and WP

ISLR-python - An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code

  •    Jupyter

This repository contains Python code for a selection of tables, figures and LAB sections from the book 'An Introduction to Statistical Learning with Applications in R' by James, Witten, Hastie, Tibshirani (2013). 2018-01-15: Minor updates to the repository due to changes/deprecations in several packages. The notebooks have been tested with these package versions. Thanks @lincolnfrias and @telescopeuser.

countdownjs - A simple JavaScript API for producing an accurate, intuitive description of the timespan between two Date instances

  •    Javascript

A simple JavaScript API for producing an accurate, intuitive description of the timespan between two Date instances. While seemingly a trivial problem, the human descriptions for a span of time tend to be fuzzier than a computer naturally computes. More specifically, months are an inherently messed up unit of time. For instance, when a human says "in 1 month" how long do they mean? Banks often interpret this as thirty days but that is only correct one third of the time. People casually talk about a month being four weeks long but there is only one month in a year which is four weeks long and it is only that long about three quarters of the time. Even intuitively defining these terms can be problematic. For instance, what is the date one month after January 31st, 2001? JavaScript will happily call this March 3rd, 2001. Humans will typically debate either February 28th, 2001 or March 1st, 2001. It seems there isn't a "right" answer, per se.

Sitepoint-source - Source code for my articles on Sitepoint (since March 2015)

  •    Ruby

This repo contains source code for my articles on SitePoint (starting from March 2015). Source code for previous articles can be found in the corresponding repos.

AAI Portal

  •    PHP

The AAI portal is an authentication brooker for the SWITCHaai infrastructure, based on the Shibboleth middleware, and many e-learning platforms. Resource adapters are available for Blackboard CE and Vista, WebCT CE and Vista, VITELS, Gridlab, AdLearn, .. This project is no longer actively maintained. The last released version was 1.4.5 in March 2011.

We have large collection of open source products. Follow the tags from Tag Cloud >>

Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.