Apache Spark
Follow your interests on GitHub
Join GitHub and subscribe to get your personalized daily, weekly, or monthly email digest of new and trending projects, topics, and more that match your favorite interests.
Sign up for free See pricing for teams and enterprisesGetting started with machine learning
Today, machine learning—the study of algorithms that make data-based predictions—has found a new audience and a new set of possibilities.
Apache Hadoop
A curated list of awesome computer vision resources
Assorted data from the General Services Administration.
An index of all open-source data
An unofficial repository of National Park Service data.
Data and code behind the articles and graphics at FiveThirtyEight
Cool links & research papers related to Machine Learning applied to source code (MLonCode)
ID3-based implementation of the ML Decision Tree algorithm
A toolkit for developing and comparing reinforcement learning algorithms.
Reinforcement learning resources curated
Principal Component Analysis on music loops
Ruby gem to calculate the similarity between texts using tf*idf
Large-scale linear classification, regression and ranking in Python
scikit-learn: machine learning in Python
An Open Source Machine Learning Framework for Everyone
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic …