Rarefied talent in data science, data technology, and analytics

Data Science Knowledge Repo

A central knowledge resource for data scientists / analytics experts

A prevailing characteristic of data scientists is deep intellectual curiosity, a trait that drives them to be passionate learners who pick up new skills of their own volition. Many of the most fascinating but difficult techniques in data science are grounded in hard math and machine learning, e.g. Bayesian inference, nonparametric regression, neural net classifiers, hidden Markov models, evolutionary algorithms, content-based and collaborative filters, and NLP. Data science is so broad and deep that even the most seasoned experts always have something new to learn; there is simply too much collective knowledge for any one person to master.

The purpose of the "Data Science Knowledge Repo" is to provide a central resource that data scientists can revisit frequently to refresh their knowledge or learn new skills. To recommend additions (guides, technical papers, or other resources), email frank@datajobs.com.

A

Auto-Regressive Models

B

Bayesian Inference

C

Collaborative Filtering

Collaborative Filtering Technical Paper – Koren & Bell

Clustering Methods

D

Decision Tree Learning

Dominance Analysis

Dominance Weights – Nathans et al.

E

Ensemble Methods

Expectation-Maximization Algorithm

F

Factor Analysis

Factor Analysis Guide – Peter Tryfos

Fixed Effects Models

Fixed Effects Models & Random Effects Models – Clark & Linzer

G

Genetic Algorithms

Genetic Algorithm Guide – Tom Mathew

Gradient Descent

H

Hidden Markov Models

Hierarchical Bayes Models

I

Independent Component Analysis (ICA)

J

K

K-Means Clustering

K-Means Clustering Basics – Andrew Ng

L

Linear Algebra

Linear Discriminant Analysis (LDA)

M

Machine Learning

Markov Chain Monte Carlo (MCMC)

Markov Chain Monte Carlo Guide – Andrieu et al.

N

Naive Bayes

Naive Bayes Guide – Kevin Murphy

Natural Language Processing (NLP)

Neural Nets

O

Ordinary Least-Squares

OLS Regression Basics – G. D. Hutcheson

P

Principal Component Analysis (PCA)

Probability Theory

Probability Theory Review – Maleki & Do

Q

R

R (Statistical Computing Software)

R Programming Guide – Norman Matloff

Recommender Systems

Regression Analysis

S

SAS (Statistical Computing Software)

SAS/STAT Guide – SAS Institute

Singular Value Decomposition (SVD)

Supervised Learning

Support Vector Machines (SVM)

T

Time-Series Analysis

U

Unsupervised Learning

Unsupervised Learning General Guide – Zoubin Ghahramani

V

W

X

Y

Z