Open source data science curriculum

The Data Science Curriculum

  • Intro to Data Science UW / Coursera
    • Topics: Python NLP on Twitter API, Distributed Computing Paradigm, MapReduce/Hadoop & Pig Script, SQL/NoSQL, Relational Algebra, Experiment design, Statistics, Graphs, Amazon EC2, Visualization.




  • Coursework
    • Sentiment analysis, trending topics, and friendship mapping with Twitter API
    • Joins and Matrix Manipulation in MapReduce (AWS EC2)
    • In-database Text analysis (SQL)
  • Sentiment analysis of movie tweets (Python)
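
To give a flavor of the "Joins in MapReduce" coursework item, here is a minimal sketch of a reduce-side join in plain Python. It is not the course's actual assignment code: the table names (users, tweets), the toy records, and all function names are assumptions made up for illustration, and the shuffle step that Hadoop would do across machines is simulated in memory.

```python
from collections import defaultdict
from itertools import product


def map_phase(records):
    """Emit (join_key, (table_name, row)) pairs, one per input record."""
    for table_name, key, row in records:
        yield key, (table_name, row)


def shuffle(pairs):
    """Group mapped values by key, standing in for the framework's shuffle."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values, left="users", right="tweets"):
    """Pair every left-table row with every right-table row sharing the key."""
    left_rows = [row for name, row in values if name == left]
    right_rows = [row for name, row in values if name == right]
    for l_row, r_row in product(left_rows, right_rows):
        yield key, l_row, r_row


def mapreduce_join(records):
    """Run map, shuffle, and reduce in sequence and collect the joined rows."""
    groups = shuffle(map_phase(records))
    joined = []
    for key in sorted(groups):
        joined.extend(reduce_phase(key, groups[key]))
    return joined


# Hypothetical toy data: (table, join key, payload).
records = [
    ("users", 1, "ada"),
    ("users", 2, "grace"),
    ("tweets", 1, "hello world"),
    ("tweets", 1, "map then reduce"),
    ("tweets", 3, "orphan tweet"),
]
result = mapreduce_join(records)
```

Only keys present in both tables produce output, so this behaves like an inner join: user 2 has no tweets and tweet key 3 has no user, and both are dropped.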

A Note on Tools

This degree is brought to you by: “THE INTERNET”.

Information is more democratized now than at any point in history. Given a little initiative and interest, you can tailor, and excel in, an education of your own design. The connected web made me what I am today: I grew from a child obsessed with Number Munchers into an adult whose jaw drops over DBSCAN.

The most valuable resources:



