Reddit reviews Data Points: Visualization That Means Something
We found 2 Reddit comments about Data Points: Visualization That Means Something. Here are the top ones, ranked by their Reddit score.
We found 2 Reddit comments about Data Points: Visualization That Means Something. Here are the top ones, ranked by their Reddit score.
Here's what I'd recommend.
GETTING STARTED WITH DATA SCIENCE
If you're interested in learning data science I'd suggest the following:
 
Tools
Learning these has several benefits: they streamline your workflow. They speed up your learning process, since they are very easy to use. And perhaps most importantly, they really teach you how to think about analyzing data. GGplot2 has a deep underlying structure to the syntax, based on the Grammar of Graphics theoretical framework. I won’t go into that too much, but suffice it to say, when you learn the ggplot2 syntax, you’re actually learning how to think about data visualization in a very deep way. You’ll eventually understand how to create complex visualizations without much effort.
 
Skill Areas
My recommendations are:
(But, again I recommend learning these in R’s ggplot2.) The reason I recommend these is
As with any discipline, you need to learn the foundations first; this will dramatically speed your progress in the intermediate to advanced stages.
I usually suggest learning these with dummy data (for simplicity) but if you have a simple .csv file, that should work to.
After you learn data visualization, I suggest that you “back into” data management. For this, you should find a dataset and learn to reshape it.
The core data management skills:
You can start learning these here. Again, I recommend learning these in R’s dplyr because dplyr makes these tasks very straight forward. It also teaches you how to think about data wrangling in terms of workflow: the “chaining operator” in dplyr helps you wire these commands together in a way that really matches the analytics workflow. dplyr makes it seamless.
ML is sort of like the “data science 301” course vs. the 102 and 103 levels of the data-vis and data manipulation stuff I outlined above.
Here, I’ll just give book recos:
This is a highly regarded introduction
After you get these foundations, then you can move on to specialize in a particular area.
 
OTHER RESOURCES:
Data Visualization
 
TL;DR
I'd recommend learning R for data science before Python. Learn data visualization first (with R's ggplot2), using simple data or dummy data. Then find a more complicated dataset. Learn data manipulation second (with R's dplyr), and practice data manipulation on your more complex data. Learn machine learning last.
Yep, humans perceive differences in length much better than differences in angle. Yau's book Data Points talks about this extensively with examples.