Top products from r/rstats

We found 27 product mentions on r/rstats. We ranked the 46 resulting products by number of redditors who mentioned them. Here are the top 20.

Next page

Top comments that mention products on r/rstats:

u/TheTrub · 3 pointsr/rstats
  1. The R Book is probably the most widely used text for beginners, but if you know what types of data you will be working with and the types of analyses you want to run, there are more specific texts that can help walk you through. On the other hand, there are a ton of online resources you can access for free. The only time I've bought a book for R was for timeseries analysis. I bought it because I needed an introduction to the topic and it was handy having the accompanying script being written in R.

  2. R works perfectly fine with both mac and windows. Unlike most people, though, I prefer not to use RStudio. To me it's just easier to keep everything organized by use the developer's console and writing and organizing my output in Notepad++.

    For hardware demands, large datasets won't be as big of an issue as model complexity, but given that modern computing demands keep going up, I think 16 GB should be standard for anyone getting a new computer. You'll be less likely to need to make an upgrade in the near future when OS and other software demands require you to do so. The bigger thing to consider is the read/write speed of your hard drive, so definitely go with an SSD. This will also help with power consumption and durability. For your CPU, R doesn't default to using multiple cores, so you'll want to look at single-core benchmarks when picking out your machine. With that being said, R does use multiple threads on a single core, so I wouldn't settle for an i5. But if you're using MCMC, a multicore CPU can be a lifesaver since you can assign your chains to run on different cores in parallel.

    I hope that helps. Good luck with your purchase!
u/Luonnon · 3 pointsr/rstats

Quick and dirty answer: speaking very broadly, random forests -- found in the "randomForest" package -- tend to win battle-of-the-algorithms type studies. If you just want to play with a single model, I'd recommend starting with that and looking at the help for it.

Longer and better answer: Your best bet to answering all these questions and getting a good handle on data mining/predictive analytics is this book: Applied Predictive Modeling. The book references the "caret" package quite a bit, since the package's author is the same person. With it, you can train a lot of different types of models for regression or classification optimizing for accuracy, RMSE, ROC, etc. It provides a standard API for playing with models and makes your life much, much easier. It has its own website here.

u/enilkcals · 9 pointsr/rstats

My advice would be to find a dataset for yourself that you wish to analyse in some manner and work through that using the references you have and Stackoverflow to search for solutions to problems you encounter (and asking when you can't find a solution).

This is because, in my experience, most exercises are canned and have perfect working solutions provided, yet when you get to the real world working on your own data things are never perfect and you will have a lot of problems to solve.

One very useful thing I can recommend is to start learning how to use Knitr as a basis for making your work-flow from importing data through to producing reports (in LaTeX or HTML via R markdown) completely reproducible.

A couple of useful references though are the following books which all have R examples...

u/socialpsychonline · 7 pointsr/rstats

I can recommend Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence by Judith Singer & John Willett.

The second half of the book considers survival analysis, and R code for the examples from each chapter of the book are available here -- in addition to code for SAS, SPSS Stata, etc.

Full disclosure: I have only worked through the first half of the book so far (growth curve modeling), but the text is very complete and the code on that webpage has been really helpful. I imagine the section on survival analysis is similarly helpful.

u/Calibandage · 2 pointsr/rstats

Deep Learning With Python is very good for practical application, as is the course at fast.ai. For theory, people love Goodfellow.

u/urmyheartBeatStopR · 2 pointsr/rstats

> I'd like to know, how did you learn to use R?

My batshit crazy lovable thesis advisor was teaching intro datascience in R.

He can't really lecture and he have high expectation. The class was for everybody including people that don't know how to program. The class book was advance R http://adv-r.had.co.nz/... (red flag).

We only survived this class because I had a cs undergrad background and I gave the class a crash course once. Our whole class was more about how to implement his version of random forest.

I learned R because we had to implement a version of Random forest with Rpart package and then create a package for it.

Before this a dabble in R for summer research. It was mostly cleaning data.

So my advice would be to have a project and use R.

>how did you learn statistics?

Master program using the wackerly book and chegg/slader. (https://www.amazon.com/Mathematical-Statistics-Applications-Dennis-Wackerly/dp/0495110817)

It's a real grind. You need to learn probability first before even going into stat. Wackerly was the only real book that break down the 3 possible transformations (pdf,cdf, mgf).

u/factotumjack · 3 pointsr/rstats

Feedback would be greatly appreciated on this one. It's a prototype analysis for a book idea I had.

I'm toying with the idea of writing a book about statistical analyses of classic games. The target audience would be mathematically interested laypeople, much like Jeffrey Rosenthal's book Struck by Lightning ( https://www.amazon.ca/Struck-Lightning-Jeffrey-S-Rosenthal/dp/0006394957 ).

The twist would be that chapter would contain step-by-step R code or Python code so that the reader could do the same analysis and make changes based on their own questions.

u/sven_ftw · 1 pointr/rstats

I really enjoy using this book for reference material. It depends on what you are trying to learn, though. You won't find code examples in it.

What you will find, however, is a ton of different methods and examples of how to apply them (contextually). You'd probably need to have a least a basic idea of how to begin the analysis to make it useful. (For instance, I'm modeling a binary event, I need a logit or probit, start w/ that chapter; or I'm modeling a rank, Tobit).

u/ahcomochingas · 3 pointsr/rstats

I recommend you to first study probability theory, (formal definitions, variable independence, moments, limit theorems, and some distributions) this is my favorite course, but it's in spanish http://lya.fciencias.unam.mx/lars/0625/, then I recommend this course in MIT: http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-041sc-probabilistic-systems-analysis-and-applied-probability-fall-2013/ it's going to take you a couple months to finish it, it's a lot of material, you can complement the material with this course http://oli.cmu.edu/courses/free-open/statistics-course-details/ (it has some r exercises). and in the road, try to make all the ocw homeworks on r, using Rmd, and some latex. Use the R book as a reference http://www.amazon.com/The-Book-Michael-J-Crawley/dp/0470973927

Learning R, is more about learning the math behind probability and the basic concepts of statistics than learning the language.

u/efrique · 2 pointsr/rstats
  1. Google resources for beginners R and also google introduction R (I particularly recommend google for this search).

  2. Try R-bloggers, such as this search

  3. There's this document, now also a book

  4. There's a lot of help to be found on the stackexchange network. e.g. 1
    , 2, 3, 4, 5, 6, 7, 8, 9


  5. descriptives: summary, by, aggregate, apply, tapply, sapply etc
    ...... also plot, hist, boxplot, stripchart, barplot, dotchart etc
    t-tests: t.test (paired-t, independent samples Welch t, independent samples equal-variance t),
    ANOVA: lm, aov. Also anova itself produces anova tables from a variety of model output
    correlation: cor
    regression: lm


u/norsurfit · 1 pointr/rstats

Three very good introductory textbooks for statistics are:

Mendenhall, Statistics https://www.amazon.com/Introduction-Probability-Statistics-William-Mendenhall/dp/1133103758

Larson and Farber https://www.amazon.com/Elementary-Statistics-Picturing-World-Books/dp/0321901118

Peck and Devore https://www.amazon.com/Statistics-Exploration-Analysis-Available-Titles/dp/0840058012

Buy used, older editions of each, and you can get them very cheap. These are all well explained, beginner texts.

u/vmsmith · 1 pointr/rstats

I didn't know about MLR until this post. So without having spent any time with it whatsoever, I would only say that one of the nice things about the caret package is that you can also leverage Kuhn and Johnson's book, Applied Predictive Modeling, as well as YouTube videos of Max Kuhn discussing caret.

u/jacobcvt12 · 3 pointsr/rstats

Both JAGS and BUGS use the same language and can perform very similar operations. JAGS is more portable across operating systems, so for that reason, I would suggest JAGS (BUGS is generally limited to Windows). However, documentation/blog posts/forum posts (which exist in abundance!) for both languages will generally work for either tool. If you are looking for a textbook, Doing Bayesian Data Analysis provides a nice introduction to both bayesian statistics as well as JAGS.

Outside of JAGS/BUGS, there exists another similar language for performing Bayesian statistics called Stan (also described in the above book). Stan is newer, and often times will "run faster" than JAGS, however it does not directly support as many types of analyses.

My advice would be to learn JAGS while simultaneously learning the basics of Bayesian methods. Once you understand the basics of JAGS, try exploring Stan!

u/dtrillaa · 2 pointsr/rstats

I used data camp and I recommend it because it covers both programming in base R and the full tidyverse library. I did the R programming course and it covered everything from functional programming to object oriented programming in R.

If you don’t want to do a monthly subscription, purchase Hadley Wickams (author of the tidyverse) book R for Data Science

u/cokechan · 2 pointsr/rstats

https://www.amazon.com/Analysis-Regression-Multilevel-Hierarchical-Models/dp/052168689X is the definitive text on the subject. I highly recommend this book to understand the fundamentals of multilevel modeling.

u/editorijsmi · 2 pointsr/rstats

you check the following book

Forecasting models – an overview with the help of R software : Time series - Past ,Present and Future

https://www.amazon.co.uk/dp/B07VFY53B1 (E-Book)

https://www.amazon.com/dp/1081552808 (Paperback)

ISBN: 9781081552800

u/grandzooby · 1 pointr/rstats

You might check out Dr. Gerbing's work at Portland State U. He teaches stats in the business school and uses R.

http://www.pdx.edu/sba/david-gerbing

He even wrote a textbook you might find useful:
http://www.amazon.com/dp/0415657202/

I haven't read it yet, so I don't know how helpful it will be.

u/mghoff330 · 3 pointsr/rstats

Mostly Harmless Econometrics is a classic. It gets into regression, but also design with inference in mind. Combine that with ISLR and you should be set!

u/pulsetsar · 2 pointsr/rstats

This should get you started to port over some existing skills. You can then fill in the rest with one of the suggested references.

R for SAS and SPSS Users (Statistics and Computing) https://www.amazon.com/dp/1461406846/ref=cm_sw_r_cp_awd_2doPwbMVQTRYH