Season 2 of the R-Podcast is up and running! This episode begins a multi-part series on reproducible analysis using R. In this episode I discuss the usage of Sweave and LaTeX for producing reproducible reports, an introduction to the capabilities of the knitr package (more episodes will be coming dedicated to this package), and my motivation for adapting reproducible analysis techniques and tools into my workflow. In our listener feedback segment I discuss a new means of providing feedback to the R-Podcast using our new sub-reddit page and introduce new segments highlighting interesting stories around the R community and useful packages. This promises to be an exciting season of the R-Podcast, and I hope you enjoy this episode!
I’m happy to present episode 10 of the R-Podcast! Season 1 of the R-Podcast concludes with part 2 of my series on data munging, in which I discuss issues surrounding importing data sets contained in HTML tables. I share how I used the XML and RCurl packages to validate and import data from hockey-reference.com for storage into a MySQL database. Our listener feedback segment contains another installment on the Pitfalls of R contributed by listener Frans. I want to thank everyone who has provided such positive feedback throughout the season, and I’m looking forward to providing some exciting new content for season 2. I hope you enjoy the episode and check out our new contact page if you would like to provide any feedback. Thanks for listening!
It’s great to be back with a new episode after an eventful break! This episode begins a series on my adventures in data munging, a.k.a data processing. I discuss three issues that demonstrate the flexibility and versatility R brings for recoding messy values, important inconsistent data files, and pinpointing problematic observations and variables. We also have an extended listener feedback segment with an audio installment of the “pitfalls” of R contributed by listener Frans. I hope you enjoy this episode and keep passing along your feedback to theRcast(at)gmail.com and stop by the forums as well!
I’m happy to present this jam-packed episode of the R-Podcast dedicated to using the ggplot2 package for visualization. This episode will have a companion screencast released in the next few days. I use data from the Hockey Summary Project to demonstrate how to create a series of boxplots of NHL regular season attendance for each team. The R code used in this episode will be available via GitHub. I also extend my thanks to the Going Linux podcast for plugging the R-Podcast. If you are interested in providing a listener tip about R, please call the voicemail hotline at +1-269-849-9780 or record an audio clip and send it tothe Rcast(at)gmail.com. Please keep the wonderful feedback coming and hope you enjoy this episode!
Hello everybody, I am finally back with a new episode! In this episode: Hardware issues, major update to RStudio, new forums, and discussion on managing your workflow for projects. I discuss useful functions for executing R scripts and saving/loading R objects for future sessions, and summarize different solutions for organizing R code based on task and via the ProjectTemplate package, along with the importance of version control. Please check out the new forums and let me know what you think! If you are interested in providing a listener tip about R, please call the voicemail hotline at +1-269-849-9780 or record a short mp3 or ogg audio clip and send it to theRcast(at)gmail.com . As always I welcome any other feedback you have. Thanks for listening! P.S. From our Google Plus page, Darren pointed out that I switched forward slashes with backward slashes in my discussion about file paths in Episode 6. Thanks Darren!