One of the benefits of using R for statistical analysis is the programming language which allows users to define their own functions, which is particularly useful for analysis that needs to be repeated. For example, a monthly output from a database may be provided in a pre-determined format and we might be interested in running the same initial analysis on the data. Read the rest of this entry »
Programming with R – Function Basics
October 20th, 2010Plotting Time Series data using ggplot2
September 30th, 2010There are various ways to plot data that is represented by a time series in R. The ggplot2 package has scales that can handle dates reasonably easily. Read the rest of this entry »
Classification Trees using the rpart function
September 21st, 2010In a previous post on classification trees we considered using the tree package to fit a classification tree to data divided into known classes. In this post we will look at the alternative function rpart that is available within the base R distribution. Read the rest of this entry »
Classification Trees
September 18th, 2010Decision trees are applied to situation where data is divided into groups rather than investigating a numerical response and its relationship to a set of descriptor variables. There are various implementations of classification trees in R and the some commonly used functions are rpart and tree. Read the rest of this entry »
Creating a Presentation with LaTeX Beamer – Frame Transitions
September 4th, 2010Transitions are often used in presentations to break up the presentation and to keep the audience awake, but often the outcome is irritation/distraction with text or other objects flying on or off the screen from different directions. As such they should be used sparingly if at all in a talk. LaTeX beamer has a simple mechanism for including transitions in a presentation Read the rest of this entry »
Charting the performance of cricket all-rounders – IT Botham
August 16th, 2010Cricket is a sport that generates a large volume of performance data and corresponding debate about the relative qualities of various players over their careers and in relation to their contemporaries. The cricinfo website has an extensive database of statistics for professional cricketers that can be searched to access the information in various formats. Read the rest of this entry »
Useful functions for data frames
August 9th, 2010The R software system is primarily command line based so when there are large sets of data it is not easy to browse the data frames. There are various useful functions for working with data frames. Read the rest of this entry »
Creating a Presentation with LaTeX Beamer – Including Images from Graphics Files
August 7th, 2010It will often be more efficient to generate graphics in an external software package and then include these files in a LaTeX beamer presentation. The standard LaTeX approach to including graphics can be utilised to perform this task. Read the rest of this entry »