
Chicago

Chicago ridership data


Description

Chicago ridership data

Details

These data are from Kuhn and Johnson (2020) and contain an abbreviated training set for modeling the number of people (in thousands) who enter the Clark and Lake L station.

The date column corresponds to the current date. The columns with station names (Austin through California) are a sample of the columns used in the original analysis (reduced for file size reasons). These are 14-day lag variables: each value is taken from 14 days before the row's date (i.e., date - 14 days). There are also columns related to weather and sports team schedules.
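To illustrate what a 14-day lag means, here is a minimal base R sketch on a made-up daily series. The `toy` data frame and its `rides` column are hypothetical and are not part of the Chicago data; the sketch only shows how a value at `date - 14` lines up with the current date.

```r
# Toy daily series; the values are invented for illustration only
toy <- data.frame(
  date  = seq(as.Date("2020-01-01"), by = "day", length.out = 20),
  rides = seq_len(20)
)

# For each row, look up the value recorded 14 days earlier.
# Rows with no observation 14 days back get NA.
toy$rides_lag14 <- toy$rides[match(toy$date - 14, toy$date)]

# Show the rows around the point where lagged values first become available
print(toy[13:17, ])
```

The first 14 rows have no lagged value, which is one reason lagged predictors shorten the usable part of a series.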

The station at 35th and Archer is contained in the column Archer_35th to make it a valid R column name.
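The reason for the renaming can be checked with base R's `make.names()`, which reports how R would repair a name that is not syntactically valid. The raw name `"35th_Archer"` below is a hypothetical stand-in for the original station label, used only to show that a name starting with a digit is not valid as-is.

```r
# Hypothetical raw label that starts with a digit (not a valid R name)
raw_name <- "35th_Archer"

# make.names() prepends "X" to names that begin with an invalid character
make.names(raw_name)
# → "X35th_Archer"

# The name used in the data set is already valid, so it is left unchanged
make.names("Archer_35th")
# → "Archer_35th"
```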

Value

Chicago

a tibble

stations

a vector of station names

Source

Kuhn and Johnson (2020), Feature Engineering and Selection, Chapman and Hall/CRC. https://bookdown.org/max/FES/ and https://github.com/topepo/FES

Examples

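# Load the data set
data(Chicago)
# Inspect the column types and first few values
str(Chicago)
# Print the vector of station names
stations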
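As a possible next step, the `stations` vector can be used to pick out the lagged station columns. This is a sketch under stated assumptions: it assumes the modeldata package is installed, that `data(Chicago, package = "modeldata")` makes `stations` available as in the example above, and that the entries of `stations` match column names in `Chicago`. The names `station_cols` and `lagged` are introduced here for illustration.

```r
# Runs only if modeldata is installed; otherwise the block is skipped
if (requireNamespace("modeldata", quietly = TRUE)) {
  data(Chicago, package = "modeldata")

  # Keep only station names that are actually columns, in case they differ
  station_cols <- intersect(stations, names(Chicago))

  # The date plus the 14-day lagged station predictors
  lagged <- Chicago[, c("date", station_cols)]
  print(dim(lagged))
}
```

Using `intersect()` keeps the selection from failing if any station name is missing from the table.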

modeldata

Data Sets Useful for Modeling Packages

v0.1.0
MIT + file LICENSE
Authors
Max Kuhn [aut, cre], RStudio [cph]
Initial release
