Chicago ridership data
Chicago ridership data
These data are from Kuhn and Johnson (2020) and contain an abbreviated training set for modeling the number of people (in thousands) who enter the Clark and Lake L station.
The date
column corresponds to the current date. The columns with station
names (Austin
through California
) are a sample of the columns used in
the original analysis (for file size reasons). These are 14 day lag
variables (i.e. date - 14 days
). There are columns related to weather and
sports team schedules.
The station at 35th and Archer is contained in the column Archer_35th
to
make it a valid R column name.
Chicago |
a tibble |
stations |
a vector of station names |
Kuhn and Johnson (2020), Feature Engineering and Selection, Chapman and Hall/CRC . https://bookdown.org/max/FES/ and https://github.com/topepo/FES
data(Chicago) str(Chicago) stations
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.