Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!
Get Started for Free

boston

Boston Housing Data


Description

Data on median housing values from 506 census tracts in the suburbs of Boston from the 1970 census. This data frame is a corrected version of the original data by Harrison and Rubinfeld (1978) with additional spatial information. The data were taken directly from BostonHousing2 and unneeded columns (i.e., name of town, census tract, and the uncorrected median home value) were removed.

Usage

data(boston)

Format

A data frame with 506 rows and 16 variables.

  • lon Longitude of census tract.

  • lat Latitude of census tract.

  • cmedv Corrected median value of owner-occupied homes in USD 1000's

  • crim Per capita crime rate by town.

  • zn Proportion of residential land zoned for lots over 25,000 sq.ft.

  • indus Proportion of non-retail business acres per town.

  • chas Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).

  • nox Nitric oxides concentration (parts per 10 million).

  • rm Average number of rooms per dwelling.

  • age Proportion of owner-occupied units built prior to 1940.

  • dis Weighted distances to five Boston employment centers.

  • rad Index of accessibility to radial highways.

  • tax Full-value property-tax rate per USD 10,000.

  • ptratio Pupil-teacher ratio by town.

  • b $1000(B - 0.63)^2$ where B is the proportion of blacks by town.

  • lstat Percentage of lower status of the population.

References

Harrison, D. and Rubinfeld, D.L. (1978). Hedonic prices and the demand for clean air. Journal of Environmental Economics and Management, 5, 81-102.

Gilley, O.W., and R. Kelley Pace (1996). On the Harrison and Rubinfeld Data. Journal of Environmental Economics and Management, 31, 403-405.

Newman, D.J. & Hettich, S. & Blake, C.L. & Merz, C.J. (1998). UCI Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html] Irvine, CA: University of California, Department of Information and Computer Science.

Pace, R. Kelley, and O.W. Gilley (1997). Using the Spatial Configuration of the Data to Improve Estimation. Journal of the Real Estate Finance and Economics, 14, 333-340.

Friedrich Leisch & Evgenia Dimitriadou (2010). mlbench: Machine Learning Benchmark Problems. R package version 2.1-1.

Examples

head(boston)

pdp

Partial Dependence Plots

v0.7.0
GPL (>= 2)
Authors
Brandon Greenwell [aut, cre] (<https://orcid.org/0000-0002-8120-0084>)
Initial release

We don't support your browser anymore

Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.