For details, see: https://www.kaggle.com/c/boston-housing/overview
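As the figure shows, the data here is not uniformly formatted, so it cannot be split on a single fixed delimiter and needs special handling. For details, see: https://www.jianshu.com/p/0325bf1dbdf7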
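A minimal sketch of one way to handle this, assuming the irregularity is a varying number of spaces between fields (the linked article may take a different approach). The name `parse_rows` and the two sample rows are illustrative and not taken from the original file:

```python
# Illustrative rows with uneven spacing between fields (layout is an assumption).
RAW = """\
 0.00632  18.00   2.310  0  0.5380  6.5750  65.20  4.0900   1  296.0  15.30 396.90   4.98  24.00
 0.02731   0.00   7.070  0  0.4690  6.4210  78.90  4.9671   2  242.0  17.80 396.90   9.14  21.60
"""


def parse_rows(text):
    """Split each non-empty line on runs of whitespace and convert fields to float."""
    rows = []
    for line in text.splitlines():
        fields = line.split()  # no argument: splits on any run of whitespace
        if not fields:
            continue  # skip blank lines
        rows.append([float(tok) for tok in fields])
    return rows


rows = parse_rows(RAW)
print(len(rows), [len(r) for r in rows])
```

If pandas is available, `pandas.read_csv(path, sep=r"\s+", header=None)` should give a similar result, again assuming whitespace is the only separator.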
Housing Values in Suburbs of Boston
The medv variable is the target variable.
Data description
The Boston data frame has 506 rows and 14 columns.
This data frame contains the following columns:
crim
per capita crime rate by town.
zn
proportion of residential land zoned for lots over 25,000 sq.ft.
indus
proportion of non-retail business acres per town.
chas
Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).
nox
nitrogen oxides concentration (parts per 10 million).
rm
average number of rooms per dwelling.
age
proportion of owner-occupied units built prior to 1940.
dis
weighted mean of distances to five Boston employment centres.
rad
index of accessibility to radial highways.
tax
full-value property-tax rate per $10,000.
ptratio
pupil-teacher ratio by town.
black
1000(Bk - 0.63)^2, where Bk is the proportion of blacks by town.
lstat
lower status of the population (percent).
medv
median value of owner-occupied homes in $1000s.
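The column list above can be turned into labels for a parsed row. This is a rough sketch: it assumes the file stores the 14 values in the order listed, the names `COLUMNS` and `black_index` are hypothetical helpers, and the sample row is illustrative. It also evaluates the `black` formula for a given Bk.

```python
# Column names in the order listed above (the order within a given file is an assumption).
COLUMNS = ["crim", "zn", "indus", "chas", "nox", "rm", "age",
           "dis", "rad", "tax", "ptratio", "black", "lstat", "medv"]


def black_index(bk):
    """Evaluate 1000 * (Bk - 0.63)^2 for a proportion Bk between 0 and 1."""
    return 1000 * (bk - 0.63) ** 2


# One illustrative row of 14 values, labelled by column name.
row = [0.00632, 18.0, 2.31, 0, 0.538, 6.575, 65.2, 4.09,
       1, 296.0, 15.3, 396.9, 4.98, 24.0]
record = dict(zip(COLUMNS, row))

# medv is the target variable; the other 13 columns are features.
features = {name: value for name, value in record.items() if name != "medv"}
target = record["medv"]
```

Under this formula, Bk = 0 gives a value of roughly 396.9 and Bk = 0.63 gives 0, so the transform is not monotonic in Bk.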