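library(ucimlrepo)  # provides fetch_ucirepo()

# Download dataset 211 (Communities and Crime Unnormalized) from the UCI repository
communities_crime <- fetch_ucirepo(id = 211)
# Full table: identifiers, features, and targets together
communities_crime_data <- communities_crime$data$original
dim(communities_crime_data)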
head(communities_crime_data)

Communities and Crime Unnormalized
Webpage: https://archive.ics.uci.edu/dataset/211/communities+and+crime+unnormalized
- Number of features: 125
- Number of targets: 18 possible crime outcomes are described on the UCI page
- Number of observations: 2,215
- Response variable: a clear regression target is ViolentCrimesPerPop
- Topic: socio-economic, law-enforcement, and crime measurements for U.S. communities
- Missing-data handling: inspect and handle the many missing values, especially in the law-enforcement variables
- Parsing / encoding: encode categorical identifiers such as State if they are retained
- Other preparation: choose one crime variable as the target and remove overlapping crime subtotal variables that would leak information into that target
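
The preparation steps listed above could be sketched roughly as follows. This is a minimal illustration, not a prescribed pipeline: `prepare_crime_data()`, its arguments, the 50% missingness cutoff, and median imputation are all assumptions made for the example, and the toy data frame uses made-up values with illustrative column names. With the real data, `leak_cols` would have to be filled in by hand from the crime variables described on the UCI page.

```r
# Hypothetical helper (not part of ucimlrepo): applies the preparation steps
# from the list above to a data frame that already uses NA for missing values.
prepare_crime_data <- function(df, target, leak_cols = character(0),
                               max_missing_frac = 0.5) {
  stopifnot(target %in% names(df))

  # 1. Drop the other crime outcomes so they cannot leak into the target.
  df <- df[, setdiff(names(df), setdiff(leak_cols, target)), drop = FALSE]

  # 2. Keep only rows where the chosen target is observed.
  df <- df[!is.na(df[[target]]), , drop = FALSE]

  # 3. Drop columns that are mostly missing (assumed cutoff: 50%).
  miss_frac <- colMeans(is.na(df))
  df <- df[, miss_frac <= max_missing_frac, drop = FALSE]

  # 4. Median-impute what is still missing in numeric columns (one simple choice).
  for (nm in names(df)) {
    if (is.numeric(df[[nm]]) && anyNA(df[[nm]])) {
      df[[nm]][is.na(df[[nm]])] <- median(df[[nm]], na.rm = TRUE)
    }
  }

  # 5. Encode State as a factor if it has been retained.
  if ("State" %in% names(df)) df$State <- factor(df$State)

  df
}

# Toy data with made-up values, only to show the call.
toy <- data.frame(
  State = c("NJ", "PA", "OR", "NY"),
  population = c(1000, NA, 5000, 3000),
  PolicPerPop = c(NA, NA, NA, 5),
  murdPerPop = c(0, 1, 2, 3),
  ViolentCrimesPerPop = c(10, 20, NA, 30),
  stringsAsFactors = FALSE
)

cleaned <- prepare_crime_data(toy, target = "ViolentCrimesPerPop",
                              leak_cols = "murdPerPop")
str(cleaned)
```

Imputing before a train/test split lets test rows influence the imputed values, so in a modelling workflow it may be preferable to compute the medians on the training rows only.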