radon

  • Description:

Radon is a radioactive gas that enters homes through contact points with the ground. It is a carcinogen and the primary cause of lung cancer in non-smokers. Radon levels vary greatly from household to household. This dataset contains measured radon levels in U.S. homes, identified by county and state. The 'activity' label is the measured radon concentration in pCi/L. Important predictors are 'floor' (the floor of the house on which the measurement was taken), 'county' (the U.S. county in which the house is located), and 'Uppm' (a county-level measurement of the uranium content of the soil).

  • Splits:

Split      Examples
'train'    12,573
  • Feature structure:
FeaturesDict({
    'activity': float32,
    'features': FeaturesDict({
        'Uppm': float32,
        'adjwt': float32,
        'basement': object,
        'cntyfips': int32,
        'county': object,
        'dupflag': int32,
        'floor': int32,
        'idnum': int32,
        'lat': float32,
        'lon': float32,
        'pcterr': float32,
        'region': int32,
        'rep': int32,
        'room': int32,
        'startdt': int32,
        'starttm': int32,
        'state': object,
        'state2': object,
        'stfips': int32,
        'stopdt': int32,
        'stoptm': int32,
        'stratum': int32,
        'typebldg': int32,
        'wave': int32,
        'windoor': object,
        'zip': int32,
        'zipflag': int32,
    }),
})
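
The nested structure above can be read as in the following minimal sketch. It assumes the dataset is registered in TensorFlow Datasets under the name `radon` (the title of this page) and that the `tensorflow_datasets` package is installed. The helper names `load_radon` and `to_row` are hypothetical and exist only for illustration. String-valued fields are assumed to arrive as bytes when iterated as NumPy values.

```python
def load_radon(split="train"):
    """Return an iterable of NumPy example dicts for the given split.

    Assumption: the dataset name is "radon". The import is done lazily so
    that to_row() below can be used without TensorFlow Datasets installed.
    """
    import tensorflow_datasets as tfds

    return tfds.as_numpy(tfds.load("radon", split=split))


def to_row(example):
    """Pick the label and the three predictors named in the description."""
    feats = example["features"]
    county = feats["county"]
    if isinstance(county, bytes):  # string fields are assumed to be bytes
        county = county.decode("utf-8")
    return {
        "activity": float(example["activity"]),  # radon concentration, pCi/L
        "floor": int(feats["floor"]),
        "county": county,
        "Uppm": float(feats["Uppm"]),
    }


# Usage (downloads the data on first call):
# rows = [to_row(example) for example in load_radon()]
```

Keeping the selection in a small pure function means it can be checked on a hand-built dictionary before any data is downloaded.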
  • Feature documentation:
Feature              Class         Shape  Dtype    Description
                     FeaturesDict
activity             Tensor               float32
features             FeaturesDict
features/Uppm        Tensor               float32
features/adjwt       Tensor               float32
features/basement    Tensor               object
features/cntyfips    Tensor               int32
features/county      Tensor               object
features/dupflag     Tensor               int32
features/floor       Tensor               int32
features/idnum       Tensor               int32
features/lat         Tensor               float32
features/lon         Tensor               float32
features/pcterr      Tensor               float32
features/region      Tensor               int32
features/rep         Tensor               int32
features/room        Tensor               int32
features/startdt     Tensor               int32
features/starttm     Tensor               int32
features/state       Tensor               object
features/state2      Tensor               object
features/stfips      Tensor               int32
features/stopdt      Tensor               int32
features/stoptm      Tensor               int32
features/stratum     Tensor               int32
features/typebldg    Tensor               int32
features/wave        Tensor               int32
features/windoor     Tensor               object
features/zip         Tensor               int32
features/zipflag     Tensor               int32
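
The slash-separated names in the table above correspond to paths through the nested feature dictionary. A minimal sketch of that mapping follows. The function name `flatten` is hypothetical, and the sample record uses made-up placeholder values, not rows from the dataset.

```python
def flatten(example, prefix=""):
    """Flatten a nested example dict into slash-separated paths."""
    flat = {}
    for key, value in example.items():
        path = f"{prefix}/{key}" if prefix else key
        if isinstance(value, dict):
            # Recurse into nested dictionaries such as 'features'.
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat


# Placeholder record (values are invented for illustration only).
sample = {"activity": 1.0, "features": {"Uppm": 0.5, "floor": 0}}
flat_sample = flatten(sample)
print(sorted(flat_sample))
```

A flat mapping like this is convenient when the examples are to be written out as columns, for instance one column per path.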
  • Citation:
@book{GelmanHill:2007,
  author = {Gelman, Andrew and Hill, Jennifer},
  title = {Data Analysis Using Regression and Multilevel/Hierarchical Models},
  publisher = {Cambridge University Press},
  series = {Analytical methods for social research},
  year = 2007
}