Regression task to predict house sale prices for Ames, Iowa.

Contains 81 features and 2930 observations. The target column is "Sale_Price".

Examples

data("ames_housing", package = "mlr3data")
str(ames_housing)
#> Classes ‘data.table’ and 'data.frame':	2930 obs. of  82 variables:
#>  $ Sale_Price        : int  215000 105000 172000 244000 189900 195500 213500 191500 236500 189000 ...
#>  $ Alley             : Factor w/ 2 levels "Gravel","Paved": NA NA NA NA NA NA NA NA NA NA ...
#>  $ Bedroom_AbvGr     : int  3 2 3 3 3 3 2 2 2 3 ...
#>  $ Bldg_Type         : Factor w/ 5 levels "Duplex","OneFam",..: 2 2 2 2 2 2 4 4 4 2 ...
#>  $ Bsmt_Cond         : Factor w/ 5 levels "Excellent","Fair",..: 3 5 5 5 5 5 5 5 5 5 ...
#>  $ Bsmt_Exposure     : Factor w/ 4 levels "Av","Gd","Mn",..: 2 4 4 4 4 4 3 4 4 4 ...
#>  $ Bsmt_Full_Bath    : int  1 0 0 1 0 0 1 0 1 0 ...
#>  $ Bsmt_Half_Bath    : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ Bsmt_Qual         : Factor w/ 5 levels "Excellent","Fair",..: 5 5 5 5 3 5 3 3 3 5 ...
#>  $ Bsmt_Unf_SF       : int  441 270 406 1045 137 324 722 1017 415 994 ...
#>  $ BsmtFin_SF_1      : int  639 468 923 1065 791 602 616 263 1180 0 ...
#>  $ BsmtFin_SF_2      : int  0 144 0 0 0 0 0 0 0 0 ...
#>  $ BsmtFin_Type_1    : Factor w/ 6 levels "ALQ","BLQ","GLQ",..: 2 5 1 1 3 3 3 1 3 6 ...
#>  $ BsmtFin_Type_2    : Factor w/ 6 levels "ALQ","BLQ","GLQ",..: 6 4 6 6 6 6 6 6 6 6 ...
#>  $ Central_Air       : Factor w/ 2 levels "N","Y": 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Condition_1       : Factor w/ 9 levels "Artery","Feedr",..: 3 2 3 3 3 3 3 3 3 3 ...
#>  $ Condition_2       : Factor w/ 8 levels "Artery","Feedr",..: 3 3 3 3 3 3 3 3 3 3 ...
#>  $ Condition_3       : Factor w/ 8 levels "Artery","Feedr",..: 3 3 3 3 3 3 3 3 3 3 ...
#>  $ Electrical        : Factor w/ 5 levels "FuseA","FuseF",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ Enclosed_Porch    : int  0 0 0 0 0 0 170 0 0 0 ...
#>  $ Exter_Cond        : Factor w/ 5 levels "Excellent","Fair",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ Exter_Qual        : Factor w/ 4 levels "Excellent","Fair",..: 4 4 4 3 4 4 3 3 3 4 ...
#>  $ Exterior_1st      : Factor w/ 16 levels "AsbShng","AsphShn",..: 4 14 15 4 14 14 6 7 6 14 ...
#>  $ Exterior_2nd      : Factor w/ 17 levels "AsbShng","AsphShn",..: 11 15 16 4 15 15 6 7 6 15 ...
#>  $ Fence             : Factor w/ 4 levels "Good_Privacy",..: NA 3 NA NA 3 NA NA NA NA NA ...
#>  $ Fireplace_Qu      : Factor w/ 5 levels "Excellent","Fair",..: 3 NA NA 5 5 3 NA NA 5 5 ...
#>  $ Fireplaces        : int  2 0 0 2 1 1 0 0 1 1 ...
#>  $ First_Flr_SF      : int  1656 896 1329 2110 928 926 1338 1280 1616 1028 ...
#>  $ Foundation        : Factor w/ 6 levels "BrkTil","CBlock",..: 2 2 2 2 3 3 3 3 3 3 ...
#>  $ Full_Bath         : int  1 1 1 2 2 2 2 2 2 2 ...
#>  $ Functional        : Factor w/ 8 levels "Maj1","Maj2",..: 8 8 8 8 8 8 8 8 8 8 ...
#>  $ Garage_Area       : int  528 730 312 522 482 470 582 506 608 442 ...
#>  $ Garage_Cars       : int  2 1 1 2 2 2 2 2 2 2 ...
#>  $ Garage_Cond       : Factor w/ 5 levels "Excellent","Fair",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ Garage_Finish     : Factor w/ 3 levels "Fin","RFn","Unf": 1 3 3 1 1 1 1 2 2 1 ...
#>  $ Garage_Qual       : Factor w/ 5 levels "Excellent","Fair",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ Garage_Type       : Factor w/ 6 levels "Attchd","Basment",..: 1 1 1 1 1 1 1 1 1 1 ...
#>  $ Garage_Yr_Blt     : int  1960 1961 1958 1968 1997 1998 2001 1992 1995 1999 ...
#>  $ Gr_Liv_Area       : int  1656 896 1329 2110 1629 1604 1338 1280 1616 1804 ...
#>  $ Half_Bath         : int  0 0 1 1 1 1 0 0 0 1 ...
#>  $ Heating           : Factor w/ 6 levels "Floor","GasA",..: 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Heating_QC        : Factor w/ 5 levels "Excellent","Fair",..: 2 5 5 1 3 1 1 1 1 3 ...
#>  $ House_Style       : Factor w/ 8 levels "One_Story","One_and_Half_Fin",..: 1 1 1 1 6 6 1 1 1 6 ...
#>  $ Kitchen_AbvGr     : int  1 1 1 1 1 1 1 1 1 1 ...
#>  $ Land_Contour      : Factor w/ 4 levels "Bnk","HLS","Low",..: 4 4 4 4 4 4 4 2 4 4 ...
#>  $ Land_Slope        : Factor w/ 3 levels "Gtl","Mod","Sev": 1 1 1 1 1 1 1 1 1 1 ...
#>  $ Lot_Area          : int  31770 11622 14267 11160 13830 9978 4920 5005 5389 7500 ...
#>  $ Lot_Area_m2       : num  2952 1080 1325 1037 1285 ...
#>  $ Lot_Config        : Factor w/ 5 levels "Corner","CulDSac",..: 1 5 1 1 5 5 5 5 5 5 ...
#>  $ Lot_Frontage      : int  141 80 81 93 74 78 41 43 39 60 ...
#>  $ Lot_Shape         : Factor w/ 4 levels "Irregular","Moderately_Irregular",..: 4 3 4 3 4 4 3 4 4 3 ...
#>  $ Low_Qual_Fin_SF   : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ Mas_Vnr_Area      : int  112 0 108 0 0 20 0 0 0 0 ...
#>  $ Mas_Vnr_Type      : Factor w/ 5 levels "BrkCmn","BrkFace",..: 5 4 2 4 4 2 4 4 4 4 ...
#>  $ Misc_Feature      : Factor w/ 5 levels "Elev","Gar2",..: NA NA 2 NA NA NA NA NA NA NA ...
#>  $ Misc_Feature_2    : Factor w/ 1 level "Othr": 1 1 1 1 1 1 1 1 1 1 ...
#>  $ Misc_Val          : int  0 0 12500 0 0 0 0 0 0 0 ...
#>  $ Mo_Sold           : int  5 6 6 4 3 6 4 1 3 6 ...
#>  $ MS_SubClass       : Factor w/ 16 levels "Duplex_All_Styles_and_Ages",..: 3 3 3 3 14 14 4 4 4 14 ...
#>  $ MS_Zoning         : Factor w/ 7 levels "A_agr","C_all",..: 6 5 6 6 6 6 6 6 6 6 ...
#>  $ Neighborhood      : Factor w/ 28 levels "Bloomington_Heights",..: 16 16 16 16 9 9 26 26 26 9 ...
#>  $ Open_Porch_SF     : int  62 0 36 0 34 36 0 82 152 60 ...
#>  $ Overall_Cond      : Factor w/ 9 levels "Above_Average",..: 2 1 1 2 2 1 2 2 2 2 ...
#>  $ Overall_Qual      : Factor w/ 10 levels "Above_Average",..: 1 2 1 6 2 1 9 9 9 6 ...
#>  $ Paved_Drive       : Factor w/ 3 levels "Dirt_Gravel",..: 2 3 3 3 3 3 3 3 3 3 ...
#>  $ Pool_Area         : int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ Pool_QC           : Factor w/ 4 levels "Excellent","Fair",..: NA NA NA NA NA NA NA NA NA NA ...
#>  $ Roof_Matl         : Factor w/ 8 levels "ClyTile","CompShg",..: 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Roof_Style        : Factor w/ 6 levels "Flat","Gable",..: 4 2 4 4 2 2 2 2 2 2 ...
#>  $ Sale_Condition    : Factor w/ 6 levels "Abnorml","AdjLand",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ Sale_Type         : Factor w/ 10 levels "COD","CWD","Con",..: 10 10 10 10 10 10 10 10 10 10 ...
#>  $ Screen_Porch      : int  0 120 0 0 0 0 0 144 0 0 ...
#>  $ Second_Flr_SF     : int  0 0 0 0 701 678 0 0 0 776 ...
#>  $ Street            : Factor w/ 2 levels "Grvl","Pave": 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Three_season_porch: int  0 0 0 0 0 0 0 0 0 0 ...
#>  $ Total_Bsmt_SF     : int  1080 882 1329 2110 928 926 1338 1280 1595 994 ...
#>  $ TotRms_AbvGrd     : int  7 5 6 8 6 7 6 5 5 7 ...
#>  $ Utilities         : Factor w/ 3 levels "AllPub","NoSeWa",..: 1 1 1 1 1 1 1 1 1 1 ...
#>  $ Wood_Deck_SF      : int  210 140 393 0 212 360 0 0 237 140 ...
#>  $ Year_Built        : int  1960 1961 1958 1968 1997 1998 2001 1992 1995 1999 ...
#>  $ Year_Remod_Add    : int  1960 1961 1958 1968 1998 1998 2001 1992 1996 1999 ...
#>  $ Year_Sold         : int  2010 2010 2010 2010 2010 2010 2010 2010 2010 2010 ...
#>  - attr(*, ".internal.selfref")=<externalptr>
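The data can be wrapped into a regression task and inspected before modelling. The following is a minimal sketch, assuming the mlr3 package is installed alongside mlr3data; the task id "ames_housing" and the helper variable names are illustrative choices, not part of the dataset documentation.

```r
library(mlr3)

data("ames_housing", package = "mlr3data")

# Build a regression task with Sale_Price as the target.
# The id is an arbitrary label chosen for this sketch.
task = as_task_regr(ames_housing, target = "Sale_Price", id = "ames_housing")

# Every column except the target is treated as a feature.
n_features = length(task$feature_names)
n_features

# Columns with at most one distinct non-missing value carry no information.
is_constant = vapply(
  ames_housing,
  function(x) length(unique(na.omit(x))) <= 1L,
  logical(1)
)
constant_cols = names(which(is_constant))
constant_cols

# Share of missing values per column, largest first.
na_share = sort(colMeans(is.na(ames_housing)), decreasing = TRUE)
head(na_share)
```

Checks like these are a reasonable first step here, since the `str()` output above shows several columns that are mostly `NA` (for example "Alley" and "Pool_QC") and one factor with a single level ("Misc_Feature_2").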