To illustrate this case, we use a data set of apartments in El Poblado, Medellín.
We begin with a summary of the data.
valor | area |
---|---|
Min. :1.300e+08 | Min. : 49.0 |
1st Qu.:3.350e+08 | 1st Qu.:100.0 |
Median :4.350e+08 | Median :138.0 |
Mean :4.855e+08 | Mean :150.5 |
3rd Qu.:5.500e+08 | 3rd Qu.:190.0 |
Max. :1.630e+09 | Max. :350.0 |
Correlation matrix:
| | valor | area |
---|---|---|
valor | 1.0000000 | 0.8599923 |
area | 0.8599923 | 1.0000000 |
The results of fitting the simple linear regression model are shown below.
##
## Call:
## lm(formula = valor ~ area, data = regresion)
##
## Residuals:
## Min 1Q Median 3Q Max
## -397309920 -69525715 11622185 63152268 618679323
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -26807060 33644921 -0.797 0.427
## area 3403698 202988 16.768 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 141600000 on 99 degrees of freedom
## Multiple R-squared: 0.7396, Adjusted R-squared: 0.737
## F-statistic: 281.2 on 1 and 99 DF, p-value: < 2.2e-16
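Several figures in this summary are linked, and the links can be checked by hand. The sketch below uses only the rounded values printed above, so agreement is approximate. It recomputes the t statistic for `area` and illustrates two identities that hold when there is a single predictor: the F statistic equals the squared t statistic, and R-squared equals the squared correlation between `valor` and `area`. The variable names are illustrative and not taken from the original analysis.

```r
# Values copied from the printed output (rounded, so the checks are approximate).
estimate_area <- 3403698     # slope for area
se_area       <- 202988      # standard error of the slope
r_valor_area  <- 0.8599923   # correlation between valor and area

# t statistic = estimate / standard error
t_area <- estimate_area / se_area

# With one predictor, the overall F statistic equals t^2
f_stat <- t_area^2

# With one predictor, R-squared equals the squared correlation
r_squared <- r_valor_area^2

print(round(c(t = t_area, F = f_stat, R2 = r_squared), 4))
```

The printed values can be compared with the `t value`, `F-statistic` and `Multiple R-squared` entries of the summary.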
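The predicted values from the fitted model (column `estimaciones`) are shown next for a grid of areas. The observed areas range from 49 to 350, so the rows with area below 49 are extrapolations, and the negative estimate at area = 1 has no practical meaning.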
area | estimaciones |
---|---|
1 | -23403362.8 |
8 | 422519.7 |
15 | 24248402.2 |
22 | 48074284.7 |
29 | 71900167.2 |
36 | 95726049.7 |
43 | 119551932.2 |
50 | 143377814.7 |
57 | 167203697.2 |
64 | 191029579.7 |
71 | 214855462.2 |
78 | 238681344.7 |
85 | 262507227.2 |
92 | 286333109.7 |
99 | 310158992.2 |
106 | 333984874.7 |
113 | 357810757.2 |
120 | 381636639.7 |
127 | 405462522.2 |
134 | 429288404.7 |
141 | 453114287.2 |
148 | 476940169.7 |
155 | 500766052.2 |
162 | 524591934.7 |
169 | 548417817.2 |
176 | 572243699.7 |
183 | 596069582.2 |
190 | 619895464.7 |
197 | 643721347.2 |
204 | 667547229.7 |
211 | 691373112.2 |
218 | 715198994.6 |
225 | 739024877.1 |
232 | 762850759.6 |
239 | 786676642.1 |
246 | 810502524.6 |
253 | 834328407.1 |
260 | 858154289.6 |
267 | 881980172.1 |
274 | 905806054.6 |
281 | 929631937.1 |
288 | 953457819.6 |
295 | 977283702.1 |
302 | 1001109584.6 |
309 | 1024935467.1 |
316 | 1048761349.6 |
323 | 1072587232.1 |
330 | 1096413114.6 |
337 | 1120238997.1 |
344 | 1144064879.6 |
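The table above can be approximately reproduced from the fitted line alone. The following is a minimal sketch that assumes the rounded coefficients from the model summary, so results may differ from the table by a few hundred units at the largest areas. The object names (`b0`, `b1`, `predicciones`, `extrapolated`) are hypothetical.

```r
# Coefficients copied from the model summary (rounded to whole units).
b0 <- -26807060   # intercept
b1 <- 3403698     # slope for area

# Grid of areas matching the table: 1, 8, 15, ..., 344
area <- seq(1, 344, by = 7)

# Fitted line: valor = b0 + b1 * area
estimaciones <- b0 + b1 * area

predicciones <- data.frame(area = area, estimaciones = estimaciones)

# Flag rows outside the observed range of area (49 to 350 in the data summary)
predicciones$extrapolated <- area < 49 | area > 350

# With the fitted model object (called modelo here, a hypothetical name),
# predict(modelo, newdata = data.frame(area = area)) would give unrounded values.
print(head(predicciones, 10))
```

Flagging the extrapolated rows makes it explicit which estimates rest on areas the model never saw.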
The 95% confidence intervals for the model coefficients are shown below.
| | 2.5 % | 97.5 % |
---|---|---|
(Intercept) | -93565883 | 39951762 |
area | 3000925 | 3806470 |
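These limits follow the usual form estimate ± t × standard error, with the t quantile taken at 99 residual degrees of freedom. The sketch below rebuilds them from the rounded estimates and standard errors in the model summary, so the last digits may differ slightly from the table. The variable names are illustrative assumptions.

```r
# Estimates and standard errors copied from the model summary (rounded).
est <- c("(Intercept)" = -26807060, area = 3403698)
se  <- c("(Intercept)" = 33644921,  area = 202988)
df_resid <- 99   # residual degrees of freedom

# Two-sided 95% interval uses the 0.975 quantile of the t distribution
t_crit <- qt(0.975, df = df_resid)

ci <- cbind("2.5 %" = est - t_crit * se,
            "97.5 %" = est + t_crit * se)
print(round(ci))
```

The interval for the intercept contains zero, which agrees with its non-significant p-value (0.427) in the summary, while the interval for `area` lies entirely above zero.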
### ANOVA
| | Df | Sum Sq | Mean Sq | F value | Pr(>F) |
---|---|---|---|---|---|
area | 1 | 5.633806e+18 | 5.633806e+18 | 281.1649 | < 2.2e-16 |
Residuals | 99 | 1.983700e+18 | 2.003737e+16 | NA | NA |
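The ANOVA table carries the same information as the model summary in a different layout. As a rough check, the sketch below starts from the sums of squares and degrees of freedom in the table and recovers the mean squares, the F value, its p-value, R-squared and the residual standard error. The variable names are hypothetical, and the inputs are the rounded figures shown above.

```r
# Sums of squares and degrees of freedom copied from the ANOVA table.
ss_area  <- 5.633806e18
ss_resid <- 1.983700e18
df_area  <- 1
df_resid <- 99

# Mean square = sum of squares / degrees of freedom
ms_area  <- ss_area / df_area
ms_resid <- ss_resid / df_resid

# F value = ratio of mean squares; p-value from the upper tail of F(1, 99)
f_value <- ms_area / ms_resid
p_value <- pf(f_value, df_area, df_resid, lower.tail = FALSE)

# R-squared = explained sum of squares / total sum of squares
r_squared <- ss_area / (ss_area + ss_resid)

# Residual standard error = square root of the residual mean square
resid_se <- sqrt(ms_resid)

print(c(F = f_value, p = p_value, R2 = r_squared, sigma = resid_se))
```

The p-value is far too small to display at the table's rounding, which is why the `Pr(>F)` column can appear as 0 in rounded output.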