MAT 240 Module Three Assignment Jackie
.docx
keyboard_arrow_up
School
Southern New Hampshire University *
*We aren’t endorsed by this school
Course
240
Subject
Mathematics
Date
Jan 9, 2024
Type
docx
Pages
8
Uploaded by MajorOxideEagle26
Housing Price Prediction Model for D.M. Pan Real Estate Company
Jackie Sechrist
September 13,2023
MAT-240 Applied Statistics Southern New Hampshire University
Instructor Wasim Barham
Module Two Notes
The listing price is the Y axis. Square foot area is the X axis. The shape is linear because the trend line is positive, even though the plot is kind of clustered up in one spot, the association between x
and y
in the scatterplot is a positive one, you can see from data point to data point that the selling price is higher as the square footage increases. The correlation is positive, not perfectly linear but the association between the two is a positive one.
1,000 2,000 3,000 4,000 5,000 6,000 7,000 - 100,000 200,000 300,000 400,000 500,000 600,000 700,000 800,000 f(x) = 112.77 x + 38972.08
Listing Price and Square foot area Square foot area
Listing Price
Regression Equation
Regression Equation Y= 112.77x + 38972
Y^=38972.08+112.77x
Determine r
0.94
According to the scatterplot and my random sample, the “r” also known as the correlation coefficient is a 0.94 which is a positive, strong correlation, shows a close association between the two variables. This correlation is strong because it is close to 1. I determined the direction of the association directly from the best fit line, it was going up, which means that’s a positive correlation, and with the “r” close to one, it determines a variation that is considered good. For the value of R anything less than 0.40 is considered weak, 0.40<0.80 is moderate and 0.80 and greater is considered strong strength in correlation. There are a few outliners, but again we are going to dismiss them with the fact that when taking on a sample in a size of only 30 and that the average for listing price is substantially lower than the
outlines presented, chances are these are just showing up because of sample size or the lake off data for homes in that range. Examine the Slope and Intercepts
Slope- 112.76
Intercept- 38972.08- It represents the point where this line crosses the y-axis.
You would expect listing price to go up, based on square root area when looking at the slope and the intercepts. Based on the regression equation my slope is 112.76 and the y intercept is 38,972 if we were to change x to 0, then the square foot area is 0, then you would be subtracted the answer, which if X is 0 then its), then the sum is now -38,972 which makes zero cents.. One reason is because houses don’t go on the market and sell if the square foot was zero, and this conclusion means that by chance it was sold as only the land it would be a negative number. This doesn’t make sense to be the value because the land doesn’t have a value when negative or land only. R
-squared Coefficient
0.88
The R-squared coefficient- also known as the coefficient of determination. This is the proportion of the listing price that is explained by the square foot area. In the context of this analysis R-squared is the proportion of Y that is explained by X, so in this case, “how much does the listing price vary based on
the square foot average of the house.” In other words, r-squared shows how well the data fit the regression model. Conclusions
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help