ASSIGNMENT # 5

AMAN VIJ

ADEENA MIRZA AMIT KUMAR TRIPATHY

SECTION A

Questions 1.

(a) Explore the data. Report on the distribution of values in the attributes and how they individually relate to the outcome of interest (dependent variable).

All the attributes are combination of categorical and numeric values and cover varied ranges. Each attribute was studied along with analyzing histograms, scatterplots and log-scaled graphs. The findings are listed below with the graphs of each attribute distribution in Section B.

(i) Borrower’s age (Bo_Age):

Borrower’s age shows an overall right skewed distribution with a maximum at 36. The distribution for only defaulters shows a*…show more content…*

(vi) Borrower’s total monthly debt expense (Tot_mthly_debt_exp):

Total monthly debt expense of borrower’s has a range from 0 to 17225 with an average of about 1745. The distribution of the total data as well as the defaulters is suggestively right skewed with most of the defaulters belonging to the class of borrower’s below

5500.

(vii) Borrower’s total monthly income (Tot_mthly_incm):

Borrower’s monthly income has a distribution similar to the monthly debt expense distribution and has a range from 500 to 65000 with an average of 5025. The graph also shows that most of the defaulters are from the low-income group with there being no defaulters with a monthly income of 15000 or higher.

(viii) Appraised value of home at origination (orig_apprd_val_amt):

Appraised value of homes also has a right skewed distribution with fewer records of houses with high appraised value and even fewer defaulters in that region. No defaulter was recorded for a house with an appraised value greater than 400,000.

(ix) Purchase price for house (pur_prc_amt):

Purchase price for houses shows a distribution very similar to appraised values with low frequency of borrowers for houses with high purchase price and no defaulter for a house more expensive than 400,000.

(x) Borrower debt to income ratio (DTI ratio):

Debt to income ratio is calculated by

AMAN VIJ

ADEENA MIRZA AMIT KUMAR TRIPATHY

SECTION A

Questions 1.

(a) Explore the data. Report on the distribution of values in the attributes and how they individually relate to the outcome of interest (dependent variable).

All the attributes are combination of categorical and numeric values and cover varied ranges. Each attribute was studied along with analyzing histograms, scatterplots and log-scaled graphs. The findings are listed below with the graphs of each attribute distribution in Section B.

(i) Borrower’s age (Bo_Age):

Borrower’s age shows an overall right skewed distribution with a maximum at 36. The distribution for only defaulters shows a

(vi) Borrower’s total monthly debt expense (Tot_mthly_debt_exp):

Total monthly debt expense of borrower’s has a range from 0 to 17225 with an average of about 1745. The distribution of the total data as well as the defaulters is suggestively right skewed with most of the defaulters belonging to the class of borrower’s below

5500.

(vii) Borrower’s total monthly income (Tot_mthly_incm):

Borrower’s monthly income has a distribution similar to the monthly debt expense distribution and has a range from 500 to 65000 with an average of 5025. The graph also shows that most of the defaulters are from the low-income group with there being no defaulters with a monthly income of 15000 or higher.

(viii) Appraised value of home at origination (orig_apprd_val_amt):

Appraised value of homes also has a right skewed distribution with fewer records of houses with high appraised value and even fewer defaulters in that region. No defaulter was recorded for a house with an appraised value greater than 400,000.

(ix) Purchase price for house (pur_prc_amt):

Purchase price for houses shows a distribution very similar to appraised values with low frequency of borrowers for houses with high purchase price and no defaulter for a house more expensive than 400,000.

(x) Borrower debt to income ratio (DTI ratio):

Debt to income ratio is calculated by

Related

- Good Essays
## Pt2520 Unit 6 Data Mining Project

- 1667 Words
- 7 Pages

STATS 415 Data Mining Project Insights into the prediction of the default payment through the history of payments, the amount of previous payment and the amount of bill payment

- 1667 Words
- 7 Pages

Good Essays - Good Essays
## Carper's Fundamental Patterns Of Knowing In Nursing

- 1915 Words
- 8 Pages

First, an understanding of each of these attributes is needed in the interest of gaining further insight on how they

- 1915 Words
- 8 Pages

Good Essays - Decent Essays
## Whirlpool Case Analysis

- 11031 Words
- 45 Pages

problem in collection due to an incorrect collection policy or lack of liquidity by buyers. An increase in debt in the year 1997 and 1998 shows the cost of interest due to aggressive investments or integration strategies. The company’s ability to meet its debt obligations drops significantly in the year 1996 due to closing the year at a loss, yet the company managed to control it

- 11031 Words
- 45 Pages

Decent Essays - Decent Essays
## Parice Problem

- 1623 Words
- 7 Pages

Describe briefly what each statistic in parts a. to c. tells you about the data.

- 1623 Words
- 7 Pages

Decent Essays - Satisfactory Essays
## Bivariate Analysis Paper

- 222 Words
- 1 Pages

Familiarise yourself with the data set provided. This will include doing research to help you understand the variables and develop a purpose for the investigation.

- 222 Words
- 1 Pages

Satisfactory Essays - Good Essays
## The Real Estate Market Hit Rock Bottom

- 882 Words
- 4 Pages

the borrowers. The lenders of financial institutions no longer trust approving loans to less than

- 882 Words
- 4 Pages

Good Essays - Good Essays
## The Downfall Of The Subprime Mortgage Market

- 1716 Words
- 7 Pages

Due to the U.S. Governments relaxed lending stipulations and low interest rates, the demand for houses soared from 1996 to 2006, consequently increasing home prices across the country. During this time period home prices rose nearly fifty percent (Li, 2015). Comparatively, from

- 1716 Words
- 7 Pages

Good Essays - Better Essays
## Module 1 Slp . Introduction To Research Methodologies .

- 1219 Words
- 5 Pages

Next you will need to decide on how you will obtain and analyze your research data. This process will be determined by the type of research problem that was define in the beginning of the research development and revolve around your hypothesis or desired results (M., 2004). You will have annotate and share the results of your data collection. This after action report of the obtained data will allow the researcher to determine whether or not the desired results were obtained or will there need to be some follow on research in order to generate expanded results.

- 1219 Words
- 5 Pages

Better Essays - Good Essays
## Statistical Data Report

- 1655 Words
- 7 Pages

The aim of this report is to analyse the assessment results of two primary schools within Bradford-on-Avon, a small town in Wiltshire; Fitzmaurice Community Primary School and Christ Church Voluntary Controlled Primary School. The data below is retrieved from the Department of Education (2014) and compared with local authority and national averages.

- 1655 Words
- 7 Pages

Good Essays - Good Essays
## Statistical Statistics : Descriptive Statistics

- 1603 Words
- 7 Pages

Statistics used to describe, or descriptive statistics, are single numbers that describe a sample of data (Kiess & Green, 2010, p. 7). Descriptive statistics merely describe what the data consists of (Trochim, 2006). Several frequently used descriptive statistics are the mean, median, and mode (Kiess & Green, 2010, p. 7). Descriptive statistics are used when it is necessary to relay large amounts of quantitative data in a simple, understandable way (Trochim, 2006). However, in some cases the use of descriptive statistics may simplify the data too much, leaving out important information (Trochim, 2006).

- 1603 Words
- 7 Pages

Good Essays - Better Essays
## Introduction & Descriptive Statistics

- 1341 Words
- 6 Pages

Also its ‘centre’ can often be meaningfully represented by the Median and the variation can sometimes be measured by the Range or the Inter Quartile Range.

- 1341 Words
- 6 Pages

Better Essays - Better Essays
## Descriptive Statistical Of Descriptive Statistics

- 982 Words
- 4 Pages

By computing number of years of job experience, out of 22 students mean is 6.34 higher than median and mode due to two students have more than 20 years of work experience, median is 3 and mode is 1 which suggest majority students have only one year work experience. The standard deviation of data is 7.56.

- 982 Words
- 4 Pages

Better Essays - Decent Essays
## Capital Asset Pricing And Discounted Price Flow Models Essay

- 1097 Words
- 5 Pages

The cost of debt is equivalent to the interest rate a corporation, and individual, or a household is paying on all of its debt such as loans or bonds. Debt is inclusive of repayment later, just as savings can be used later. It is believed that corporations with higher debt are frequently the riskier conglomerates. The risky behavior of some businesses can sometimes be attractive to potential investors, while causing others to shy away. Household debt unfolds similarly to that of major corporations, but on a smaller scale. While the bulk of credit in a household is extended in the form of mortgages, a lot dwells in the plastic credit cards. Unfortunately, credit is often used to bridge the gap in income or temporary drops and can ultimately cause the debt of a household to increase substantially. Credit cards often enable some households the consumption of possibilities that would not otherwise be around.

- 1097 Words
- 5 Pages

Decent Essays - Decent Essays
## Economic Research Methods

- 1078 Words
- 5 Pages

A frequency analysis and analysis of distribution, depicted with histograms and box plots, portraying the distribution of the different age groups within the world population.

- 1078 Words
- 5 Pages

Decent Essays - Better Essays
## Introduction. Descriptive Statistics Organize And Describe

- 1201 Words
- 5 Pages

Descriptive statistics organize and describe the characteristics of a data set. Researchers are interested in two types of descriptive statistics: central tendency and variability. Measures of central tendency describe the center of a distribution of scores. Common measures of central tendency are the mean, median, and mode. Variability describes the dispersion or spread of scores in a data set. The three commonly reported measures of variability are standard deviation, range, and variance. Both of these types of descriptive statistics are necessary for research methods because they enable the experimenter to see trends and draw conclusions from the data set. Information provided by the combination of measures of central

- 1201 Words
- 5 Pages

Better Essays