Assignment 11
.pdf
keyboard_arrow_up
School
California State University, Chico *
*We aren’t endorsed by this school
Course
105
Subject
Economics
Date
Jan 9, 2024
Type
Pages
7
Uploaded by ProfessorUniverseTarsier7
Example Assignment Eleven: Using multiple regression to analyze the gender pay gap
Part I: Use APA style and formatting for all assignments, references, and citations. Yes, have a cover
page, too, as well as a running head. Try Purdue Owl for an example APA style paper:
https://owl.english.purdue.edu/owl/resource/560/18/
For this final analysis you get to bring together many of the variables we have been using this term to
better understand difference in income. In particular we want to explain the gender pay gap between
women and men. For Ordinary Least Squares (OLS) regression analyses, which we are using for this
assignment, you want to have at least one interval/ratio independent variable and an interval/ratio
dependent variable. Your dependent variable is pincp. Your independent variables will be sex, agep, and
schl. But, there are other variables that might explain variation in income. For this analysis we will add
race to our independent variable list. However, as your book tells you, for the nominal variables we
need to do a little recoding into “dummy” variables so
we can use OLS regression more effectively. We
will recode sex into a dummy variable called “Male.” And, we will recode rac1p into a dummy variable
called “White.”
Also, know that there are more tests that need to be done to come to firmer conclusions from an OLS
analysis. For example, two independent variables might also have a strong association where one
predicts the other to a large degree. Might this be the case for sex and schl? When this happens it is
known as multicollinearity or just collinearity and it can impact OLS regression results. There are ways to
test for it and correct the problem, but we are not going to do that in this course. Just know that there is
more to OLS regression than what you practice here. You are practicing running and interpreting the
analysis.
1.
What is the measure (nominal, ordinal, or interval/ratio) of each of your independent variables
and your dependent variable?
Dependent variable, pincp: I/R
Independent variable, rac1p: Nominal
Independent variable, sex: Nominal
Independent variable, agep: I/R
Independent variable, schl: I/R I answer for you because I want you to treat this as an I/R
variable for years of schooling even though it is not exactly year for year the years of schooling.
You can check the data dictionary for schl to see how the answers are coded. They are coded
from 1 to 16 where each number means progressively more education.
2.
Using your 2014-2018 ACS data file, recode your nominal independent variables as instructed in
the text under 17.2 Recoding to Create Dummy Variables and from past assignments to
transform each nominal variable, sex and rac1p, into a new variable.
3.
For sex code Male=1 and Female = 0 in a new variable Male. Male is already coded 1, but you
need to make 1 = 1 in the new variable anyway. Female is coded as 2, so you have to change the
2 to a 0. The new variable, male, should be numeric when you are done. Here is a screen shot to
help you:
4.
Next assign labels to the values for your new variable, male. So, 1=Male and 0=Female. We have
done assigned labels before. See screen shot below to help guide you.
5.
Save your file with the new variable, male.
6.
For rac1p code white=1 and nonwhite = 0 in a new variable white. Recoding rac1p is a little
more complicated to recode thnt sex was because it has many values-white, black, native, etc. If
you want to see the coding for rac1p, it is in the data dictionary starting on page 101. White is
already coded 1, but you need to make 1=1 in the new variable. All other race categories are
coded 2-9, so you have to change them all together to =0. Race categories are much more
complicated than white/nonwhite. We are coding them this way for ease of practice. The new
variable, white, should be numeric when you are done. Here is a screen shot to help you:
7.
Next assign labels to the values for your new variable, white. So, 1=White and 0=Nonwhite. We
have done assigned labels before. See screen shot below to help guide you.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Related Questions
What are the various functional forms of Regression Model?
arrow_forward
What are the most important remaining threats to the internal validity of this regression analysis?
arrow_forward
Give typing answer with explanation and conclusion
if education is heavily affected by IQ and IQ is closely correlated with both wage and education level, how would a regression of wage on education be biased? What regression violation is this? (For econometrics)
arrow_forward
Please answer fast
arrow_forward
What are the consequences in the regression results if multicollinearity is present in the regression model?
arrow_forward
How to include dummy variables in a regression? Give an example
arrow_forward
Explain the concept of model selection criteria such as Akaike Information Criterion (AIC) and
Bayesian Information Criterion (BIC) in the context of linear regression. How can these
criteria be used to compare and select among competing regression models?
arrow_forward
What are the four assumptions of linear regression (simple linear and multiple)?
arrow_forward
Explain how you will use Regression analysis in any research with a suitable example.
arrow_forward
Please provide me with the correct answer, along with the calculations, and do not use any AI tools
arrow_forward
(2)What would the consequence be for a regression model if theerrors were not homoscedastic?
arrow_forward
This is an econometrics question. Please answer in 4 decimal places
arrow_forward
Consider the following estimated regression model relating annual salary to years of education and work experience. Estimated Salary=11,681.31+3418.97(Education)+1194.78(Experience) Suppose two employees at the company have been working there for five years. One has a bachelor's degree (8 years of education) and one has a master's degree (10 years of education). How much more money would we expect the employee with a master's degree to make?
arrow_forward
In a regression problem with 1 output variable and with a total number of 100 possible input variables, what is the number of all possible models with three input variables?
arrow_forward
What is difference between regression model, and estimated regression equation?
arrow_forward
Why researchers use so many theoretical model to do their research, such as regression model, empirical model etc??
arrow_forward
GSU is trying to predict how price of books predict the quantity of books sold over the semesters. Perform a liner regression analysis, and and find the best model to predict quantity of books to stock in the book store. Make recommendation at setting the prices and quantity at their optimum values for maximizing quantity of books sold. Be prepared to discuss your analysis.
Quantity Price
180 475
590 400
430 450
250 550
275 575
720 375
660 375
490 450
700 400
210 500
arrow_forward
What is a linear regression model? What is measured by the coefficients ofa linear regression model? What is the ordinary least squares estimator?
arrow_forward
List the 5 assumptions of the Classical Linear Regression Model and explain at least three of them
arrow_forward
Consider the following estimated regression model relating annual salary to years of education and work experience.
Estimated Salary=11,722.40+3182.56(Education)+1202.44(Experience)Estimated Salary=11,722.40+3182.56(Education)+1202.44(Experience)
Suppose an employee with 66 years of education has been with the company for 33 years (note that education years are the number of years after 8th8th grade). According to this model, what is his estimated annual salary?
arrow_forward
Kristin Forbes in her American Economic Review (2000) article investigates the relationship
between economic growth and inequality. She uses five yearly data for 45 countries for the time
period 1965-1995. In the table below are results of her using four types of panel regression
estimation techniques for the same model, where she estimates the relationship between
economic growth and inequality (measured by the Gini coefficient)
Estimation
method
Inequality
Income
Male Education
Female Education
PPP
R²
Countries.
Observations
Period
Fixed effects
(1)
0.0036
(0.0015)
-0.076
(0.020)
-0.014
(0.031)
0.070
(0.032)
-0.0008
(0.0003)
0.67
45
180
1965-1995*
Five-year periods
Random effects
(2)
0.0013
(0.0006)
0.017
(0.006)
0.047
(0.015)
-0.038
(0.016)
-0.0009
(0.0002)
0.49
45
180
1965-1995
Chamberlain's
77-matrix
(3)
0.0016
(0.0002)
-0.027
(0.004)
0.018
(0.010)
0.054
(0.006)
-0.0013
(0.0000)
45
135
1970-1995
Arellano and
Bond
(4)
0.0013
(0.0006)
-0.047
(0.008)
-0.008
(0.022)
0.074
(0.018)…
arrow_forward
Explain carefully why running the regression above might suffer from endogeneity concerns: are their any unobservable variables that might confound the results? Should we be worried about reverse causality? What empirical methods could we use to address these concerns?
arrow_forward
What is the Role of Control Variables in Multiple Regression?
arrow_forward
Economics
you learned four steps that should be used to evaluate a regression model. What is the first step and why is it so important?
arrow_forward
Need more explanation by considerinng the data given in snapshots
arrow_forward
Consider the following data regarding students' college GPAs and high school GPAs. The estimated regression equation is
Estimated College GPA = 3.38 + ( − 0.0272) (High School GPA).
GPAs
College GPA High School GPA
2.09
3.60
3.73
3.99
3.56
3.88
2.83
4.90
3.51
4.15
3.87
3.83
Copy Data
Step 2 of 3: Compute the mean square error (s2) for the model. Round your answer to four decimal places.
arrow_forward
Define Interpretation of coefficients in polynomial regression models?
arrow_forward
Question:
1. Please chose a topic you are interested in within social policy-education, health care, mental health etc. and write out a hypothetical regression estimation for this topic. Make sure to identify the ideal outcome variable, explanatory variable, and control variables.
arrow_forward
Consider the following data regarding students' college GPAs and high school GPAs. The estimated regression equation is
Estimated College GPA=1.85+0.4743(High School GPA).Estimated College GPA=1.85+0.4743(High School GPA). GPAs
College GPA
High School GPA
3.843.84
2.562.56
3.573.57
3.903.90
2.072.07
3.143.14
4.004.00
3.223.22
3.873.87
2.882.88
2.212.21
2.082.08
Copy Data
Step 1 of 3 :
Compute the sum of squared errors (SSE) for the model. Round your answer to four decimal places.
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you
Managerial Economics: Applications, Strategies an...
Economics
ISBN:9781305506381
Author:James R. McGuigan, R. Charles Moyer, Frederick H.deB. Harris
Publisher:Cengage Learning
Related Questions
- What are the various functional forms of Regression Model?arrow_forwardWhat are the most important remaining threats to the internal validity of this regression analysis?arrow_forwardGive typing answer with explanation and conclusion if education is heavily affected by IQ and IQ is closely correlated with both wage and education level, how would a regression of wage on education be biased? What regression violation is this? (For econometrics)arrow_forward
- Explain the concept of model selection criteria such as Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) in the context of linear regression. How can these criteria be used to compare and select among competing regression models?arrow_forwardWhat are the four assumptions of linear regression (simple linear and multiple)?arrow_forwardExplain how you will use Regression analysis in any research with a suitable example.arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- Managerial Economics: Applications, Strategies an...EconomicsISBN:9781305506381Author:James R. McGuigan, R. Charles Moyer, Frederick H.deB. HarrisPublisher:Cengage Learning
Managerial Economics: Applications, Strategies an...
Economics
ISBN:9781305506381
Author:James R. McGuigan, R. Charles Moyer, Frederick H.deB. Harris
Publisher:Cengage Learning