Problem Set #3
.pdf
keyboard_arrow_up
School
Texas Tech University *
*We aren’t endorsed by this school
Course
5347
Subject
Computer Science
Date
Dec 6, 2023
Type
Pages
8
Uploaded by ConstableYak3694
Problem Set #3
Fallon Sheffield
Q3. Analyze the Nike data given in Internet and Computer Exercises 1 of
Chapter 15
. The data
file and the description of key variables can be downloaded from the Web site for this book. Do
the three usage groups differ in terms of awareness, attitude, preference, intention, and loyalty
toward Nike when these variables are considered simultaneously?
The user group in Nike’s test pool are divided up into three categorical groups: light users,
medium users, and heavy users. The eigenvalue of function one has a higher value, meaning that
function 1 is performing much better than function 2 and doing a better job of discriminating the
variables. In addition to the eigenvalue, the second function only explains 3.3% of variance
while function 1 explains 96.7% of variance. Under the “Wilks’ Lambda” table, function 1
shows as significant, with a p-value of less than 0.001, meaning that this is a reliable analysis and
a Wilks’ Lambda of 0.135. Therefore, the function we should use for this analysis would be
function 1.
Looking at the “Standardized Canonical discriminant Function Coefficients” table, both attitude
(0.588) and awareness (0.547) stick out as significant coefficients in our discriminant analysis.
Both of these are relatively high and positive, suggesting that attitude and awareness both have
strong positive impacts on discriminating between user groups. This holds true in the structure
matrix as well, as awareness is represented by 0.708 and attitude 0.672. In addition to this, the
canonical discriminant function coefficients table also shows attitude and awareness as
significant variables. Because of this, we can assume that intention, loyalty, and preference do
not differ greatly between the three groups.
When looking at the classification Results table, we see that 87.5% of “original grouped cases
correctly satisfied” and “80% of cross-validated grouped cases correctly classified.” This would
show us that the analysis did a pretty good job of classifying different variables into the correct
groups.
Q4. Analyze the outdoor lifestyle data given in Internet and Computer Exercises 2 of
Chapter 15
.
The data file and the description of key variables can be downloaded from the Web site for this
book. Do the three groups based on location of residence differ on the importance attached to
enjoying nature, relating to the weather, living in harmony with the environment, exercising
regularly, and meeting other people (
V
2
to
V
6
) when these variables are considered
simultaneously?
The groups in this analysis are divided up into three categories: mid/downtown, suburbs, and
countryside, all based off of residence for a poll on outdoor lifestyle. For this analysis we will
use function 1, as it is more reliable. This can be known because of the higher eigenvalue shown
in the “eigenvalues” chart. Function 1 has an eigenvalue of 2.257 versus function 2’s 0.174. We
can also see that there is a much higher percentage of variance of about 93% in function 1,
compared to 7% in function 2. In addition to this, the Wilks’ Lambda for Function 1 is valued at
0.262 and has a reliable p-value of less than 0.001. Function is the better and more reliable
function to use in this data set.
When we look at the “Standardized Canonical discriminant Function Coefficients” table, we can
see that preference (-0.767), nature (0.817), and meeting people (0.798) have the largest
standardized coefficients that have a significant impact on the discriminant functions.
“Preference” has a negative coefficient, which suggests that this variable decreases, but is
associated with higher values on the discriminant function. “Nature” and “Meeting People” both
have positive values, meaning there is an increase of these variables. The structure matrix also
shows that all of these variables are, in fact, significant. “Preference” (0.738) and “nature”
(0.407) both have positive correlations with function 1, however “meeting people” (-0.044) has a
weak negative correlation with function 1. The canonical discriminant function coefficients
shows that nature has the highest positive coefficient (0.63), indicating its strong contribution to
the function. Preference has a negative coefficient again in function 1 and meeting peopl has a
positive coefficient for function 1. Higher values with “nature” and “meeting people” are
associated with one group, while lower values are associated with another group. Preference also
plays a role in discriminants, but has a negative association with one of the groups, meaning that
as “preference” decreases, it is linked to that particular group. The other variables, “weather,”
“harmony,” and “exercising” seem to have weaker associations with the discriminant functions,
deeming them as insignificant.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Related Questions
For selection sizes where the necessary value is less than or equal to 15, please elaborate on how you may hazard an estimate. I'm wondering how large a pool of potential employees you expect there to be. If the values in the same histogram have a normal distribution between 11 and 15, then an estimated value of 13 may be used as the selection criteria.
Simply respond to this inquiry with some text.
arrow_forward
Create a “level of attributes” diagram for a car rental. Include its core benefit, expected attributes, and add-on attributes. Have any of these attributes changed over the past 10 years? Please explain your answer.
arrow_forward
Please describe the relationship between bias, variance, underfitting, and overfitting. Support your answers with examples if needed.
arrow_forward
Explain what you mean by the term "experimental" when referring to a study strategy. Please
describe its key characteristics by referring to the most relevant example.
arrow_forward
Q3 Rearrange the following steps for the EA problem solving model:
Repeat
Allow more options for better ones
Choose one or more of these, alter them to create new solutions
• Discard some of the solutions
Over a period, solution quality will improve
Create a number of tentative solutions
arrow_forward
When evaluating the correctness of a model, the only thing that can
be considered is how well the model performs on test data. explain
in detail, expand on? Explain?
arrow_forward
Discuss two ways in which the 1:M relationship between COURSE and CLASS can be implemented. (Hint: Think about relationship strength.)
arrow_forward
Pick and summarise ONE (1) article that uses data classification in a variety of domains. (The summary should concentrate on how the material applies to data classification analysis specifically.) Please respond to the following questions:
i. What is the problem or question(s) that this study is addressing? You should be able to locate the focal point. If there are any other issues, make a note of them as well. ii. What is the data's source? There are two or more sources of data in some studies. Give a brief description of how the data was obtained.
arrow_forward
Do you have any particular recommendations for capturing the data flow?
arrow_forward
Describe how you will collect data when using the formative assessment as described below.
Identify the key skills required to meet the learning objective. In this case, the core skill is rounding two- and three-digit whole numbers to the nearest 10 or 100. The content is the concept of rounding and the rules for rounding numbers.
Design the formative assessment. This could be a worksheet with a variety of problems for the student to solve. The student would be asked to show all their work on the worksheet so that I could see their thought process and identify any areas where they may be struggling.
Monitor the student's progress. As the student works through the worksheet, observe their problem-solving process. These observations would help identify any areas where the student may need additional support or instruction.
Provide feedback. After the student has completed the worksheet, I would review their work and provide feedback. Highlight areas where they did well and areas where…
arrow_forward
You must present an examination of "Population & Housing in San Diego County." The goal is to look for correlations between the shifting population and residential unit density in the county. How would you go about conducting this inquiry? Please specify which data files are needed and where they might be found (including any relevant references). Which data model will you use to conduct this investigation? (You do not need to finish the analysis but you may examine the data files on the web).
arrow_forward
For this assignment you are to find 3 real-world examples of identity theft, preferably medical id theft, but any type will be accepted.
One of the examples should be your own if you were an id theft victim (I personally have been a financial id theft victim twice) or someone you know. This will allow you to know the details of how the theft occurred and how it was handled.
One of the examples must come from research news stories and other resources found on the Web. Provide a link to each website in your report..
The final example may be personal or a news story.
For each case, identify the type of ID theft that occurred, and answer these questions: what did the thief do (if known) to acquire the information, how was it discovered, what were the damages incurred, how long did it take to resolve the problems caused by the id theft?.
arrow_forward
At least two examples/scenarios are required to back up your response and highlight the most important SDLC stage.
arrow_forward
How can you tell if a model meets the requirements for proportionality and additivity?
arrow_forward
Please discuss your feelings about data transformation. Was this the first time that you had heard of it? What do you think? Does this affect your feeling about the statistics that you hear every day? One thing that I'm hoping to read from you is if you are now seeing statistics in a different light. These last couple of chapters bring it all together. You've learned so much these past 13 weeks---now it's time to be able to talk about it.
arrow_forward
What exactly is meant by the term "racial condition," and how is it distinct from a typical condition?
arrow_forward
Assume you've been tasked with creating a logical model of a school's or college's registration system. Is a top-down approach preferable, or a bottom-up approach preferable? What considerations could influence your decision?
arrow_forward
What are the limitations of the data? What assumptions of the model can be made?
arrow_forward
If there is a rating of the competency for each skill an employee possesses, where in the data model would we place this rating?
arrow_forward
Could you provide more details for each of these points?
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you
A Guide to SQL
Computer Science
ISBN:9781111527273
Author:Philip J. Pratt
Publisher:Course Technology Ptr
Related Questions
- For selection sizes where the necessary value is less than or equal to 15, please elaborate on how you may hazard an estimate. I'm wondering how large a pool of potential employees you expect there to be. If the values in the same histogram have a normal distribution between 11 and 15, then an estimated value of 13 may be used as the selection criteria. Simply respond to this inquiry with some text.arrow_forwardCreate a “level of attributes” diagram for a car rental. Include its core benefit, expected attributes, and add-on attributes. Have any of these attributes changed over the past 10 years? Please explain your answer.arrow_forwardPlease describe the relationship between bias, variance, underfitting, and overfitting. Support your answers with examples if needed.arrow_forward
- Explain what you mean by the term "experimental" when referring to a study strategy. Please describe its key characteristics by referring to the most relevant example.arrow_forwardQ3 Rearrange the following steps for the EA problem solving model: Repeat Allow more options for better ones Choose one or more of these, alter them to create new solutions • Discard some of the solutions Over a period, solution quality will improve Create a number of tentative solutionsarrow_forwardWhen evaluating the correctness of a model, the only thing that can be considered is how well the model performs on test data. explain in detail, expand on? Explain?arrow_forward
- Discuss two ways in which the 1:M relationship between COURSE and CLASS can be implemented. (Hint: Think about relationship strength.)arrow_forwardPick and summarise ONE (1) article that uses data classification in a variety of domains. (The summary should concentrate on how the material applies to data classification analysis specifically.) Please respond to the following questions: i. What is the problem or question(s) that this study is addressing? You should be able to locate the focal point. If there are any other issues, make a note of them as well. ii. What is the data's source? There are two or more sources of data in some studies. Give a brief description of how the data was obtained.arrow_forwardDo you have any particular recommendations for capturing the data flow?arrow_forward
- Describe how you will collect data when using the formative assessment as described below. Identify the key skills required to meet the learning objective. In this case, the core skill is rounding two- and three-digit whole numbers to the nearest 10 or 100. The content is the concept of rounding and the rules for rounding numbers. Design the formative assessment. This could be a worksheet with a variety of problems for the student to solve. The student would be asked to show all their work on the worksheet so that I could see their thought process and identify any areas where they may be struggling. Monitor the student's progress. As the student works through the worksheet, observe their problem-solving process. These observations would help identify any areas where the student may need additional support or instruction. Provide feedback. After the student has completed the worksheet, I would review their work and provide feedback. Highlight areas where they did well and areas where…arrow_forwardYou must present an examination of "Population & Housing in San Diego County." The goal is to look for correlations between the shifting population and residential unit density in the county. How would you go about conducting this inquiry? Please specify which data files are needed and where they might be found (including any relevant references). Which data model will you use to conduct this investigation? (You do not need to finish the analysis but you may examine the data files on the web).arrow_forwardFor this assignment you are to find 3 real-world examples of identity theft, preferably medical id theft, but any type will be accepted. One of the examples should be your own if you were an id theft victim (I personally have been a financial id theft victim twice) or someone you know. This will allow you to know the details of how the theft occurred and how it was handled. One of the examples must come from research news stories and other resources found on the Web. Provide a link to each website in your report.. The final example may be personal or a news story. For each case, identify the type of ID theft that occurred, and answer these questions: what did the thief do (if known) to acquire the information, how was it discovered, what were the damages incurred, how long did it take to resolve the problems caused by the id theft?.arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- A Guide to SQLComputer ScienceISBN:9781111527273Author:Philip J. PrattPublisher:Course Technology Ptr
A Guide to SQL
Computer Science
ISBN:9781111527273
Author:Philip J. Pratt
Publisher:Course Technology Ptr