mis6356Dimension Reduction(1)

.pdf

School

Arizona State University *

*We aren’t endorsed by this school

Course

6356

Subject

Information Systems

Date

Oct 30, 2023

Type

pdf

Pages

20

Uploaded by PrivateDugongMaster226

Report
Dimension Reduction James Zhang MIS 6356 BA with R
Exploring the data Statistical summary of data: common metrics Average Median Minimum Maximum Standard deviation Counts & percentages
Reducing Categories A single categorical variable with m categories is typically transformed into m or m-1 dummy variables (handled automatically by most R modeling functions Each dummy variable takes the values 0 or 1 0 = “no” for the category 1 = “yes” Problem: Can end up with too many variables Solution: Reduce by combining categories that are close to each other Use pivot tables to assess outcome variable sensitivity to the dummies
Combining Categories Many zoning categories are the same or similar with respect to CATMEDV
Principal Components Analysis Goal: Reduce a set of numerical variables. The idea: Remove the overlap of information between these variable. [“Information” is measured by the sum of the variances of the variables.] Final product: A smaller number of numerical variables that contain most of the information
Principal Components Analysis How does PCA do this? Create new variables that are linear combinations of the original variables (i.e., they are weighted averages of the original variables). These new variables are uncorrelated (no information overlap), and only a few of them contain most of the original information. The new variables are called principal components .
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help

Browse Popular Homework Q&A

Q: Let A, B, C, D be the vertices of a square with side length 100. If we want to create a…
Q: The radian measure of an angle of - 218 degrees is
Q: Find the payment necessary to amortize a 4% loan of $1800 compounded quarterly, with 19 quarterly…
Q: Zeller's congruence is an algorithm developed by Christian Zeller to calculate the day of the week.…
Q: What minimum specifications does his computer need in order to run Windows 10? Which of the two CPU…
Q: ? ? ? -11) 1. T(x) = (311+12, 2. T(x) = (2x1, x₂) ✓ 3. T(x) = (₁ + 10, ₂)T I1)T
Q: Prove (°C ), - r(WP) and (°CF ), --r(3) - T ӘР For ideal gas, pV=RT. Show that Cy is independent of…
Q: Find the equivalent capacitance of the circuit below.
Q: Which of the following will have the lowest average kinetic energy? OA) H₂ at 400 °C O B) O₂ at 300…
Q: The ponderal index is a measure of overall size similar to a body mass index. The ponderal index of…
Q: Evaluate the following integral:4² - 10t - 1 dt O7714 O98/3 86/3 O 83/3
Q: Let B = {(1, 3), (-2,-2)} and B' = {(-12, 0), (-4,4)} be bases for R2, and let - [²9] 43 be the…
Q: Use the geometric series test to determine whether \sum_(n=0)^(\infty ) 4((\pi )/(5))^(n) converges…
Q: If a dumbbell has a weight of 44.5 N, its mass is: a. 44.5 kg b. 10 lbs c. 4.5 kg d. 436 kg
Q: Use the formula for nPr to solve the following question. A club with sixteen members is to choose…
Q: 1 Simplify the expression: 10h - 4h =
Q: Polly Manufacturing Company acquired equipment on January 1, 2022, for $527,000. Estimated useful…
Q: Write the proton condition and acid/base mass balance equation for each of the following systems.
Q: 60% of the voters favor Ms. Stein. If 250 voters are chosen at random, what is the expected number…
Q: Consider the function f(x) = x*el8. For this function there are three important intervals: (– 00,…
Q: Using the Discriminant In Exercises9–14, use the discriminant to find the numberof real and…
Q: Consider two markets: the market for cat food and the market for snake oil. The initial equilibrium…