A group of data scientists want to analyze some data. They already cleaned up the data, with the result being a Dataframe called X_train. The Dataframe X_train has 1058 rows and 13 columns. It has the below columns (picture attached on how it looks like): 'LotFrontage','LotArea','BsmtFinSF1', 'BsmtFinSF2', 'BsmtUnfSF','TotalBsmtSF','1stFlrSF', '2ndFlrSF', 'LowQualFinSF', 'GrLivArea','TotRmsAbvGrd','GarageArea','OpenPorchSF' Answer the following questions: 1. Transform X_train using PCA. Assign the output to a variable X_train_pca. 2. What is wrong with above approach? Scale the data, then repeat the above.
A group of data scientists want to analyze some data. They already cleaned up the data, with the result being a Dataframe called X_train. The Dataframe X_train has 1058 rows and 13 columns. It has the below columns (picture attached on how it looks like): 'LotFrontage','LotArea','BsmtFinSF1', 'BsmtFinSF2', 'BsmtUnfSF','TotalBsmtSF','1stFlrSF', '2ndFlrSF', 'LowQualFinSF', 'GrLivArea','TotRmsAbvGrd','GarageArea','OpenPorchSF' Answer the following questions: 1. Transform X_train using PCA. Assign the output to a variable X_train_pca. 2. What is wrong with above approach? Scale the data, then repeat the above.
Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
Related questions
Question
Use python machine learning.
A group of data scientists want to analyze some data. They already cleaned up the data, with the result being a Dataframe called X_train. The Dataframe X_train has 1058 rows and 13 columns. It has the below columns (picture attached on how it looks like):
'LotFrontage','LotArea','BsmtFinSF1', 'BsmtFinSF2', 'BsmtUnfSF','TotalBsmtSF','1stFlrSF', '2ndFlrSF', 'LowQualFinSF', 'GrLivArea','TotRmsAbvGrd','GarageArea','OpenPorchSF'
Answer the following questions:
1. Transform X_train using PCA. Assign the output to a variable X_train_pca.
2. What is wrong with above approach? Scale the data, then repeat the above.
![0
1
2
3
4
1452
1453
1454
1455
1459
LotFrontage LotArea BsmtFin SF1 BsmtFinSF2 BsmtUnfSF TotalBsmtSF 1stFlrSF 2nd FlrSF LowQualFin SF
706
854
0
978
486
216
655
65.0
80.0
68.0
60.0
84.0
8450
62.0
75.0
9600
11250
9550
14260
35.0
3675
90.0 17217
62.0
7500
7917
9937
1058 rows x 13 columns
...
547
0
410
0
830
0
0
0
0
0
ooo。
0
0
0
0
290
150
284
434
540
490
0
1140
811
953
136
856
1262
920
756
1145
547
1140
1221
953
1256
856
1262
920
961
1145
1072
1140
1221
953
1256
0
866
756
1053
0
0
0
694
0
0
0
0
0
0
000 00
0
0
GrLivArea TotRmsAbvGrd Garage Area O
548
460
608
642
1710
1262
1786
1717
2198
1072
1140
1221
1647
1256
8
6
6
7
9
5
6
6
7
6
836
525
0
400
460
276](/v2/_next/image?url=https%3A%2F%2Fcontent.bartleby.com%2Fqna-images%2Fquestion%2F585fc841-d538-4422-aa5e-520dda1af9cf%2F4a19984b-663c-449d-a8c1-b95c6cf8c85b%2F6xx7g1a_processed.png&w=3840&q=75)
Transcribed Image Text:0
1
2
3
4
1452
1453
1454
1455
1459
LotFrontage LotArea BsmtFin SF1 BsmtFinSF2 BsmtUnfSF TotalBsmtSF 1stFlrSF 2nd FlrSF LowQualFin SF
706
854
0
978
486
216
655
65.0
80.0
68.0
60.0
84.0
8450
62.0
75.0
9600
11250
9550
14260
35.0
3675
90.0 17217
62.0
7500
7917
9937
1058 rows x 13 columns
...
547
0
410
0
830
0
0
0
0
0
ooo。
0
0
0
0
290
150
284
434
540
490
0
1140
811
953
136
856
1262
920
756
1145
547
1140
1221
953
1256
856
1262
920
961
1145
1072
1140
1221
953
1256
0
866
756
1053
0
0
0
694
0
0
0
0
0
0
000 00
0
0
GrLivArea TotRmsAbvGrd Garage Area O
548
460
608
642
1710
1262
1786
1717
2198
1072
1140
1221
1647
1256
8
6
6
7
9
5
6
6
7
6
836
525
0
400
460
276
Expert Solution
![](/static/compass_v2/shared-icons/check-mark.png)
This question has been solved!
Explore an expertly crafted, step-by-step solution for a thorough understanding of key concepts.
Step by step
Solved in 2 steps
![Blurred answer](/static/compass_v2/solution-images/blurred-answer.jpg)
Knowledge Booster
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.Recommended textbooks for you
![Database System Concepts](https://www.bartleby.com/isbn_cover_images/9780078022159/9780078022159_smallCoverImage.jpg)
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
![Starting Out with Python (4th Edition)](https://www.bartleby.com/isbn_cover_images/9780134444321/9780134444321_smallCoverImage.gif)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
![Digital Fundamentals (11th Edition)](https://www.bartleby.com/isbn_cover_images/9780132737968/9780132737968_smallCoverImage.gif)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
![Database System Concepts](https://www.bartleby.com/isbn_cover_images/9780078022159/9780078022159_smallCoverImage.jpg)
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
![Starting Out with Python (4th Edition)](https://www.bartleby.com/isbn_cover_images/9780134444321/9780134444321_smallCoverImage.gif)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
![Digital Fundamentals (11th Edition)](https://www.bartleby.com/isbn_cover_images/9780132737968/9780132737968_smallCoverImage.gif)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
![C How to Program (8th Edition)](https://www.bartleby.com/isbn_cover_images/9780133976892/9780133976892_smallCoverImage.gif)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
![Database Systems: Design, Implementation, & Manag…](https://www.bartleby.com/isbn_cover_images/9781337627900/9781337627900_smallCoverImage.gif)
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
![Programmable Logic Controllers](https://www.bartleby.com/isbn_cover_images/9780073373843/9780073373843_smallCoverImage.gif)
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education