TODO 9 Now let's wrap our one-hot code in a class that can be used with Sklearn's pipeline functionality. Finish the OneHotEncoding class which produces one-hot encodings and stores the columns names of the newly generated one-hot for reference later. In the transform() method, convert the input X into one-hot encodings using pd.get_dummies(). Store the output into the variable one_hot. It's similar to what you did in TODO 8. Store the names of the columns for our one-hot encoding one_hot so we can access them later if needed. Store the output into the class variable self.feature_names Think about how you access the columns of a DataFrame! class OneHotEncoding(BaseEstimator, TransformerMixin): def __init__(self): self.feature_names = None def fit(self, X: pd.DataFrame, y: pd.DataFrame = None): # We don't need to set/learn any variables so # we just need to return a reference to the object with 'self' return self def transform(self, X: pd.DataFrame, y: pd.DataFrame = None): # TODO 9.1 one_hot = # TODO 9.2 self.feature_names = return one_hot

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question

TODO 9

Now let's wrap our one-hot code in a class that can be used with Sklearn's pipeline functionality. Finish the OneHotEncoding class which produces one-hot encodings and stores the columns names of the newly generated one-hot for reference later.

  1. In the transform() method, convert the input X into one-hot encodings using pd.get_dummies(). Store the output into the variable one_hot. It's similar to what you did in TODO 8.
  2. Store the names of the columns for our one-hot encoding one_hot so we can access them later if needed. Store the output into the class variable self.feature_names
    1. Think about how you access the columns of a DataFrame!

class OneHotEncoding(BaseEstimator, TransformerMixin):
    def __init__(self):
        self.feature_names = None
    
    def fit(self, X: pd.DataFrame, y: pd.DataFrame = None):
        # We don't need to set/learn any variables so
        # we just need to return a reference to the object with 'self'
        return self
    
    def transform(self, X: pd.DataFrame, y: pd.DataFrame = None):
        # TODO 9.1
        one_hot = 
        
        # TODO 9.2
        self.feature_names =
        
        return one_hot

Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 5 steps with 1 images

Blurred answer
Knowledge Booster
Topological Sort
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education