Using Python

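# YOUR CODE HERE
# DecisionTreeClassifier comes from scikit-learn
from sklearn.tree import DecisionTreeClassifier

def train_DT(X, Y):
    # Fit a shallow (depth-2) decision tree with a fixed seed for reproducibility
    clf = DecisionTreeClassifier(random_state=0, max_depth=2)
    clf = clf.fit(X, Y)
    return clf

assert callable(train_DT)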
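To check that train_DT() behaves as intended, it can be run on a tiny dataset. The following is only a sketch: toy_X, toy_Y, toy_clf and toy_pred are made-up names and values, not part of the assignment's data.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier


def train_DT(X, Y):
    """Fit a depth-2 decision tree on predictors X and labels Y."""
    clf = DecisionTreeClassifier(random_state=0, max_depth=2)
    return clf.fit(X, Y)


# Hypothetical toy data: 4 samples, 2 features; the label follows the first feature.
toy_X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
toy_Y = np.array([0, 0, 1, 1])

toy_clf = train_DT(toy_X, toy_Y)
toy_pred = toy_clf.predict(toy_X)
print(toy_pred)
```

Because max_depth is fixed at 2, the tree can use at most two levels of splits, which keeps the model small and limits overfitting.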
2h) Classification #1: Using Only Subject
Let's try to classify the email conversations using only the subject field of the dataframe.
Using the function train_DT(), train a decision tree classifier on subject_train_X (as your predictor) and
category_train_Y (as your outcome), and save the model as subject_clf.
# YOUR CODE HERE
raise NotImplementedError()

assert isinstance(subject_clf, DecisionTreeClassifier)
assert hasattr(subject_clf, "predict")
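One way to fill in this cell is a single call to train_DT(). The sketch below is self-contained, so it builds stand-ins for the notebook's variables. The toy_subjects list, the labels, and the use of CountVectorizer to build subject_train_X are assumptions made for illustration; the question does not say how the real features were built.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.tree import DecisionTreeClassifier


def train_DT(X, Y):
    """Fit a depth-2 decision tree on predictors X and labels Y."""
    clf = DecisionTreeClassifier(random_state=0, max_depth=2)
    return clf.fit(X, Y)


# Hypothetical stand-ins for the notebook's data (not the real email set).
toy_subjects = ["meeting schedule", "lunch plans", "meeting agenda", "weekend lunch"]
category_train_Y = ["work", "personal", "work", "personal"]

# Assumption: the subject text has been turned into a bag-of-words matrix.
subject_train_X = CountVectorizer().fit_transform(toy_subjects)

# The line the exercise asks for: train on the subject features only.
subject_clf = train_DT(subject_train_X, category_train_Y)
```

In the actual notebook, only the last line should be needed, since subject_train_X and category_train_Y are already defined there.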
Now we will use the function classification_report to print out the performance of the classifier on the training set:
# Your classifier should observe an accuracy of around 96%.
subject_predicted_train_Y = subject_clf.predict(subject_train_X)
print(classification_report(category_train_Y, subject_predicted_train_Y))
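classification_report prints per-class precision, recall and F1, plus overall accuracy. The sketch below shows the same evaluation pattern on made-up data (toy_X and toy_Y are hypothetical), with accuracy_score added as a cross-check on the single accuracy figure. The 96% mentioned above applies to the assignment's data, not to this toy set.

```python
import numpy as np
from sklearn.metrics import accuracy_score, classification_report
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy data: one feature, with one sample whose label disagrees
# with its identical neighbours, so the tree cannot fit it perfectly.
toy_X = np.array([[0], [0], [1], [1], [1]])
toy_Y = np.array([0, 0, 1, 1, 0])

toy_clf = DecisionTreeClassifier(random_state=0, max_depth=2).fit(toy_X, toy_Y)

# Predict on the same data used for training, as the exercise does.
toy_pred = toy_clf.predict(toy_X)

report = classification_report(toy_Y, toy_pred)
accuracy = accuracy_score(toy_Y, toy_pred)

print(report)
print(accuracy)
```

Both numbers here are training-set scores, so they tend to be optimistic compared with performance on held-out data.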