Problem 1. Consider a dataset with three columns of binary attributes A1, Az and a binary | attribute Y. There are eight types of data point in total, and their corresponding proportio the dataset are captured in the column P.

Functions and Change: A Modeling Approach to College Algebra (MindTap Course List)
6th Edition
ISBN:9781337111348
Author:Bruce Crauder, Benny Evans, Alan Noell
Publisher:Bruce Crauder, Benny Evans, Alan Noell
Chapter5: A Survey Of Other Common Functions
Section5.3: Modeling Data With Power Functions
Problem 6E: Urban Travel Times Population of cities and driving times are related, as shown in the accompanying...
icon
Related questions
Question
2
Problem 1. Consider a dataset with three columns of binary attributes Aj, A2 and a binary label
attribute Y. There are eight types of data point in total, and their corresponding proportions in
the dataset are captured in the column P.
type ! A1 A2 Y
0 8%
P
1
1
2
1
1
29%
3
1
1
2%
4
1
1
18%
16%
2%
5
6
1
1
1%
1
1
1 24%
(a) What is the GINI index of the dataset?
(b) What is the GINI index of the split on A1 and
that on A2 respectively?
Transcribed Image Text:Problem 1. Consider a dataset with three columns of binary attributes Aj, A2 and a binary label attribute Y. There are eight types of data point in total, and their corresponding proportions in the dataset are captured in the column P. type ! A1 A2 Y 0 8% P 1 1 2 1 1 29% 3 1 1 2% 4 1 1 18% 16% 2% 5 6 1 1 1% 1 1 1 24% (a) What is the GINI index of the dataset? (b) What is the GINI index of the split on A1 and that on A2 respectively?
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 3 steps with 2 images

Blurred answer