BDAT1007_Assignment 3_Group 2

.docx

School

Georgian College *

*We aren’t endorsed by this school

Course

1007

Subject

Industrial Engineering

Date

Feb 20, 2024

Type

docx

Pages

9

Uploaded by AgentGrouseMaster1039

BDAT 1007: SOCIAL DATA MINING TECHNIQUES Assignment 3 – Group Work March 24, 2023 Elysse Joy Angelica Pascual, Iman Shokri, Haben Iyob, Suryadevsinh Zala Georgian College
Group 2 Topic: Technology Data: Technology - Kaggle Dataset Link: https:// Part A – Decision Tree Analysis - Iman Shokri Attributes (Display Size, Graphic Card, Original Price, OS, Star Rating) The first step was to use the retrieve operator to load the input data, which was followed by select attributes to select the required attributes. Furthermore, the set role operator was used to set our target variable. After that cross-validation operator has been added to split the data into training and testing data which will be further used in the process. Cross validation operator connects to a split screen where we define our training and testing operators. The max depth for decision tree operator was set to 5. After that apply model operator was used to apply the model. The last operator added was performance operator which helped us evaluate the performance.
Part B – Logistic Regression Analysis - Iman Shokri Firstly, we are using a retrieve operator to read input data, followed by a select attributes operator which is used to select specific dependent (target) variable and independent variables. The third operator is numerical to binomial operator used to convert our target variable, from numerical to binomial. Then we set our target variable by using set role operator followed by a logistic regression operator which takes the training dataset and generates the predicted model. The next operator we have used is apply model which expects two inputs –one is a model coming from output of logistic regression operator and the other is the example set generated from the LR operator. Finally, we use a performance operator to evaluate the performance of the model.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help