Data Mining for Business Analytics: Concepts, Techniques, and Applications with XLMiner
3rd Edition
ISBN: 9781118729274
Author: Galit Shmueli, Peter C. Bruce, Nitin R. Patel
Publisher: WILEY
expand_more
expand_more
format_list_bulleted
Expert Solution & Answer
Chapter 2, Problem 6P
Explanation of Solution
Given: A specific company made the training data by using the internal data including the purchase and demographic information. The future data to be arranged will include the demographic data of the list of purchases from other sources. In the training data, the predictor, “refund issued�, was a useful predictor.
To find: The explanation on why the predictor named “refund issued� in the training data cannot be considered as an appropriate variable to be incorporated in the model...
Expert Solution & Answer
Want to see the full answer?
Check out a sample textbook solutionStudents have asked these similar questions
A manager of a chain of 20 Sports Bars would like to be able to predict daily revenue for each restaurant based on several factors. Recognizing that each sports bar is different, the manager assigns each an id number from 1 to 20 . What is the problem with including that id number as an attribute in a model to predict daily revenue?
2.
The intent behind using the data is to predict fraudulent transactions accurately. The number of records used to train the model is exactly 1/6 of the entire annual transactions count, which was 312 million. The model is able to predict a total of 2400 records correctly as fraudulent , while it fails to sensitively predict the remaining 6600 fraudulent transactions correctly. The type 1 error count comes to be just a meagre 1% of the non-fraudulent cases present in the training data. Construct the confusion matrix using the above information. Assume fraudulent transactions as the positive class. Calculate Accuracy of the Trained Model, Misclassification rate, Precision, Sensitivity, F1-Score
Question 8
Each regression model can only have one dependent variable.
Question 8 options:
True
False
Chapter 2 Solutions
Data Mining for Business Analytics: Concepts, Techniques, and Applications with XLMiner
Knowledge Booster
Similar questions
- There are many factors when determining the performance of your model. What are some ways to evaluate regression versus classification models?arrow_forwardThe difference between EDA and hypothesis testing, and why analysts may prefer EDA when doing data miningarrow_forwardHow important is data modeling in the analysis procedure? Can we estimate how much information our model will require?arrow_forward
- In what ways can data modelling aid in the analysis process? Can we estimate how much information will be required for our model?arrow_forwardWhen assessing the performance of your model, there are a lot of different elements to consider. How can we compare and contrast the predictive power of classification and regression models?arrow_forwarddescribe the usefulness of a data model in relation to the traditional way of predicting strength.arrow_forward
- QUESTION Your analytics team is developing a linear regression model on training data and then testing this model on testing data. They are curious why we do not keep adding predictor variables when fitting the model. You tell them adding additional predictor variables to the model will always increase model ASE on the testing data. always decrease model ASE on the testing data. always decrease model ASE on the training data. always increase model ASE on the training data.arrow_forwardWhy is it important to identify potential null values during data modeling?arrow_forwardIn comparison to stepwise regression, what are the benefits of using all-subsets regression for the analysis of data rather than stepwise regression?arrow_forward
arrow_back_ios
arrow_forward_ios
Recommended textbooks for you
- Database System ConceptsComputer ScienceISBN:9780078022159Author:Abraham Silberschatz Professor, Henry F. Korth, S. SudarshanPublisher:McGraw-Hill EducationStarting Out with Python (4th Edition)Computer ScienceISBN:9780134444321Author:Tony GaddisPublisher:PEARSONDigital Fundamentals (11th Edition)Computer ScienceISBN:9780132737968Author:Thomas L. FloydPublisher:PEARSON
- C How to Program (8th Edition)Computer ScienceISBN:9780133976892Author:Paul J. Deitel, Harvey DeitelPublisher:PEARSONDatabase Systems: Design, Implementation, & Manag...Computer ScienceISBN:9781337627900Author:Carlos Coronel, Steven MorrisPublisher:Cengage LearningProgrammable Logic ControllersComputer ScienceISBN:9780073373843Author:Frank D. PetruzellaPublisher:McGraw-Hill Education
Database System Concepts
Computer Science
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:McGraw-Hill Education
Starting Out with Python (4th Edition)
Computer Science
ISBN:9780134444321
Author:Tony Gaddis
Publisher:PEARSON
Digital Fundamentals (11th Edition)
Computer Science
ISBN:9780132737968
Author:Thomas L. Floyd
Publisher:PEARSON
C How to Program (8th Edition)
Computer Science
ISBN:9780133976892
Author:Paul J. Deitel, Harvey Deitel
Publisher:PEARSON
Database Systems: Design, Implementation, & Manag...
Computer Science
ISBN:9781337627900
Author:Carlos Coronel, Steven Morris
Publisher:Cengage Learning
Programmable Logic Controllers
Computer Science
ISBN:9780073373843
Author:Frank D. Petruzella
Publisher:McGraw-Hill Education