CIS 5200: Machine Learning
Fall 2023
Final Exam Sample Questions
University of Pennsylvania
This is a brief document containing sample questions from the second half of the course. For questions on topics from the first half of the course, see the practice midterm.
SAMPLE T/F Questions [?? points]
For each question below, indicate whether the statement is True or False. If you believe the statement is false, justify your answer. Correct “True” answers are worth one point, while correct “False” answers are worth two points: one for the correct answer, and one for a correct justification.
(a) (T/F) It’s okay to evaluate test error multiple times on the same test set, so long as the model was never trained on that test set.
(b) (T/F) Boosting reduces bias.
(c) (T/F) When boosting the squared loss, examples that the current ensemble gets correct are removed from the training set.
(d) (T/F) True positive rate, recall, and specificity all measure the same thing.
(e) (T/F) A model that achieves 99% test accuracy is always a good model.
(f) (T/F) In AdaBoost, the weight of any given training example is strictly decreasing as you boost more.
(g) (T/F) When bagging, it is possible for a training example to appear more than once in the same bootstrap sample or “bag”.
(h) (T/F) In general, boosted decision stumps (decision trees with a single split) are less likely to overfit than single, full-depth decision trees.
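
To make the sampling procedure behind statement (g) concrete, here is a minimal sketch of drawing one bootstrap sample. It assumes the usual definition of a bootstrap sample (as many draws as there are training examples, uniformly at random, with replacement). The 10-example training set, the seed, and the helper name `bootstrap_sample` are hypothetical and chosen only for illustration.

```python
import random
from collections import Counter


def bootstrap_sample(data, rng):
    """Draw len(data) items from data uniformly at random, with replacement."""
    return rng.choices(data, k=len(data))


rng = random.Random(0)        # arbitrary fixed seed, for reproducibility only
train_ids = list(range(10))   # hypothetical training set: example indices 0..9

bag = bootstrap_sample(train_ids, rng)
counts = Counter(bag)

# Indices drawn more than once, and indices that were never drawn.
repeated = sorted(i for i, c in counts.items() if c > 1)
left_out = sorted(set(train_ids) - set(bag))

print("bag:", bag)
print("appear more than once:", repeated)
print("not in this bag:", left_out)
```

Because the bag has the same size as the training set, any index that is drawn twice necessarily leaves some other index out of that bag.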
Name:                    Penn ID:
SAMPLE SHORT ANSWER QUESTIONS
(a) [2 points] Suppose you are given the following dataset, along with an ensemble $H(x)$ that makes the following predictions:

    x    y    H(x)
    1    5    5.5
    2    6    5.5
    3    7    7.5
    4    8    7.5

You are training a gradient boosted regression tree, boosting the squared loss. What dataset
would you train your next weak learner on? List both the feature values and labels you would
call the weak learner algorithm with.
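
As a rough sketch of the computation this question points at: in gradient boosting, the next weak learner is typically fit to the negative gradient of the loss with respect to the current predictions. The code below assumes the squared loss is written as $\frac{1}{2}(H(x) - y)^2$, so that the negative gradient is the residual $y - H(x)$; under a convention without the $\frac{1}{2}$, the targets would be scaled by a constant factor. The function name is hypothetical.

```python
# Data and ensemble predictions copied from the table in the question.
xs = [1, 2, 3, 4]
ys = [5, 6, 7, 8]
preds = [5.5, 5.5, 7.5, 7.5]


def negative_gradient_squared_loss(y, h):
    """Negative gradient of 0.5 * (h - y)**2 with respect to h (assumed convention)."""
    return y - h


# Features stay the same; the labels are replaced by the residuals.
residuals = [negative_gradient_squared_loss(y, h) for y, h in zip(ys, preds)]
next_dataset = list(zip(xs, residuals))

print(next_dataset)  # [(1, -0.5), (2, 0.5), (3, -0.5), (4, 0.5)]
```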
(b) [2 points] Suppose you determine that $0.99n$ of your $n$ training labels are +1. Would you expect accuracy to be a good measure of performance in this scenario? If yes, why? If not, what are some alternative performance metrics that you might use instead?
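
One way to explore this scenario numerically is to score a trivial classifier that always predicts +1. The sketch below assumes $n = 1000$ (a hypothetical value, so that 990 labels are +1 and 10 are -1) and computes accuracy alongside the true positive rate, the true negative rate, and their average (often called balanced accuracy).

```python
n = 1000                             # hypothetical dataset size
labels = [+1] * 990 + [-1] * 10      # 0.99 * n labels are +1
preds = [+1] * n                     # trivial classifier: always predict +1

# Confusion-matrix counts, treating +1 as the positive class.
tp = sum(1 for y, p in zip(labels, preds) if y == +1 and p == +1)
tn = sum(1 for y, p in zip(labels, preds) if y == -1 and p == -1)
fp = sum(1 for y, p in zip(labels, preds) if y == -1 and p == +1)
fn = sum(1 for y, p in zip(labels, preds) if y == +1 and p == -1)

accuracy = (tp + tn) / n
true_positive_rate = tp / (tp + fn)  # also called recall or sensitivity
true_negative_rate = tn / (tn + fp)  # also called specificity
balanced_accuracy = (true_positive_rate + true_negative_rate) / 2

print("accuracy:", accuracy)
print("true positive rate:", true_positive_rate)
print("true negative rate:", true_negative_rate)
print("balanced accuracy:", balanced_accuracy)
```

The constant classifier reaches 99% accuracy while never identifying a single negative example, which is the gap the per-class metrics expose.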
(c) [2 points] Suppose you run PCA on a training dataset $\{x_1, \ldots, x_n\}$. After computing all eigenvalues of the sample covariance matrix $\lambda_1, \ldots, \lambda_n$, you observe that:

$$\frac{\lambda_1 + \lambda_2}{\lambda_1 + \cdots + \lambda_n} = 0.98$$

What does this mean about your data?
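
The ratio above is the fraction of the total variance captured by the first two principal components, assuming the eigenvalues are indexed in decreasing order. A small sketch of that computation follows; the eigenvalues are hypothetical, chosen so that the ratio comes out to 0.98, and the function name is an assumption made for illustration.

```python
def explained_variance_ratio(eigenvalues, k):
    """Fraction of total variance captured by the k largest eigenvalues."""
    ordered = sorted(eigenvalues, reverse=True)
    return sum(ordered[:k]) / sum(ordered)


# Hypothetical eigenvalues of a sample covariance matrix (not from the exam).
eigenvalues = [600, 380, 10, 6, 4]

ratio = explained_variance_ratio(eigenvalues, k=2)
print(ratio)  # 0.98
```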
(d) [2 points] Suppose you are training a decision tree on a binary classification dataset with 10 training examples. You are considering a feature split that splits your dataset into the following two subsets:
    (a) Subset 1: 6 instances (4 positive labels, 2 negative labels)
    (b) Subset 2: 4 instances (1 positive label, 3 negative labels)