Machine Learning Assignment 6 - Probabilistic Models
Fall 2023
Meet Sakariya 14473322

1 Theory

1. Consider the following set of training examples for an unknown target function x1, x2 -> y:

   Y   x1   x2   Count
   +   T    T    3
   +   T    F    4
   +   F    T    4
   +   F    F    1
   -   T    T    0
   -   T    F    1
   -   F    T    3
   -   F    F    5

(a) Compute the posteriors for the observation x = [T, T] using:

i. Inference (5pts)

Solution: Posterior by direct inference. From the joint counts,
P(y=+, x=[T,T]) = 3/21 and P(y=-, x=[T,T]) = 0/21.
Normalize over the two classes:
P(y=+ | x=[T,T]) = 3/(3+0) = 1
P(y=- | x=[T,T]) = 0/(3+0) = 0

ii. Naive Bayes (5pts)

Solution: Posterior using Naive Bayes.
P(y=+) = 12/21 = 0.571
P(x1=T | y=+) = (3+4)/12 = 0.583
P(x2=T | y=+) = (3+4)/12 = 0.583
P(y=-) = 9/21 = 0.429
P(x1=T | y=-) = 1/9 = 0.111
P(x2=T | y=-) = 3/9 = 0.333

Naive Bayes for x = [T, T] uses P(Y | X) = P(Y) · P(X | Y) / P(X), with the naive factorization P(X | Y) = P(x1 | Y) · P(x2 | Y):
P(y=+ | x=[T,T]) ∝ P(y=+) · P(x1=T | y=+) · P(x2=T | y=+) = (0.571)(0.583)(0.583) = 0.194
P(y=- | x=[T,T]) ∝ P(y=-) · P(x1=T | y=-) · P(x2=T | y=-) = (0.429)(0.111)(0.333) = 0.016

Normalize:
P(y=+ | x=[T,T]) = 0.194/(0.194+0.016) = 0.924
P(y=- | x=[T,T]) = 0.016/(0.194+0.016) = 0.076
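These posteriors can be double-checked with a few lines of Python. The sketch below simply hard-codes the count table from the question and recomputes both the exact-inference and the Naive Bayes posteriors for x = [T, T]; the dictionary layout and variable names are just for illustration.

```python
# Recompute the posteriors for x = [T, T] from the count table above.
counts = {
    ('+', 'T', 'T'): 3, ('+', 'T', 'F'): 4, ('+', 'F', 'T'): 4, ('+', 'F', 'F'): 1,
    ('-', 'T', 'T'): 0, ('-', 'T', 'F'): 1, ('-', 'F', 'T'): 3, ('-', 'F', 'F'): 5,
}
total = sum(counts.values())  # 21 training examples

# (i) Exact inference: keep only the rows with x = [T, T], then normalize over y.
joint = {y: counts[(y, 'T', 'T')] / total for y in ('+', '-')}
z = sum(joint.values())
posterior_inference = {y: p / z for y, p in joint.items()}
print("inference  :", posterior_inference)   # {'+': 1.0, '-': 0.0}

# (ii) Naive Bayes: posterior proportional to P(y) * P(x1=T | y) * P(x2=T | y).
unnorm = {}
for y in ('+', '-'):
    n_y = sum(c for (lbl, _, _), c in counts.items() if lbl == y)
    prior = n_y / total
    p_x1 = sum(c for (lbl, x1, _), c in counts.items() if lbl == y and x1 == 'T') / n_y
    p_x2 = sum(c for (lbl, _, x2), c in counts.items() if lbl == y and x2 == 'T') / n_y
    unnorm[y] = prior * p_x1 * p_x2
z = sum(unnorm.values())
posterior_nb = {y: p / z for y, p in unnorm.items()}
print("naive Bayes:", posterior_nb)          # approximately {'+': 0.924, '-': 0.076}
```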
2 Naive Bayes Classifier

Let's train and test a Naive Bayes classifier to classify the fetal state from the Cardiotocography (CTG) dataset. Write a script that:

1. Reads in the data.
2. Shuffles the observations.
3. Selects the first 2/3 (round up) of the data for training and the remaining for validation.
4. Pre-processes the data. Although technically some of the columns are discrete valued, let's treat them all as continuous and convert them to binary ones using the mean of that feature, as computed from the training data.
5. You can now use the training dataset to compute:
   (a) Class priors
   (b) Naive probabilities, P(xi | y), for each feature of each class.
6. Given that information, you can now classify each validation sample.

In your report you will need:

1. Description of any additional pre-processing of the dataset you did.
2. The validation accuracy of your system.
3. Your confusion matrix.

Solution: Here are the main pre-processing steps I performed on the CTG dataset:

1) I shuffled the rows randomly using scikit-learn's shuffle function to ensure the training and validation sets contain a random mix of samples.
2) I split the shuffled dataframe into a training set with the first 2/3 of the observations and a validation set with the remaining 1/3.
3) For each feature, I computed the mean value from the training data, then binarized that feature in both the training and validation sets by setting values >= the mean to 1 and values < the mean to 0. This makes the continuous features suitable for a binary naive Bayes model.
4) The original data had a 'CLASS' column that I discarded, keeping only the 'NSP' target column.

Accuracy: 0.8486562942008486

Confusion matrix:
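For completeness, the pipeline described above can be written as a short script. The sketch below is a minimal illustration rather than the exact submitted code: the input file name ("CTG.csv"), the fixed random seed, and the helper names are assumptions made here, while the 'CLASS'/'NSP' column handling, the 2/3 split, and the mean-based binarization follow the description in the report.

```python
# Minimal sketch of the Naive Bayes pipeline (assumes the CTG data was exported to "CTG.csv").
import numpy as np
import pandas as pd
from sklearn.utils import shuffle

df = pd.read_csv("CTG.csv")                 # 1. read in the data
df = shuffle(df, random_state=0)            # 2. shuffle the observations (seed is an assumption)
df = df.drop(columns=["CLASS"])             # keep only the 'NSP' target column

n_train = int(np.ceil(2 * len(df) / 3))     # 3. first 2/3 (rounded up) for training
train, valid = df.iloc[:n_train], df.iloc[n_train:]

features = [c for c in df.columns if c != "NSP"]
means = train[features].mean()              # 4. binarize: >= training mean -> 1, else 0
X_train = (train[features] >= means).astype(int).to_numpy()
X_valid = (valid[features] >= means).astype(int).to_numpy()
y_train = train["NSP"].to_numpy()
y_valid = valid["NSP"].to_numpy()

# 5. class priors and per-feature conditional probabilities P(x_i = 1 | y)
classes = np.unique(y_train)
priors = {c: np.mean(y_train == c) for c in classes}
cond = {c: X_train[y_train == c].mean(axis=0) for c in classes}

# 6. classify each validation sample by the largest log posterior
def predict(x):
    scores = {}
    for c in classes:
        p = np.where(x == 1, cond[c], 1.0 - cond[c])
        # clipping guards against log(0) when a conditional probability is exactly 0 or 1
        scores[c] = np.log(priors[c]) + np.sum(np.log(np.clip(p, 1e-12, None)))
    return max(scores, key=scores.get)

preds = np.array([predict(x) for x in X_valid])
print("Validation accuracy:", np.mean(preds == y_valid))

# confusion matrix: rows = true class, columns = predicted class
print(pd.crosstab(pd.Series(y_valid, name="true"), pd.Series(preds, name="pred")))
```

Working in log space avoids numerical underflow when multiplying many small probabilities, and the clipping handles features whose binarized conditional probability is exactly 0 or 1 in the training split.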