a3 (1)
.Rmd
keyboard_arrow_up
School
University of California, Irvine *
*We aren’t endorsed by this school
Course
10A
Subject
Computer Science
Date
Dec 6, 2023
Type
Rmd
Pages
3
Uploaded by BailiffLyrebirdMaster934
---
title: Assignment 3
author: Jeff Rouder
output: pdf_document
---
# Objective
The goal for this week is that you learn basic data manipulation steps such as
setting up selection filters and `tapply()`
# Background
We are going to continue our analysis of the teacher evaluation data.
Each
questionnaire consists of a bunch of questions including diagnostics such as
clarity of explanations, organization of the course, and enthusiasm.
We ask
whether we can use these questions to understand the strength and weaknesses of
courses.
# Submission Details
- All assignments are turned in through Canvas
- Upload your pdf only (no screen shots, no text, no .Rmd)
- Due Sunday night, 11:59pm
- Be midnful of late policy.
# Load up new data set.
```{r,message=F,warning=F}
link<-"https://raw.githubusercontent.com/rouderj/uciPsych10A-
F21/main/eaterSocSciM.dat"
dat<-read.table(url(link),head=T)
```
Let's look at dat
```{r}
dat[1:10,]
```
# Quality-Control Check
1. How many rows and how many columns are in this set?
What are the column names?
Let's explore how many courses and questions there are in the set.
Here is the course count:
```{r}
table(dat$course)
```
Notice there are 81 courses and 18 scores per course.
2. Use the `table()` function to count the number of questions.
How many
questions?
How many scores per question?
Does the number of rows, 1458, make
sense?
Why?
# Data manipulation:
The responses to each question is numbered from 1 (lowest) to 7 (highest).
The
score is just the mean across all students for a course.
Last week, we studied the score as a function of course.
We averaged across all
responses and all questions.
We can do the same here.
We use `tapply`.
```{r}
courseMean <- tapply(dat$score,dat$course,mean)
print(courseMean) # a 'named' vector
names(courseMean) #just the names of each course
as.vector(courseMean) #without names
```
tapply() takes three arguments:
i. a vector
ii. a grouping vector,
iii. a function
The vector courseMean should be the same as last week's score.
```{r}
boxplot(courseMean)
```
# Ordering Output
We can get a better view of the scores by class simply by ordering them!
I do this
routinely.
```{r}
sort(courseMean)
```
3. What is the mean of scores for each question?
Which question has the highest
mean? (actual question, not the number)
Which has the lowest (actual question, not
the number)?
The actual questions are at https://github.com/rouderj/uciPsych10A-
F21/blob/main/eaterQuestions.txt
# Course Profile
Let's switch our focus to questions.
The 18 questions may be found here:
https://github.com/rouderj/uciPsych10A-F21/blob/main/eaterQuestions.txt.
Perhaps
these are *diagnostic,* that is, they provide information about the strengths and
faults of each course.
Let's define a *profile* as the collection of 18 scores for a given class.
For
example, here is the *profile* for the first course.
Notice the use of filter:
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Related Questions
Alert dont submit AI generated answer.
arrow_forward
Do you have any suggestions on ways that might be used to capture the data flow?
arrow_forward
Automatic scaling must be specified.
arrow_forward
Q3: Jami is an intern on a sales team at a large insurance company. On the first day of the internship, Jami is asked to download a list of all of her friends and contacts from Facebook and upload them to the company database. The team lead explains "This database keeps track of all of our employees, and well as their customers and prospects. This makes it really easy to avoid redundant customer interactions, and also keeps us from cannibalizing customers and business from one another." Jami is wondering if there are ethical issues she should be concerned about. Do you think there are ethical issues involved with this situation? If yes- what are those issues, specifically, and how would you advise Jami?
arrow_forward
Before I deliver my PowerPoint presentation to my professor, please double-check all of my references and APA citations.
arrow_forward
use python
Web Scraping
A marketing company would like to know if varsity (college) swimmers are (on average) taller than their volleyball counterparts. You have been asked to create a data driven solution in order to answer this question.
The following web pages contain the roster of the Bearcats’ men’s and women’s swimming and volleyball teams.
Men’s Swimming Team
https://athletics.baruch.cuny.edu/sports/mens-swimming-and-diving/roster
Men’s Volleyball Team
https://athletics.baruch.cuny.edu/sports/mens-volleyball/roster
Women’s Swimming Team
https://athletics.baruch.cuny.edu/sports/womens-swimming-and-diving/roster
Women’s Volleyball Team
https://athletics.baruch.cuny.edu/sports/womens-volleyball/roster
The height of each player is listed on all web pages.1. Scrape the heights of all the players on the men’s swimming team and find the average.
2. Scrape the heights of all the players on the men’s volleyball team and find the average.
3. Scrape the heights of all the players on the…
arrow_forward
Can you provide me with the answers to the rest of the questions except Question 8, as shown in the images.
arrow_forward
Please prepare task as instructed by picture below.i will also add another picture as reference and guide to how it should be done
Please prepare figures correctly, thank you
arrow_forward
How to implement CRUD methods on NPOI(deleting , Creating ,Updating) please help
i have managed to import an excel file to a datagrid and now I want to be able to modify the data in it
arrow_forward
Scaling must be automatically setup.
arrow_forward
Subject: Clarification on Project Submission for Dataset 2: Modified Adult Salary Dataset - Binary Classification Model
I am working on the project for the "Modified Adult Salary Dataset - Binary Classification Model" and would appreciate your assistance in doing this project , and meeting all the requirements.
I am focusing on the following aspects:
Data Exploration: Gaining insights into the dataset, identifying potential issues such as missing values, outliers, or class imbalance.
Data Visualization: Utilizing the appropriate techniques to visualize relationships and distributions in the data.
Data Preprocessing: Handling any data inconsistencies, performing encoding and scaling as needed, and considering methods to balance the dataset.
Model Selection and Tuning: Implementing neural network models, performing hyperparameter tuning, and evaluating the performance of the best model.
If there are any areas that need further clarification or improvement, I would appreciate your…
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you
Database Systems: Design, Implementation, & Manag...
Computer Science
ISBN:9781305627482
Author:Carlos Coronel, Steven Morris
Publisher:Cengage Learning
COMPREHENSIVE MICROSOFT OFFICE 365 EXCE
Computer Science
ISBN:9780357392676
Author:FREUND, Steven
Publisher:CENGAGE L
Np Ms Office 365/Excel 2016 I Ntermed
Computer Science
ISBN:9781337508841
Author:Carey
Publisher:Cengage
Related Questions
- Q3: Jami is an intern on a sales team at a large insurance company. On the first day of the internship, Jami is asked to download a list of all of her friends and contacts from Facebook and upload them to the company database. The team lead explains "This database keeps track of all of our employees, and well as their customers and prospects. This makes it really easy to avoid redundant customer interactions, and also keeps us from cannibalizing customers and business from one another." Jami is wondering if there are ethical issues she should be concerned about. Do you think there are ethical issues involved with this situation? If yes- what are those issues, specifically, and how would you advise Jami?arrow_forwardBefore I deliver my PowerPoint presentation to my professor, please double-check all of my references and APA citations.arrow_forwarduse python Web Scraping A marketing company would like to know if varsity (college) swimmers are (on average) taller than their volleyball counterparts. You have been asked to create a data driven solution in order to answer this question. The following web pages contain the roster of the Bearcats’ men’s and women’s swimming and volleyball teams. Men’s Swimming Team https://athletics.baruch.cuny.edu/sports/mens-swimming-and-diving/roster Men’s Volleyball Team https://athletics.baruch.cuny.edu/sports/mens-volleyball/roster Women’s Swimming Team https://athletics.baruch.cuny.edu/sports/womens-swimming-and-diving/roster Women’s Volleyball Team https://athletics.baruch.cuny.edu/sports/womens-volleyball/roster The height of each player is listed on all web pages.1. Scrape the heights of all the players on the men’s swimming team and find the average. 2. Scrape the heights of all the players on the men’s volleyball team and find the average. 3. Scrape the heights of all the players on the…arrow_forward
- Can you provide me with the answers to the rest of the questions except Question 8, as shown in the images.arrow_forwardPlease prepare task as instructed by picture below.i will also add another picture as reference and guide to how it should be done Please prepare figures correctly, thank youarrow_forwardHow to implement CRUD methods on NPOI(deleting , Creating ,Updating) please help i have managed to import an excel file to a datagrid and now I want to be able to modify the data in itarrow_forward
- Scaling must be automatically setup.arrow_forwardSubject: Clarification on Project Submission for Dataset 2: Modified Adult Salary Dataset - Binary Classification Model I am working on the project for the "Modified Adult Salary Dataset - Binary Classification Model" and would appreciate your assistance in doing this project , and meeting all the requirements. I am focusing on the following aspects: Data Exploration: Gaining insights into the dataset, identifying potential issues such as missing values, outliers, or class imbalance. Data Visualization: Utilizing the appropriate techniques to visualize relationships and distributions in the data. Data Preprocessing: Handling any data inconsistencies, performing encoding and scaling as needed, and considering methods to balance the dataset. Model Selection and Tuning: Implementing neural network models, performing hyperparameter tuning, and evaluating the performance of the best model. If there are any areas that need further clarification or improvement, I would appreciate your…arrow_forward
arrow_back_ios
arrow_forward_ios
Recommended textbooks for you
- Database Systems: Design, Implementation, & Manag...Computer ScienceISBN:9781305627482Author:Carlos Coronel, Steven MorrisPublisher:Cengage LearningCOMPREHENSIVE MICROSOFT OFFICE 365 EXCEComputer ScienceISBN:9780357392676Author:FREUND, StevenPublisher:CENGAGE LNp Ms Office 365/Excel 2016 I NtermedComputer ScienceISBN:9781337508841Author:CareyPublisher:Cengage
Database Systems: Design, Implementation, & Manag...
Computer Science
ISBN:9781305627482
Author:Carlos Coronel, Steven Morris
Publisher:Cengage Learning
COMPREHENSIVE MICROSOFT OFFICE 365 EXCE
Computer Science
ISBN:9780357392676
Author:FREUND, Steven
Publisher:CENGAGE L
Np Ms Office 365/Excel 2016 I Ntermed
Computer Science
ISBN:9781337508841
Author:Carey
Publisher:Cengage