Assignment 4 Linear Regression v3

.docx

School

Howard University *

*We aren’t endorsed by this school

Course

MISC

Subject

Statistics

Date

Jan 9, 2024

Type

docx

Pages

13

Uploaded by Dillah1

Report
[Student Name] DATA 320 Module 4 Linear Regression Model Development using Power BI Name Overview For this assignment, you will Conduct Exploratory Data Analysis (EDA) to prepare data for further analysis. Analyze data for relationships and/or trends using PowerBI. Create three visualizations to explain the data. Use the template to answer questions about the data anomalies and your findings. Create a linear regression model to perform a prediction. Write a letter to future students in your university providing them with data-driven advice. Scenario You are a university student, and you are trying to understand the best ways to succeed. The professor has been offering study sessions to help the students and has asked each student to keep track of the number of minutes they spent on the last assignment. Your data analytics professor has written a survey with the following questions: 1) How many years have you been in school? 2) Are you a full-time or part-time student? 3) What is the name of your success coach? 4) What degree are you pursuing? 5) Did you attend the last study session? 6) How many minutes did you devote to the last assignment? 7) What grade did you get on the assignment (out of 100) The system automatically generated an ID number. The professor provides you with the data (attached) and asks that you do the following: 1) List the questions and goals for the project 2) Explore the raw data. What are the fields, and how many records? Find any anomalies and fix the data. 3) Perform a trend analysis 4) Create at least 3 visualizations 5) Create a linear regression 6) Understand the implications of the model you created 7) Write a letter to future students in the class giving them advice on how to succeed using the output from the linear regression. Data 320 Assignment 4 Linear Regression 1
[Student Name] Exploratory Data Analysis Phase Initial Questions Answers How would you state the problem you are trying to solve? What are the project goals? What questions are you trying to answer? Are there more questions that you can think of other than what is in the scenario? Who is your audience? Are there additional stakeholders/decision-makers? Explore the raw data, anomaly detection, and transformation Answers/Results Load the Excel file into Power BI Desktop *if you are using the Virtual Lab (VDA) to access Power BI desktop, follow the instructions- Loading files and Publishing Power BI in the UMGC Virtual Lab found in the classroom. Click on the “Student Survey” worksheet and click Transform Data. You should now be in Power Query Editor Click on View and ensure that “column quality”, “column distribution”, and “column profile” are checked. Go to the menu in the bottom left corner, how many columns do you have? Data 320 Assignment 4 Linear Regression 2
[Student Name] How many distinct rows of data do you have? Do any of your fields have missing or empty data? Which one (s)? Look closely at each field, is there any unusual data that might need to be removed? You checked with your professor and told them about the data anomalies that you discovered, they told you to ignore the missing records. However, they suggest deleting the row with strange data. You need to return to Excel, delete the row that has the strange data. Then open that new file in Power BI again. Now, how many rows of data do you have? For each field, what kind of data do you see (categorical or continuous)? What do you think each field means? Refer to the scenario for help. <<Enter your columns in the tables that follow these instructions. Add or remove any rows to the tables as needed>> Is there a field that doesn’t look like categorical OR continuous (numerical)? Examination of Categorical Fields (Click on the field and use the column statistics to collect this data). Name of field Brief description Number of categories Examination of Continuous (Numerical) Fields (Click on the field and use the column statistics to collect this data). Round to two decimals. Name of field Brief description Minimum value Maximum value Average Standard deviation Data 320 Assignment 4 Linear Regression 3
[Student Name] Trend Analysis Step Answers/Results Before you get into the data, using only your common sense and experience, what factors do you think would lead to a higher grade on the assignment? After looking at the columns, which field do you think is the target variable that we want to study? Click the “Close and Apply” button in Data Transformation. We will return to this after you have spent some time analyzing the data. You should now be in the Power BI Desktop main window. Click on the Key Influencer icon in the Visualizations Move “Grade” into the Analyze box and move “Total Minutes Spent” into the Explain By box. Do the same step above for other fields to see what happens with “Grade” Think back to your top reasons why you think students would get good grades- did the data SUPPORT or CONTRADICT your thoughts? Which ones? Looking at the influencers the data found, what are some of the factors influencing the student’s grade? In plain language, how would you explain the key influencers you found? Use the Key Influencer to analyze “Grade” only using the “Total Minutes spent on the Assignment” as the explain by variable. What values did you get? Data 320 Assignment 4 Linear Regression 4
[Student Name] Visualization Using your knowledge about data visualizations, create at least three visualizations that help to show or explain something significant about the data. Create a new page for each visualization. Regression Analysis Step Answers/ Results Create a new page. Create a scatterplot visualization with Total Minutes Spent on Assignment in the X- Axis and Grade in the Y-Axis. Change the aggregation defaults from SUM to Don’t Summarize for both. Look at the scatterplot created- what is a general statement you can say about this graph? From the Analytics pane- add a Trend Line Data 320 Assignment 4 Linear Regression 5
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help