The list of projects that each employee worked on. The average award received by each employee. The maximum and minimum monthly awards received by each employee. The project ID and language for which each employee received the maximum award. The name of the employee who received the maximum total award within the 10 years. The number of projects that each employee did not work on.

import pandas as pd
import numpy as np

# Shortened link to the CSV dataset given in the question.
URL = "https://bit.ly/38zhU5T"
df = pd.read_csv(URL)
 
Note: I think the data should be cleaned before performing the tasks (please include the data-cleaning steps in the solution as well).
 
The data engineer is given a dataset that stores details of all projects developed within the last ten years by four employees in the software design and implementation department of an IT company. The dataset (URL = "https://bit.ly/38zhU5T") contains the following information:
• Project ID represents the identifier of each project.
• Language represents the programming language used to develop the project.
• Employee 1, Employee 2, Employee 3, and Employee 4 are the four employees in the department. The entry value of these columns can be:
  o 0 when the employee did not work on the project.
  o 1 when the employee worked on the project.
  o 2 when the employee partially worked on the project. Note that an employee who partially worked on a project received half of the award.
• Award represents the financial reward for working on a particular project.
• Currency represents the currency of the financial reward.
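The participation codes above can be turned into award shares. The following is a minimal sketch, not the expected solution: the sample rows are invented, the column names are assumed to match the description exactly, and it assumes each full participant receives the whole Award (the question does not say whether the award is split among employees). Currency conversion is ignored here.

```python
import pandas as pd

# Hypothetical sample rows; the real dataset's values (and possibly its
# exact column names) may differ.
sample = pd.DataFrame({
    "Project ID": ["P1", "P2", "P3"],
    "Language": ["Python", "Java", "C++"],
    "Employee 1": [1, 0, 2],
    "Employee 2": [0, 1, 1],
    "Employee 3": [2, 2, 0],
    "Employee 4": [1, 1, 1],
    "Award": [1000.0, 500.0, 800.0],
    "Currency": ["USD", "USD", "USD"],
})

EMPLOYEES = ["Employee 1", "Employee 2", "Employee 3", "Employee 4"]

# 0 = did not work, 1 = worked (full award), 2 = partially worked (half award).
SHARE = {0: 0.0, 1: 1.0, 2: 0.5}


def award_per_employee(df):
    """Return a frame with the award each employee received per project."""
    shares = df[EMPLOYEES].apply(lambda col: col.map(SHARE))
    # Multiply each row of shares by that row's Award value.
    return shares.mul(df["Award"], axis=0)


received = award_per_employee(sample)
print(received)
```

Codes outside 0, 1, and 2 map to NaN under this approach, which makes invalid entries easy to spot during cleaning.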
You are asked, as a data engineer, to prepare an accurate report for the project manager. In addition, you are supposed to report all problems in the dataset as in the following example:

Problem     Col name / Whole dataset   Number of defective rows   Ratio of defective rows
Problem 1   Store ID                   1500 rows                  37.5%

Here, the number of defective rows is the count of rows that show the problem, and the ratio of defective rows is the percentage of defective rows relative to the total number of rows in the dataset.
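One way to build such a problem report is to express each problem as a boolean mask over the rows and derive the count and ratio from it. This is only a sketch under assumptions: the `raw` frame is invented sample data, `defect_row` is a hypothetical helper, and the three checks shown (missing values, invalid participation codes, duplicate rows) are examples rather than the actual problems in the dataset.

```python
import pandas as pd

# Hypothetical messy sample; the real dataset may show different problems.
raw = pd.DataFrame({
    "Project ID": ["P1", "P2", "P2", "P3"],
    "Employee 1": [1, 0, 0, 5],
    "Award": [None, 500.0, 500.0, None],
})


def defect_row(df, problem, column, mask):
    """Summarise one problem: how many rows show it and what share that is."""
    count = int(mask.sum())
    return {
        "Problem": problem,
        "Col name / Whole dataset": column,
        "Number of defective rows": count,
        "Ratio of defective rows": f"{100 * count / len(df):.1f}%",
    }


report = pd.DataFrame([
    defect_row(raw, "Missing value", "Award", raw["Award"].isna()),
    defect_row(raw, "Invalid code", "Employee 1",
               ~raw["Employee 1"].isin([0, 1, 2])),
    defect_row(raw, "Duplicate row", "Whole dataset", raw.duplicated()),
])
print(report)
```

Keeping each check as a mask also makes the cleaning step straightforward, since the same mask can be used to drop or repair the affected rows.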
• The list of projects that each employee worked on.
• The average award received by each employee.
• The maximum and minimum monthly awards received by each employee.
• The project ID and language for which each employee received the maximum award.
• The name of the employee who received the maximum total award within the 10 years.
• The number of projects that each employee did not work on.
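Most of the requested figures can be derived per employee once the data is clean. The sketch below uses invented sample rows with only two employees, assumes the column names from the description, and assumes the average is taken over the projects an employee actually worked on. The monthly maximum and minimum are left out because the description does not mention a date column to group by.

```python
import pandas as pd

# Hypothetical cleaned sample; real values and the employee count differ.
df = pd.DataFrame({
    "Project ID": ["P1", "P2", "P3"],
    "Language": ["Python", "Java", "C++"],
    "Employee 1": [1, 0, 2],
    "Employee 2": [0, 1, 1],
    "Award": [1000.0, 500.0, 800.0],
})
EMPLOYEES = ["Employee 1", "Employee 2"]
SHARE = {0: 0.0, 1: 1.0, 2: 0.5}  # code 2 means half of the award

summary = {}
for emp in EMPLOYEES:
    worked = df[df[emp] != 0]                        # projects with code 1 or 2
    received = worked["Award"] * worked[emp].map(SHARE)
    best = worked.loc[received.idxmax()]             # row of the largest award
    summary[emp] = {
        "projects": worked["Project ID"].tolist(),
        "average_award": received.mean(),
        "best_project": (best["Project ID"], best["Language"]),
        "total_award": received.sum(),
        "not_worked": int((df[emp] == 0).sum()),
    }

# Employee with the largest total award over the whole period.
top_employee = max(summary, key=lambda e: summary[e]["total_award"])
print(summary)
print(top_employee)
```

An employee with no projects at all would make `idxmax` fail, so the real solution may need a guard for that case, and awards in different currencies would need converting before any comparison.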