Assume that you are a data scientist at Amazon. Because the company is celebrating its Silver Jubilee this year, it has decided to reward its customers. Your manager has handed over the last 2 years of retail data and asked you to complete certain tasks. The tasks are as follows:
Task 1-1: When you started working with the data, you realized that it needs cleaning to produce better results. Perform essential data cleaning. The final output should be as follows:
before cleaning: any negatives?: True
after cleaning: any negatives?: False
Q: C. Assume the following sample of data: Team TeamID 1001 | 1002 1003 | 1004 City Win TeamName…
A: Hey there, I am writing the required solution based on the above given question. Please do find the…
Q: d the data into a pandas dataframe named data_firstname where first name is your name. 2. Carry out…
A: Let's see the solution.
Q: 1.What is the purpose of exploring data? a. To generate labels for your data. b. To gain a better…
A: Hello Student. Warm Welcome from my side. Hope you are doing great. I will try my best to answer…
Q: a) What is the primary difference between a data scientist and a data engineer? b) Which of the…
A: a). Data Engineer: The data engineer is someone who develops, constructs, tests and maintains…
Q: Now, I'm working on the outline of my project. It's about solving a problem at work. The topic of…
A: The process of ensuring that data is correct, consistent, and useable is known as data cleansing.…
Q: What data science topics are you interested in learning more about? How might you apply some of the…
A: INTRODUCTION: Following are the general topics in which most of the persons are interested:…
Q: Q1) Design a football matches database system using relational data model and then using SQL…
A: Given two questions are not related. So, as per our guidelines, only one question will be answered…
Q: How do you introduce structure into text-based data. Identify alternative ways of inducing structure…
A: Include Structure in Text Data: Text mining can be defined as the application of a data mining…
Q: A: Make a list of 5 internal data (specific data points such as “first name” and “sales per day”)…
A: 1) Total sales of the day :- This will tell the profits and the sales of each category items.…
Q: w Write a new subroutine to clear all the changes done on the original Data worksheet. Place a…
A: Q. I am Using Excel VBA and I am curious to an example of code that would fit the question, if there…
Q: Is it ethical to relegate individual data to being a commodity to be used by a company at their…
A: Is it ethical to relegate individual data to being a commodity to be used by a company at their will
Q: scenario is described below where data mining might be applied. Indicate what data mining algorithm…
A: I) A data mining (or machine learning) algorithm is a series of heuristics and calculations to…
Q: Continuing with YouTube and sales, fill in the blanks below to create a regression table output.…
A: Given:
Q: You have been asked to help one of the data analysts who is working in your team. The data analyst…
A: In the case of missing data, the value cannot be observed, but would be meaningful if observed. For…
Q: Python has been used in numerous areas worldwide, such as website making, artificial intelligence,…
A: The data structure in python information system:- It is important to organize, manage and save data…
Q: Senior management has asked you to research a Big Data topic. They would like you to choose one of…
A: We're looking at the healthcare industry because it generates a lot of data and is primarily driven…
Q: Is there any potential for data mining with the Fitch Wood data mart? Where can I find some data…
A: The Fitch Wood data mart has the potential to facilitate a significant amount of data mining. For…
Q: Load & check the data: 1. Load the data into a pandas dataframe named data_firstname where first…
A: As per policy, in case of multiple questions, we will answer the first question.
Q: Load & check the data: 1. Load the data into a pandas dataframe named data_firstname where first…
A: As per Bartleby policy, we are answering the first 3 parts.
Q: Film producers want to show the work of analysts. They do not always do it well. Sometimes it ends…
A: Given a movie Database and asked to write SQL query to retrieve movie data, where the director and…
Q: marketing company would like to know if varsity (college) swimmers are (on average) taller than…
A: import requests; from bs4 import BeautifulSoup; URL1 =…
Q: What do you consider to be the most effective way to back up user data? Your solution has to be…
A: Answer: It is the process of digitizing information and documents and storing them in a storage…
Q: a) Elaborate in your own words the operation in the figure 1. below. O Integration Understanding…
A: DATA MINING STAGES (KNOWLEDGE DISCOVERY IN DATABASES(KDD)) The iterative process consists of the…
Q: Load & check the data: 1. Load the data into a pandas dataframe named data_firstname where first…
A: Here is the code: from sklearn.datasets import load_breast_cancer # Step 1data =…
Q: Student with the following attributes: Student_Id, student full name, average mark and course_id.…
A: In questions with many individual demands, we must answer the first three. All the classes with…
Q: Film producers want to show the work of analysts. They do not always do it well. Sometimes it ends…
A: Selecting the movies that had at least two directors where the movie has an actor Michael Jackson…
Q: Which one of the following is the important process used to know more about data? O a. Machine…
A: Machine learning - It is the science of making computer able to make decisions without any hardcode…
Q: Please discuss your feelings about data transformation. Was this the first time that you had heard…
A: Data is transformed to make it better organized. Transformed data might be easier for the…
Q: When working with big volumes of data, you inevitably run into a number of problems. What are some…
A: Meaning: In an enterprise, big data refers to a large amount of data, which may be organised or…
Q: You are asked to do some research, and write a report that answers the following questions about Big…
A: Answer : 1. What is Big Data? Define. Big Data is a phrase used to mean a massive volume of both…
Q: I need help with this question (writing code). Question: The management also wants you to run…
A: We can use the built-in mean(), median(), var(), std() to get the mean, median, variance and standard…
Q: How would you organize this flat file for the collection of data?
A: A database stored in a single file is called a flat-file database. Now the question is how can you create a…
Q: Datasets: Dataframes flights, airlines in the nycflights13 package. Other data can be integrated…
A: Dataframes flights, airlines in the nycflights13 package. Other data can be integrated when needed.…
Q: services to our student needs. Now you are tasked to create data warehouse using the data generated…
A: Question: a. Facts are defined as the numbers or numerical values required to describe the…
Q: Compare and contrast the benefits and drawbacks of batch versus online data input methods. There is…
A: Batch input methods, such as filling out a form by hand, are a much more tedious and time-consuming…
Q: A marketing company would like to know if varsity (college) swimmers are (on average) taller than…
A: Given: According to the given link in the question the male swimming team with their height are as…
Q: Please discuss your feelings about data transformation. Was this the first time that you had heard…
A: Data transformation is the process of changing the format, structure, or values of…
Q: Which is the correct order of steps to Data Processing? Ask an Interesting Question Design a…
A: The six stages of data processing are - 1. Data collection 2. Data preparation 3. Data input 4.…
Q: Please re-purpose the following financial data collecting script to gather stock trading data in the…
A: Public Market Equivalent (PME) is a set of analyzes used in the private equity industry to compare…
Q: Write the strategies for data reduction.
A: The Answer is in Below Steps
Q: Pretend that you are working for Del Monte in the area of canned vegetables and that Del Monte has a…
A: INTRODUCTION: Here we need to tell types of methods and analyses would these decisions in part a…
Q: (Data exploration and Mining Method Proposal): Here you will explore your data both visually and/or…
A: Note: Answering in python as no language is mentioned. Task : Load the dataset. Add statistical…
Q: Is there any way to use the Fitch Wood data mart for data mining? There are also some data mining…
A: Answer: There is a vast amount of data mining that could be done using Fitch wood data mart. For…
What would the Python code for this be?
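A minimal sketch for Task 1-1, assuming the retail data is loaded into a pandas DataFrame and that "essential cleaning" means dropping duplicate rows, rows with missing values, and rows with a negative value in any numeric column. The column names (`InvoiceNo`, `Quantity`, `UnitPrice`), the sample rows, and the helper names `any_negatives` and `clean_retail` are hypothetical; the actual dataset is not shown in the question.

```python
import pandas as pd


def any_negatives(df: pd.DataFrame) -> bool:
    """Return True if any numeric column contains a negative value."""
    numeric = df.select_dtypes(include="number")
    return bool((numeric < 0).any().any())


def clean_retail(df: pd.DataFrame) -> pd.DataFrame:
    """Drop duplicates, rows with missing values, and rows with negatives."""
    cleaned = df.drop_duplicates().dropna()
    numeric_cols = cleaned.select_dtypes(include="number").columns
    # Keep only rows where every numeric column is non-negative.
    cleaned = cleaned[(cleaned[numeric_cols] >= 0).all(axis=1)]
    return cleaned.reset_index(drop=True)


# Hypothetical sample standing in for the real retail data.
data = pd.DataFrame({
    "InvoiceNo": ["A1", "A2", "A3", "A3", "A4"],
    "Quantity": [5, -2, 3, 3, 1],
    "UnitPrice": [2.5, 1.0, 4.0, 4.0, None],
})

print("before cleaning: any negatives?:", any_negatives(data))
cleaned = clean_retail(data)
print("after cleaning: any negatives?:", any_negatives(cleaned))
```

In real retail data, negative quantities often represent returns or cancellations, so whether to drop those rows or handle them separately is a judgment call that depends on the later tasks.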
- In Assignment 5, our focus was on the Data & Processes that act on it. This week’s assignment focuses on Actions & Events. In a typical mobile banking application, users log in to the app; the login may be successful or unsuccessful. For unsuccessful logins, the users can either request login or reset their password and attempt logging back in again. If the login is successful, the users can then perform various transactions (read actions). For example, the users can transfer money internally to another customer or transfer money to another account they own. They may also transfer money to an external bank. The users can also view their current account balance (they may have more than one account). They can also deposit their check using the Mobile Check deposit feature on the app. They can contact customer service using either the chat feature, a callback request, or email. Some of the shapes and symbols you will need for this assignment are as follows: Start Point/Initial…
- Which of the studied data structures in this course would be the most appropriate choice for the following tasks, and why? To be submitted through Turnitin. Maximum allowed similarity is 15%. A Traffic Department needs to keep a record of 3000 random new driving licenses. The main aim is to retrieve any license rapidly through the CPR Number. A limited memory space is available. A symbol table is an important data structure created and maintained by compilers in order to store information about the occurrence of various entities such as variable names, function names, objects, classes, interfaces, etc. The symbol table is used by both the analysis and the synthesis parts of a compiler to store the names of all entities in a structured form in one place, to verify if a variable has been declared, …etc.
- Datasets: Dataframes flights, airlines in the nycflights13 package. Other data can be integrated when needed. In this project, you will need to read in the given dataset in RStudio and then perform the following data analysis using R. Part I. Reading in the dataset and basic analysis. Part II. Visualizing relationships between pairs of variables. Part III. Manipulating/joining/transforming data. Part IV. Summarizing data. For each of the above four topics, please design 5 interesting questions/tasks, and run R commands to get the answers or complete the tasks.
- PLEASE SHOW ALL WORK AND COMMENT ALL CODE. The objective of this coding problem is the prediction of a proposed metro extension construction project based on the people's opinion. There are three alternatives to choose from: Eglington-Pickering Line, Airport-Vaughn Line, Airport-Hamilton Line. Each record is represented by 16 features. Task-1: Metro-Ext.xlsx is the training and test dataset; you will consider 80% of the data for training and 20% for the test. Build (1) Logistic regression, (2) KNN and (3) Naive Bayes models to predict on the test data set, compute the confusion matrix for each model, and compare the results. Deliverables = coding files (.py and .ipynb), and a discussion of the confusion matrix for both models. metro-EXT.xlsx (Please place chart in EXCEL): Feasibility and Constructability, Slopes and Gradients, Urban Realm, Geology and Soil Stability, Land Acquisition, Work Opportunities, Economy in Movement of People, Revenue Generation, Access to the Social,…
- This question tests your understanding of data transformation. We did an assignment on this process. If you test your data and find that it is not suitable for testing, which mathematical function can you put each score through to make it usable and suitable for testing? Select one: a. There is no way to make the data usable. b. Square root each number. c. Subtract the df from each number. d. Use the variance instead of each number.
- PHP: According to the table in the image: 1. (Assume that the following program section (Program Code 1) contains variables that receive input from the user via a form. Build program code that provides instructions for entering the data into the assessments table. In addition, build a link to view the data that has been entered.) <?php include('config.php'); $matrixnum = $_POST['matrixnum']; $subject_code = $_POST['subject_code']; $quiz = $_POST['quiz']; $assignment = $_POST['assignment']; $project = $_POST['project']; $total_marks = $_POST['total_marks']; $grade = $_POST['grade']; (Program Code 1) 2. (Based on Program Code 1, assume that the user wants to update the data. Build program code that provides instructions for updating the data in the assessments table. In addition, build a link to view the updated data.)
- The following is a sample report from a child clinic. It shows the days and treatments for visits to doctors at the clinic. A mother takes her child, sees a doctor and the doctor administers some treatment. There are a couple of things you should assume: the mother’s name is unique; a mother won't have more than one child with the same name; the doctor's surname is unique; a treatment code is unique; treatments are specific to doctors, e.g. 101 is only ever given by Johnson. Surname First Name Town Age Child Doctor Date Treatment Smith Jane Coventry 30 Rebecca Johnson 27.3.04 101 Brian Clarence 30.5.04 209 Robert Johnson 10.1.88 101 Brown Beryl Rugby 28 Alan Clarence 30.4.04 214 Sarah Johnson 29.5.04 101 Sarah Clarence 12.1.04 321 Jones Fiona Kenilworth 34 James Clarence 30.4.04 322 Jenny…
- Please help with Course: Database (SQL). Please execute the given SQL script (https://drive.google.com/file/d/1zxe_aOhERjVCL54_zbgSLkFTRHYQhOPW/view?usp=sharing) for accessing the data. The data is described in the following relation schemas: Airport (airportID, name, city). Passenger (ticketNo, name, nationality, flightNo, seatNo); FK: flightNo references Flight (flightNo); FK: seatNo references Seat (seatNo). Flight (flightNo, flightCompany, departAirport, arrivalAirport); FK: departAirport references Airport (airportID); FK: arrivalAirport references Airport (airportID). Seat (seatNo, flightNo, class); FK: flightNo references Flight (flightNo). Construct the SQL statements based on the following transactions: 1. Retrieve all rows in the Airport table for all the airports in London city. 2. Retrieve all British and German passengers. 3. Retrieve all names of all the passengers.
- Using R lab, please provide the code and answer: Find the data set “HELPrct”. Construct side by side boxplots (can be horizontal or vertical) to compare the effect of homeless or housed on the average number of drinks per day for the patients. Add title “XXXXXX – Side by side Boxplot to compare the effect of homeless on average number of Drinks”. Based on the side by side boxplot, write a simple conclusion. Find the data set “HELPrct”. Construct side by side boxplots (can be horizontal or vertical) to compare the effect of different race groups on the average number of drinks per day for the patients. Add title “XXXXXX – Side by side Boxplot to compare the effect of race groups vs on average number of Drinks”. Based on the side by side boxplot, write a simple conclusion.
- The following statement shows the breakdown of the Survived column into 0 (not survived) and 1 (survived). Create a similar statement using the groupby function to break it down further by Sex. In other words, add Sex as an additional dimension. The desired output should be similar to the following: Sex Survived female 0 81 1 233 male 0 468 1 109 Name: Survived, dtype: int64
- Consider a school system in which data is recorded about the movie industry. The data requirements are summarized as follows: Each subject is owned by a department and each department is managed by a teacher, called Head of Department (HOD). Each department has a name and a unique code number. Each subject has a name, unique number and credit value. All teachers have a unique ID. A HOD cannot manage more than one department at a time. Also, a HOD does not teach subjects. A subject cannot be taught by more than one teacher. Finally, the system needs to know each student’s contact details, including full name, student ID, address, age and gender, as well as date of enrolment in the subject. Design an Entity-Relationship diagram for the movie database.
- Your department is interested in keeping track of crucial information. Create a data structure that will keep important information for your department. Of course, the list of majors should be sorted by last name (and then by first, if there are multiple students with the same last name).
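For the question about keeping the department's list of majors sorted by last name and then first name, one possible shape is a list of small records sorted with a two-part key. This is only a sketch: the `Major` class, its fields, the `sorted_majors` helper and the sample names are hypothetical, since the question does not say which details the department needs to store.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Major:
    """One student majoring in the department (fields are illustrative)."""
    last_name: str
    first_name: str
    student_id: int


def sorted_majors(majors):
    """Return majors ordered by last name, then first name (case-insensitive)."""
    return sorted(
        majors,
        key=lambda m: (m.last_name.lower(), m.first_name.lower()),
    )


# Hypothetical sample records.
majors = [
    Major("Smith", "Zoe", 3),
    Major("Adams", "Lee", 1),
    Major("Smith", "Amy", 2),
]

roster = sorted_majors(majors)
for m in roster:
    print(f"{m.last_name}, {m.first_name} ({m.student_id})")
```

Sorting on a tuple key handles the tie-break on first name without a second pass.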