
Question

import pandas as pd

dataframe = pd.read_csv('https://raw.githubusercontent.com/Explore-AI/Public-Data/master/Data/regression_sprint/titanic_train_raw.csv')

df = pd.read_csv('https://raw.githubusercontent.com/Explore-AI/Public-Data/master/Data/regression_sprint/titanic_test_raw.csv')

Write a function that takes a DataFrame and a column name as input, and returns the mean for numerical columns and the mode for non-numerical columns.

Function Specifications:

The function should take two inputs, (df, column_name), where df is a pandas DataFrame and column_name is a str.

If column_name does not exist in df, raise a ValueError.

The function should return the mean if the specified column is numerical, and a list of the mode(s) otherwise.

The mean should be rounded to 2 decimal places.

If there is more than one mode for a given non-numerical column, the function should return a list of all modes.

 

def calc_mean_mode(df, column_name):
    # your code here
    return

calc_mean_mode(df, 'Age')

Expected Outputs:

calc_mean_mode(df, 'Age') == 29.7

calc_mean_mode(df, 'Embarked') == ['S']
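
One possible way to meet the specification above is sketched below. It is a minimal sketch, not the reference solution: it assumes that "numerical" means whatever `pandas.api.types.is_numeric_dtype` reports as numeric (which includes boolean columns), and that missing values should be ignored, which is the default for both `Series.mean()` and `Series.mode()`. The `demo` DataFrame is made-up illustration data, not the Titanic set.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype


def calc_mean_mode(df, column_name):
    """Return the rounded mean of a numeric column, or a list of modes otherwise."""
    if column_name not in df.columns:
        raise ValueError(f"Column '{column_name}' does not exist in the DataFrame")

    column = df[column_name]
    if is_numeric_dtype(column):
        # Series.mean() skips NaN by default; round to 2 decimal places.
        return round(float(column.mean()), 2)

    # Series.mode() returns every tied most-frequent value (NaN dropped by default).
    return column.mode().tolist()


# Small made-up frame (an assumption for illustration) that exercises each branch.
demo = pd.DataFrame({
    "Age": [22.0, 38.0, None, 35.0],
    "Embarked": ["S", "C", "S", "C"],
})

print(calc_mean_mode(demo, "Age"))       # mean of 22, 38 and 35, rounded
print(calc_mean_mode(demo, "Embarked"))  # both tied modes, in sorted order
```

Calling the same function as `calc_mean_mode(df, 'Age')` on the test set loaded above is the check described under Expected Outputs.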
