Question

The Traffic Violations Dataset contains more than 65,000 traffic-related violation records. Its attributes include the date of the violation, the time of the violation, country name, gender of the violator (M for male, F for female), age of the violator, race of the violator, category of violation, whether a search was conducted, the result of the violation, arrest information, how long the violator was detained during the stop, and whether drugs were involved.

You can download the dataset traffic_violaions.csv from the following link:

https://www.kaggle.com/shubamsumbria/traffic-violations-dataset  

Write a summary of your understanding of the purpose and contents of this dataset and your assessment of the quality of the data. To do this, you must develop code to explore the data programmatically in a notebook and provide it as part of your answer.  
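A first exploratory pass could look like the minimal sketch below. It uses only the Python standard library so that it runs in any notebook; pandas would be an equally reasonable choice. The helper names `load_rows` and `summarize_columns` are hypothetical, and the column names in the inline sample (`stop_date`, `driver_gender`, `driver_age`, `violation`) are assumptions for illustration that should be checked against the header of the downloaded file.

```python
import csv
import io
from collections import Counter


def load_rows(path):
    """Read a CSV file into a list of dicts, one dict per record."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def summarize_columns(rows):
    """Return (row_count, {column: {"missing", "distinct", "top"}})."""
    rows = list(rows)
    columns = list(rows[0].keys()) if rows else []
    summary = {}
    for col in columns:
        # Treat None and whitespace-only cells as missing.
        values = [(row.get(col) or "").strip() for row in rows]
        present = [v for v in values if v != ""]
        counts = Counter(present)
        summary[col] = {
            "missing": len(values) - len(present),
            "distinct": len(counts),
            "top": counts.most_common(3),  # most frequent values
        }
    return len(rows), summary


# Tiny inline sample with ASSUMED column names, standing in for the real file.
SAMPLE = """stop_date,driver_gender,driver_age,violation
2005-01-02,M,20,Speeding
2005-01-18,M,,Speeding
2005-01-23,F,31,Equipment
"""

sample_rows = list(csv.DictReader(io.StringIO(SAMPLE)))
n_rows, overview = summarize_columns(sample_rows)

for column, stats in overview.items():
    print(column, stats)
```

For the real data, `load_rows("traffic_violaions.csv")` would replace the inline sample, assuming the file sits next to the notebook.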

Write a summary (~300 words) in a word-processing document that includes the following:

  • The contents of the above dataset with detailed description
  • The quality of the data with respect to validity, accuracy, completeness, consistency and uniformity
  • An estimate of the amount of dirtiness of each type in the data, with a discussion of its potential impact on the goal of the analysis
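
The dirtiness estimate asked for in the list above could be approached along the lines of the following sketch. It reports, as percentages, missing cells (completeness), out-of-range or non-numeric ages and unexpected gender codes (validity), dates that do not match a single format (uniformity), and fully duplicated records (consistency). The function name `dirtiness_report`, the column names, the date format, and the plausible age range are all assumptions chosen for illustration, not properties confirmed by the dataset. Accuracy is not covered, since it cannot be measured without an external source of truth.

```python
from datetime import datetime


def pct(part, whole):
    """Percentage of part in whole, rounded to one decimal place."""
    return round(100.0 * part / whole, 1) if whole else 0.0


def dirtiness_report(rows, age_col="driver_age", gender_col="driver_gender",
                     date_col="stop_date", date_format="%Y-%m-%d",
                     age_range=(15, 100)):
    """Estimate the share of dirty data per quality dimension (in percent)."""
    total = len(rows)
    total_cells = sum(len(row) for row in rows)
    missing_cells = sum(
        1 for row in rows for v in row.values()
        if v is None or str(v).strip() == ""
    )

    low, high = age_range
    invalid_age = invalid_gender = bad_dates = 0
    for row in rows:
        age = (row.get(age_col) or "").strip()
        if age:  # missing values are counted under completeness only
            try:
                if not (low <= float(age) <= high):
                    invalid_age += 1
            except ValueError:
                invalid_age += 1

        gender = (row.get(gender_col) or "").strip()
        if gender and gender not in {"M", "F"}:
            invalid_gender += 1

        date = (row.get(date_col) or "").strip()
        if date:
            try:
                datetime.strptime(date, date_format)
            except ValueError:
                bad_dates += 1

    # Records that are identical in every field count as duplicates.
    unique = {tuple(str(v) for v in row.values()) for row in rows}

    return {
        "missing_cells_pct": pct(missing_cells, total_cells),
        "invalid_age_pct": pct(invalid_age, total),
        "invalid_gender_pct": pct(invalid_gender, total),
        "nonuniform_date_pct": pct(bad_dates, total),
        "duplicate_rows_pct": pct(total - len(unique), total),
    }


# Hand-made records (not taken from the dataset) that exercise each check.
demo_rows = [
    {"stop_date": "2005-01-02", "driver_gender": "M", "driver_age": "20"},
    {"stop_date": "2005-01-02", "driver_gender": "M", "driver_age": "20"},
    {"stop_date": "18/01/2005", "driver_gender": "F", "driver_age": ""},
    {"stop_date": "2005-01-23", "driver_gender": "female", "driver_age": "7"},
]
report = dirtiness_report(demo_rows)
print(report)
```

Each percentage can then be weighed against the analysis goal: for example, a high share of missing ages would limit any age-based comparison far more than a handful of duplicated rows would.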