Documentation For Quality Data Preparation ( Federal Aviation Administration ) Data Downloads, N.d Essay

1166 Words5 Pages
Introduction This assignment utilizes one of the file sets in the FAA database, Aircraft Series, to demonstrate a proposed process for quality data preparation (Federal Aviation Administration - Data Downloads, n.d.). This documentation includes a process overview, a description of the data files, two attempts at cleaning the data, and validation and standardization of the resulting content. A short discussion of data integrity, data validation, data governance, and documentation follows including recommendations to overcome the challenges encountered. Process Overview Per the advice of Robin Hunt’s video tutorial, the exercise began with a notional workflow diagram (2015). After two attempts, the succeeding process consolidated into a series of four phases, each described in the following sections. The complete diagram appears below. Figure 1. Assignment Workflow Diagram Select Data The FAA database provided the source data. The data cleaning process required downloading two text formatted files in the subject area of Accident/Incident data: the source data file and a document describing the layout of the source data file. The website download process appeared similar to the figure below: Figure 2. FAA Data Download Web Page On first inspection, the downloaded source data file appeared to only contain text characters similar to as shown below. Figure 3. Aircraft Series Source Data File The data contained 4959 rows (as measured by Notepad ++) and

More about Documentation For Quality Data Preparation ( Federal Aviation Administration ) Data Downloads, N.d Essay

Open Document