As data clean up started it became apparent that all data files had not been updated either with initial measurements, or the latests year's data had not been appended to the data set, therefore further acquision of data files was needed.
Data files per company arrived exclusively in excel spread sheets with large headers and multiple tabbed sheets per file all of which needed to be separated. See example below of three different data sets.
All
data sets were reformatted with common units of measure for each
attribute and with column heading that were only 8 characters long to
that master flat file could be used in multiple software packages for
analysis. Final version of files was saved with CSV format. See below
for example of final version.