Ask a question from expert

Ask now

Data cleansing Workflow | Assignment

5 Pages973 Words180 Views
   

Added on  2021-06-15

Data cleansing Workflow | Assignment

   Added on 2021-06-15

BookmarkShareRelated Documents
Data Cleaning: Work FlowThe data from the three sheets were first collated into one and the data was processed to make the data useful for analysis.Source FilesWestdaleLibraryDataISPPA2Feb2018Data not Parsed Correctly. Dates were incorrectly entered in Column “ACCESSION NUMBER”. The data was removed from the “Accession Column” to Formatted the Column Using “Format Cell --> Date-“In Column “LANG” , all different spellings of language “Noongar” were filtered, using “Filter” tab ; and “Replace” Command was used to replace them with Noongar. Similarly, “Wiradjuri” was used to replace the different variations of the spelling.An Additional Column named “CLIENT” was created. Which contained the cocanated data from “CLIENT_Second_Name” and “CLIENT_First_Name:.The following Columns were Copy Pasted to the Respective colums in the Destination __DataTable 1 COLUMNS INTERCHANGEDSource SheetDestination SheetDateDATEThing NumberUSNLanguage groupLANGCOLOURM_NAMEMATERIALMATERIALFABRICFABRICCOSTPRICE (AUD)MAKERM_NAMECITYM_CITYSUBURBM_SUBURBZIP CODEM_PCNICKNAMEM_NAME
Data cleansing Workflow | Assignment_1
MarciaBradyDataISPPA2Feb2018Formatted the “DATE” Column Using “Format Cell --> Date-“Data was not parsed properly. The numeric characters were manually removed and patsted in the empty column to the left. The empty colum was names as “Price(AUD)In Column “Address” , the City and suburb name were together and not parsed. Another column was added, naed as “suburb” and suburb values from the address column were cut and paste manually. In Column “Language group”, only initials were mention. Hence, the initial “N” was replaced by “NOONGAR” Similarly, “Wiradjuri” was used to replace the initial W, usingthe Replace command. “Westd. Council” was replaced by “Westdale City Council”.In the column, “Made_for” proper names were added to replace common nouns.The following Columns were Copy Pasted to the Respective colums in the Destination __DataTable 2 COLUMNS INTERCHANGEDSource SheetDestination SheetDATEDATEPrice (AUD)PRICE (AUD)Thing NumberUSNLanguage groupLANGCOLOURCOLOURMATERIALMATERIALFABRICFABRICMAKERM_NAMECITYM_CITYSUBURBM_SUBURBNameS_NICKNAMEMADE_FORCLIENTDonorAuctionCatalogueDataISPPA2Feb2018Data from Columns CLIENT_First_Name and CLIENT_Second_Name were merged together.The following Columns were Copy Pasted to the Respective colums in the Destination __DataSource SheetDestination SheetDateDATE
Data cleansing Workflow | Assignment_2

End of preview

Want to access all the pages? Upload your documents or become a member.