Data cleaning

A continuous process that requires corrective actions throughout the data lifecycle.

Also, the process of detecting and correcting corrupt or inaccurate records in a dataset. Data cleaning involves identifying, replacing, modifying, or deleting incomplete, incorrect, inaccurate, inconsistent, irrelevant, or improperly formatted data. Typically, the process involves updating, correcting, standardizing, and de-duplicating records to create a single view of the data, even when the records are stored in multiple disparate systems.
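
As an illustration of the standardizing and de-duplicating steps described above, here is a minimal Python sketch. The field names (name, email), the two source lists (crm, billing), and the helper functions standardize and clean are hypothetical and chosen only for this example. Dropping records with no email and keeping the first occurrence of each email are assumptions, one possible policy among many.

```python
def standardize(record):
    """Trim whitespace and normalize case so equivalent values compare equal."""
    return {
        # Collapse repeated whitespace, then apply title case (hypothetical rule).
        "name": " ".join(record.get("name", "").split()).title(),
        # Email is treated as case-insensitive here (an assumption).
        "email": record.get("email", "").strip().lower(),
    }


def clean(records):
    """Standardize records, skip incomplete ones, and de-duplicate by email."""
    seen = {}
    for raw in records:
        rec = standardize(raw)
        if not rec["email"]:
            continue  # incomplete record: no key to identify it
        # Keep the first occurrence of each email; later duplicates are ignored.
        seen.setdefault(rec["email"], rec)
    return list(seen.values())


# Records for the same person held in two hypothetical systems.
crm = [{"name": "  ada   lovelace ", "email": "Ada@Example.com "}]
billing = [
    {"name": "Ada Lovelace", "email": "ada@example.com"},
    {"name": "Grace Hopper", "email": ""},
]

cleaned = clean(crm + billing)
print(cleaned)
```

In this sketch the two Ada Lovelace entries collapse into one record after standardization, and the entry with an empty email is discarded as incomplete. A real pipeline might instead flag incomplete records for correction rather than delete them.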