Data Cleaning

Data Cleaning#

  • Rename columns for clarity and drop any columns that won’t be useful for analysis.

  • Format/convert data: e.g., ensure numbers are integers/floats and dates are in a consistent format.

  • Handle missing data: fill in, impute, or drop missing values if they’re below a chosen threshold (e.g., less than 3% missing).

  • Check for duplicated rows or inconsistent formats and fix them as needed.