|
Project title: |
2 page summary, Do Data Cleaning, Hypotheses, recommendation
|
Posted by: |
External project from PeoplePerHour
|
Started: |
09-Apr-2025 12:30 GMT |
Description: |
These are theminimum expectatiosn.
1. Data Cleaning Plan ✔️ Observations: No missing values (Non-Null Count = 3397 for all columns)
Some inconsistencies in column names and types:
number of time driver fail test should be numeric, but it's of type object.
Column names have extra spaces and inconsistent formatting (e.g. No_hazard_reported(positve) has a typo).
✅ Cleaning Actions: Strip whitespace from column names
Fix typos and standardize column names
Convert number of time driver fail test and Size of box to appropriate types (likely int or float)
Let me clean the data first, then we’ll form hypotheses (step 2), test them (step 3), and suggest control measures (step 4).
After cleaning:
✅ Column names are standardized.
⚠️ 717 values in Times_driver_failed_test and 69 in Size_of_box… |
Project ID:
|
3429136 |
Project category: |
|
Project budget: |
|
|
|