✨Basic Data Cleaning Functions in Excel (For Beginners)
✨Basic Data Cleaning Functions in Excel (For Beginners)
Hello friends π Welcome back to Brundha_Sri Learning Hub!
Today, let’s start our Data Cleaning journey in Excel – the most important step for any Data Analyst.
π Did you know? 80% of analysis time goes into cleaning messy data before you can create dashboards or insights. That’s why Excel provides some powerful but simple functions to clean your dataset.
In this blog, we’ll learn 8 basic functions with real-life examples.
πΉ 1. TRIM() – Remove Extra Spaces
π Meaning: Deletes all extra spaces in text (except single space between words).
Syntax:
=TRIM(text)
Example:
Raw Data → " John Smith "
Formula → =TRIM(A2)
Result → John Smith
✅
πΉ 2. CLEAN() – Remove Hidden Characters
π Meaning: Removes non-printable characters (often appear when you copy-paste from websites, PDFs, or systems).
Syntax:
=CLEAN(text)
Example:
Raw Data → "Michael…Johnson"
Formula → =CLEAN(A2)
Result → MichaelJohnson
✅
πΉ 3. PROPER() – Capitalize First Letters
π Meaning: Converts text so that each word starts with a capital letter.
Syntax:
=PROPER(text)
Example:
Raw Data → emma brown
Formula → =PROPER(A2)
Result → Emma Brown
✅
πΉ 4. UPPER() & LOWER() – Change Text Case
π Meaning: Converts text to all uppercase or lowercase.
Syntax:
=UPPER(text)
=LOWER(text)
Example:
Raw Data → JohnSmith@GMAIL.com
-
=UPPER(A2)
→JOHNSMITH@GMAIL.COM
-
=LOWER(A2)
→johnsmith@gmail.com
πΉ 5. SUBSTITUTE() – Replace Text or Symbols
π Meaning: Replaces specific text/character inside a string.
Syntax:
=SUBSTITUTE(text, old_text, new_text)
Example:
Raw Data → 987-654-3210
Formula → =SUBSTITUTE(A2,"-","")
Result → 9876543210
✅
πΉ 6. REPLACE() – Replace by Position
π Meaning: Replaces part of a text string based on position & length.
Syntax:
=REPLACE(old_text, start_num, num_chars, new_text)
Example:
Raw Data → 9876543210
Formula → =REPLACE(A2,4,3,"***")
Result → 987***3210
✅
πΉ 7. TEXT() – Format Numbers/Dates
π Meaning: Converts a number or date into a specific format.
Syntax:
=TEXT(value, format_text)
Examples:
-
=TEXT(25.0,"00")
→25
-
=TEXT(D2,"MMMM-YYYY")
→January-2025
π Example Dataset
Before Cleaning (Messy Data)
Name (Raw) | Phone (Raw) | Email (Raw) | Date (Raw) |
---|---|---|---|
john smith | 987-654-3210 | JohnSmith@GMAIL.com | 01/01/25 |
EMMA BROWN | (987)6543210 | emma.brown@gmail.COM | 2025-01-01 |
rahul sharma | 987.654.3210 | rahulsharma@ outlook.com | 1-Jan-25 |
After Cleaning (Using Functions)
Name (Cleaned) | Phone (Cleaned) | Email (Cleaned) | Date (Formatted) |
---|---|---|---|
John Smith | 9876543210 | johnsmith@gmail.com | January-2025 |
Emma Brown | 9876543210 | emma.brown@gmail.com | January-2025 |
Rahul Sharma | 9876543210 | rahulsharma@outlook.com | January-2025 |
π€ Wrap-Up
“So friends, these basic Excel functions – TRIM, CLEAN, PROPER, UPPER/LOWER, SUBSTITUTE, REPLACE, and TEXT – are your first toolkit to transform raw, messy data into neat, professional data.
π Try them on your names, phone numbers, and emails today. You’ll be amazed how quickly your dataset becomes analysis-ready!” π
π In next, we’ll cover Intermediate Data Cleaning Functions like VALUE
, LEN
, LEFT
, RIGHT
, MID
, SEARCH
, etc. Stay tuned!
Comments
Post a Comment