Data cleaning is one of the most time-consuming parts of working with spreadsheets, and one of the best uses of Copilot in Excel — because it can scan your whole dataset at once and surface problems that would take hours to find manually. Here's how to actually use it.

Ad placeholder — in-article responsive unit

Setup: format as a table first

As with all Copilot features in Excel, data cleaning works best on a properly formatted Table with clear column headers. Select your data and go to Insert > Table if you haven't already. Make sure the file is saved to OneDrive or SharePoint with AutoSave on.

Finding errors and inconsistencies

Start by asking Copilot for a general health check on your data:

Copilot will scan the data and report what it finds — typos, inconsistent capitalization, mixed formats, formula errors, and similar issues. It often spots things you wouldn't notice scrolling through manually.

Finding and handling missing values

Tip: Copilot can highlight cells matching a condition directly in your sheet. Ask it to "highlight rows where [column] is empty" and it applies conditional formatting automatically — making missing values visible at a glance without you needing to write a formula.

Fixing inconsistent formatting

Inconsistent date formats, mixed capitalisation, and extra spaces are common data quality problems that break lookups and filters. Copilot can fix these:

Finding duplicates

Copilot identifies duplicates and can highlight them for you to review. It won't automatically delete rows — it flags them so you can decide which to keep, which is the safer approach with real data.

Adding a new column to flag data quality issues

For ongoing data quality monitoring, ask Copilot to add a helper column:

Splitting and combining columns

Always verify before applying to the full dataset

Copilot's data cleaning suggestions show as previews before being applied. Review what it's about to do — especially for bulk find-and-replace operations — on a sample of rows before accepting. Use Ctrl+Z to undo anything that doesn't look right. For large or business-critical datasets, it's worth keeping a backup copy before running bulk cleaning operations.

The bottom line

Data cleaning is where Copilot saves the most time relative to doing it manually — it can scan thousands of rows instantly for patterns that would take hours to catch by hand. The combination of asking for a health check first, then drilling into specific issues, gets clean data faster than any other method short of writing complex validation formulas yourself.