
Process Data from Dirty to Clean
Here in Process Data from Dirty to Clean you’ll learn how to check and clean your data using spreadsheets and SQL as well as how to verify and report your data cleaning results.
About this Course
This is the fourth course Process Data from Dirty to Clean in the Google Data Analytics Certificate. These courses will equip you with the skills needed to apply to introductory-level data analyst jobs. In this course, you’ll continue to build your understanding of data analytics and the concepts and tools that data analysts use in their work. Current Google data analysts will continue to instruct and provide you with hands-on ways to accomplish common data analyst tasks with the best tools and resources.
Learners who complete this certificate program will be equipped to apply for introductory-level jobs as data analysts. No previous experience is necessary.
By the end of this course, you will be able to do the following:
- Learn how to check for data integrity.
- Discover data cleaning techniques using spreadsheets.
- Develop basic SQL queries for use on databases in Process Data from Dirty to Clean
- Apply basic SQL functions for cleaning and transforming data.
- Gain an understanding of how to verify the results of cleaning data.
- Explore the elements and importance of data cleaning reports.
What You Will Learn
- Define data integrity with reference to types of integrity and risk to data integrity
- Apply basic SQL functions for use in cleaning string variables in a database
- Develop basic SQL queries for use on databases in Process Data from Dirty to Clean
- Describe the process involved in verifying the results of cleaning data
Skills You Will Gain
- Spreadsheet
- Data Cleansing in Process Data from Dirty to Clean
- Sample Size Determination
- SQL in Process Data from Dirty to Clean
- Data Integrity
Syllabus
WEEK 1
5 hours to complete
Process Data from Dirty to Clean: The importance of integrity
As you start thinking about how to prepare your data for exploration, this part of the course will highlight why data integrity is so essential to successful decision-making. You’ll learn about how data is generated and the techniques analysts use to decide what data to collect for analysis. And you’ll discover structured and unstructured data, data types, and data formats.
8 videos (Total 33 min), 9 readings, 6 quizzes
WEEK 2
6 hours to complete
Sparkling-clean data
Every data analyst wants clean data to work with when performing an analysis. In this part of the course, you’ll learn the difference between clean and dirty data. You’ll also explore data cleaning techniques using spreadsheets and other tools.
10 videos (Total 66 min), 5 readings, 6 quizzes
WEEK 3
6 hours to complete
Cleaning data with SQL
Knowing a variety of ways to clean data can make an analyst’s job much easier. In this part of the course, you’ll check out how to clean your data using SQL. You’ll explore queries and functions that you can use in SQL to clean and transform your data to get it ready for analysis.
9 videos (Total 49 min), 5 readings, 6 quizzes
WEEK 4
2 hours to complete
Verify and report on your cleaning results
Cleaning your data is an essential step in the data analysis process. Verifying and reporting your cleaning is a way to show that your data is ready for the next step. In this part of the course, you’ll find out the processes involved with verifying and reporting data cleaning as well as their benefits.
6 videos (Total 28 min), 4 readings, 4 quizzes