What is the difference between 'if the data is of good quality' and 'if the data is tidy'?

Question

What is the difference between 'if the data is of good quality' and 'if the data is tidy'?

user3508140

2021年2月2日 20:07

I'm doing Data Analyst nanodegree from Udacity. I'm confused between the difference even after going through the lecture a few times.

Topic data-analysis data-wrangling data-cleaning

Category Data Science

Karthik Sunil · Accepted Answer · 2020年5月5日 10:15

Let me try to explain by intuitively. First let me take the easy one.

Data being tidy

As per definition Tidy means Arranged in Order, Neat, Uncluttered. All of these explain the physical aspects of the data representation. For example, data arranged in proper columns, with good headings, with relevance etc. You can think of this being syntactic in nature

Good Quality Data

This refers to actual data elements in the file. Like not having too many blank (missing) values. Every column having proper data types. Data without special (unreadable) characters. Properly encoded. Not having too much unstructured data. You can think this as the semantics of the data.

Hope this helps

What is the difference between 'if the data is of good quality' and 'if the data is tidy'?

About