What is the difference between 'if the data is of good quality' and 'if the data is tidy'?

I'm doing Data Analyst nanodegree from Udacity. I'm confused between the difference even after going through the lecture a few times.

Topic data-analysis data-wrangling data-cleaning

Category Data Science


Let me try to explain by intuitively. First let me take the easy one.


Data being tidy

As per definition Tidy means Arranged in Order, Neat, Uncluttered. All of these explain the physical aspects of the data representation. For example, data arranged in proper columns, with good headings, with relevance etc. You can think of this being syntactic in nature


Good Quality Data

This refers to actual data elements in the file. Like not having too many blank (missing) values. Every column having proper data types. Data without special (unreadable) characters. Properly encoded. Not having too much unstructured data. You can think this as the semantics of the data.

Hope this helps

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.