How does Pandas store timezone information?
Timestamp columns are stored internally in UTC, regardless of whether they have a timezone or not. This is stated in the docs: https://pandas.pydata.org/docs/user_guide/timeseries.html#working-with-time-zones
So how is timezone information stored internally? There must be some sort of tag which makes that column contain pd.Timestamp entries with tz=America/New_York or whatever the timezone is.
Context:
I am writing data to disk in a way that is not specific to Python, e.g I am not using pickle
. When I retrieve the data I want the timezone information to remain.
Category Data Science