Consider that what is taught about data (analysis) doesn't match up with reality.

| | Expectations | Reality |
|---|---|---|
| What does the data look like? | Static dataset/table/CSV that represents observations or an experiment | Constantly changing (new records, changing schema), with dozens (if not hundreds) of datasets to work with that represent business operations |
| How is it stored? | Stored in a CSV | Stored as objects (tables/views) in a database |
| What languages and tools are used to analyze it? | Analyzed with R, Pandas, SPSS | Languages: SQL. Tools: git, the command line, dbt, Airflow, VSCode, etc. |
| What do analysts do? | Deliver “insights” | So much! Version control, testing, modeling, requirements gathering, getting buy-in |