This Data Science laboratorial course was designed independently of
the data science tool to use, and consider the different tasks in
the KDD process. However, given the differences on the methods used
to process different kinds of data, we instantiated the process into
two distinct modules:
- the first to deal with multidimensional data (tabular format) usually collected from relational databases and data warehouses;
- and the second to process uni and multi-variate time series.
-
auxiliary files
including the
dslabs_functions.py
that contains all the auxiliary functions implemented in the following pages - and data files used in the examples.