Create subsection ## Datasets.
In section Datasets read metainformation for dataset okcupid-stem as follows:
1 2 3 4 5 6 7 8 9 | |
and get data as a dataframe (last week we selected numpy.array).
1 2 | |
target stores the target column, job. Using df[target] etc. allows me to write more generic later on._ to indicate information that is discarded. Here the information is just None because I did not ask to return the target column separately.Check import using the usual shape and head code:
1 2 | |