Lecture 2
Hands-on data exploration and feature engineering using real-world datasets and Orange data mining
This work is licensed under CC BY-NC-SA 4.0
© Way-Up 2025
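# Orange "Python Script" widget: in_data is the input table, out_data the output.
from Orange.data.pandas_compat import table_from_frame, table_to_frame

df = table_to_frame(in_data)      # Orange Table -> pandas DataFrame
df = df.drop_duplicates()         # remove exact duplicate rows
out_data = table_from_frame(df)   # back to an Orange Table for the next widget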
3 main strategies exist if you cannot recover missing values
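The slide text does not name the three strategies at this point. As an assumption, a commonly used set is: drop the incomplete rows, drop a column that is mostly empty, or impute a replacement value. A minimal pandas sketch on a made-up table (all column names are hypothetical):

```python
import pandas as pd

# Hypothetical toy data; column names are illustrative only.
df = pd.DataFrame({
    "temperature": [21.0, None, 19.5, 23.0],
    "station": ["A", "B", None, "A"],
    "comment": [None, None, None, "ok"],
})

# Strategy 1 (assumed): drop rows with a missing value in a key column.
rows_dropped = df.dropna(subset=["temperature", "station"])

# Strategy 2 (assumed): drop columns that are mostly empty
# (here: fewer than 50% of the rows filled).
min_filled = int(len(df) * 0.5)
cols_dropped = df.dropna(axis=1, thresh=min_filled)

# Strategy 3 (assumed): impute, e.g. mean for numeric columns
# and most frequent value for categorical ones.
imputed = df.copy()
imputed["temperature"] = imputed["temperature"].fillna(imputed["temperature"].mean())
imputed["station"] = imputed["station"].fillna(imputed["station"].mode()[0])
```

Which option fits depends on how much data is missing and how important the column is; dropping is simplest but loses rows, while imputing keeps them at the cost of introducing estimated values.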
Remove the year from event data to prevent overfitting: new events never carry a past year, so anything the model learns from specific year values cannot apply to them
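One way to drop the year while keeping the parts of the date that do recur is sketched below. It assumes the event data has a datetime column; the names `event_date` and `attendance` are hypothetical and not taken from the lecture dataset.

```python
import pandas as pd

# Hypothetical event data; "event_date" and "attendance" are illustrative names.
events = pd.DataFrame({
    "event_date": pd.to_datetime(["2018-07-04", "2019-12-31", "2020-03-15"]),
    "attendance": [1200, 800, 300],
})

# Keep the recurring parts of the date, which new events share with past ones.
events["month"] = events["event_date"].dt.month
events["day_of_week"] = events["event_date"].dt.dayofweek  # Monday=0 ... Sunday=6

# Drop the raw date so the year is no longer available as a feature.
features = events.drop(columns=["event_date"])
```

Month and day of week can still carry seasonal signal, whereas the year itself would only let the model memorise past periods.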
Example with Orange: append the following dataset, daily weather in California from 1998 to 2020
Implementation with Orange
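Outside the Orange canvas, the same idea of attaching daily weather to each event can be sketched in pandas as a join on the date. The tables and column names below (`date`, `event_id`, `max_temp_c`, `precip_mm`) are hypothetical stand-ins, not the actual California dataset.

```python
import pandas as pd

# Hypothetical stand-ins for the event table and the daily weather table.
events = pd.DataFrame({
    "date": pd.to_datetime(["2019-08-01", "2019-08-02", "2019-08-02"]),
    "event_id": [1, 2, 3],
})
weather = pd.DataFrame({
    "date": pd.to_datetime(["2019-08-01", "2019-08-02", "2019-08-03"]),
    "max_temp_c": [31.0, 29.5, 30.2],
    "precip_mm": [0.0, 1.2, 0.0],
})

# Left join: keep every event and attach the weather of the same day.
# validate="many_to_one" raises if the weather table has duplicate dates.
enriched = events.merge(weather, on="date", how="left", validate="many_to_one")
```

Inside a Python Script widget, the result could be converted back with `table_from_frame`, in the same way as the duplicate-removal snippet earlier in the lecture.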