Skip to content

Pandas

import pandas as pd

Encoding Dummy Variables

music_df = pd.read_csv('music.csv)
music_dummies = pd.get_dummies(music_df['genre'], drop_first=True)

Missing Values

df.isna().sum().sort_values() # (1)
df.dropna()
  1. FINDING: .isna() returns an array of true/false, which when .sum() (summed) gives us the total number of true values. And finally, we sort everything ascending.