How to Solve the Problem of Imbalanced Datasets: Meet Djinn by Tonic

  • It’s so difficult to build an unbiased model to classify a rare event since machine learning algorithms will learn to classify the majority class so much better. This blog post shows how a new AI-powered data synthesizer tool, Djinn, can upsample synthetic data even better than SMOTE and SMOTE-NC. Using neural network generative models, it has a powerful ability to learn and mimic real data super quickly and integrates seamlessly with Jupyter Notebook.

    Full disclosure: I recently joined Tonic.ai as their first Data Science Evangelist, but I also can say that I genuinely think this product is amazing and a game-changer for data scientists.

    Happy to connect and chat all things data synthesis!