Jan van der Vegt: A walk through the isolation forest | PyData Amsterdam 2019

Поділитися
Вставка
  • Опубліковано 8 лип 2024
  • Anomaly detection can help with fraud detection, predictive maintenance and cyber security cases amongst others. Next to this it can help on a meta level for other machine learning projects by detecting outliers during training or inference. One of the approaches to anomaly detection is called Isolation Forests. In this talk we will first go over the original idea of the isolation forest paper and a slightly more sophisticated extension called Entropy Isolation Forests. Isolation forests have a number of appealing properties with regards to intuition, parallelism and performance but the basic formulation is missing native support for categorical features and missing values.
    By looking at the mathematical formulation and the reasoning behind it we will extend this approach to natively allow categorical and missing values. The goal of the talk is twofold; on the one hand an in-depth introduction to the class of Isolation Forests and on the other hand a look at the process of extending existing methods to suit the needs of your project. The concepts in this talk are accompanied with a GitHub implementation.
    www.pydata.org
    PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
    PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome!
    00:10 Help us add time stamps or captions to this video! See the description for details.
    Want to help add timestamps to our UA-cam videos to help with discoverability? Find out more here: github.com/numfocus/UA-camVi...
  • Наука та технологія

КОМЕНТАРІ • 5

  • @Pitmaster1233
    @Pitmaster1233 Рік тому +3

    Very simple and intuitive explanation.

  • @jeff0gucha834
    @jeff0gucha834 Рік тому +1

    Great explanation

  • @surenmailme
    @surenmailme 6 місяців тому

    Nice presentation, but if you could have shared the python notebook for a sample data that will help understand more

  • @kamel7897
    @kamel7897 2 роки тому +6

    This comment was isolated

  • @posthocprior
    @posthocprior Рік тому +3

    A confusing talk.