amsterdata

Data management for ML at the University of Amsterdam (as part of INDELab)

We conduct research on data management for machine learning at the University of Amsterdam (as part of INDELab). Check out our recent and ongoing projects:

  • mlinspect allows the instrumentation and inspection of native ML pipelines written in Python with pandas, sklearn and keras.

  • serenade is a low-latency session-based recommender system deployed in production at bol.com, the largest e-commerce platform in the Netherlands.

  • caboose contains implementations of state-of-the-art kNN models for next-basket recommendation which can "unlearn" user data in milliseconds.

  • jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.

Pinned

  1. serenade Public

    Forked from bolcom/serenade

    Serenade is a low-latency session-based recommender system deployed in production at bol.com

    Rust

  2. caboose Public

    Forked from schelterlabs/caboose

    Python 1 1

  3. mlinspect Public

    Forked from stefan-grafberger/mlinspect

    mlinspect allows the instrumentation and inspection of native ML pipelines written in Python with pandas, sklearn and keras.

    Jupyter Notebook

  4. jenga Public

    Forked from schelterlabs/jenga

    Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the…

    Jupyter Notebook

  5. arguseyes-demo Public

    Jupyter Notebook 2

Repositories

Showing 10 of 16 repositories
  • schemapile
    Jupyter Notebook 4 1 0 0 Updated Apr 7, 2024
  • snapcase-demo
    JavaScript 0 0 0 0 Updated Mar 26, 2024
  • demodq Public
    Jupyter Notebook 3 0 0 0 Updated Jan 9, 2024
  • retrieval_importance Public

    Implementation and experimentation code for the paper on Improving Retrieval-Augmented Large Language Models via Data Importance Learning.

    Rust 0 0 0 0 Updated Nov 25, 2023
  • master-theses-2023
    0 0 0 0 Updated Oct 31, 2023
  • ragbooster
    Rust 30 Apache-2.0 2 0 0 Updated Jul 31, 2023
  • seqrec Public
    Rust 1 0 0 0 Updated Jun 13, 2023
  • .github Public
    0 0 0 0 Updated Apr 22, 2023
  • caboose
    Python 1 GPL-3.0 2 0 0 Updated Apr 22, 2023
  • freamon Public

    Freamon enables data scientists to automatically reconstruct and query the intermediate data from ML pipelines to reduce the level of expertise and manual effort required to debug this data.

    Jupyter Notebook 0 GPL-3.0 0 0 0 Updated Jan 25, 2023
