LAS @ ETH Zurich

  • user_interactions Public

    Aligning Language Models from User Interactions via Self-Distillation

    lasgroup/user_interactions’s past year of commit activity

    Python

    12

    Apache-2.0

    3 1 0

    Updated Mar 31, 2026

  • rewarduq Public

    Code for "RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models"

    lasgroup/rewarduq’s past year of commit activity

    Python

    15

    Apache-2.0

    1 0 1

    Updated Mar 31, 2026

  • lasgroup/rlhf’s past year of commit activity

    JavaScript 0 0

    0 0

    Updated Mar 25, 2026

  • lasgroup/molmospaces’s past year of commit activity

    Python 0

    23 0 0

    Updated Mar 20, 2026

  • lasgroup/ombrl’s past year of commit activity

    Python

    10 3 0 1

    Updated Mar 13, 2026

  • lasgroup/ActiveUltraFeedback’s past year of commit activity

    8

    0

    1 0

    Updated Mar 11, 2026

  • safe-learning Public

    A collection of algorithms and experiment tools for safe sim to real transfer in robotics.

    lasgroup/safe-learning’s past year of commit activity

    Python

    23

    MIT

    7 0 0

    Updated Feb 23, 2026

  • SDPO Public

    Reinforcement Learning via Self-Distillation (SDPO)

    lasgroup/SDPO’s past year of commit activity

    Python

    725

    Apache-2.0

    79 4 1

    Updated Feb 18, 2026

  • lasgroup/swissai-openpi’s past year of commit activity

    Python 0 Apache-2.0

    5 0 0

    Updated Jan 28, 2026

  • swissai-dsrl Public Forked from nakamotoo/dsrl_pi0

    Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)

    lasgroup/swissai-dsrl’s past year of commit activity

    Python

    1 31 0 0

    Updated Jan 23, 2026