LAS @ ETH Zurich

user_interactions Public
Aligning Language Models from User Interactions via Self-Distillation

lasgroup/user_interactions’s past year of commit activity

Python
12
Apache-2.0
3 1 0
Updated Mar 31, 2026
rewarduq Public
Code for "RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models"

lasgroup/rewarduq’s past year of commit activity

Python
15
Apache-2.0
1 0 1
Updated Mar 31, 2026
lasgroup/rlhf’s past year of commit activity

JavaScript 0 0
0 0
Updated Mar 25, 2026
lasgroup/molmospaces’s past year of commit activity

Python 0
23 0 0
Updated Mar 20, 2026
lasgroup/ombrl’s past year of commit activity

Python
10 3 0 1
Updated Mar 13, 2026
lasgroup/ActiveUltraFeedback’s past year of commit activity

8
0
1 0
Updated Mar 11, 2026
safe-learning Public
A collection of algorithms and experiment tools for safe sim to real transfer in robotics.

lasgroup/safe-learning’s past year of commit activity

Python
23
MIT
7 0 0
Updated Feb 23, 2026
SDPO Public
Reinforcement Learning via Self-Distillation (SDPO)

lasgroup/SDPO’s past year of commit activity

Python
725
Apache-2.0
79 4 1
Updated Feb 18, 2026
lasgroup/swissai-openpi’s past year of commit activity

Python 0 Apache-2.0
5 0 0
Updated Jan 28, 2026
swissai-dsrl Public Forked from nakamotoo/dsrl_pi0
Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)

lasgroup/swissai-dsrl’s past year of commit activity

Python
1 31 0 0
Updated Jan 23, 2026