AdrianBZG - Overview

Adrián Bazaga (AdrianBZG)

Senior Researcher @ Microsoft. Foundational LLM Development. PhD, Machine Learning @ University of Cambridge. Ex: Amazon AGI, Microsoft Research

Pinned

  1. Forked from amazon-science/TISER

    [ACL 2025] Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models

    1

  2. [EMNLP 2024] HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs

    Python · 23 stars · 1 fork

  3. Efficiently tune any LLM from HuggingFace using distributed training (multiple GPUs) and DeepSpeed. Uses Ray AIR to orchestrate training across multiple AWS GPU instances

    Python · 60 stars · 6 forks

  4. [ICLR 2024] Unsupervised Pretraining for Fact Verification by Language Model Distillation

    Python 5

  5. [ICML 2024] TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting

    Python 9

  6. Multimodal Instruction Tuning for Llama 3

    Python · 52 stars · 11 forks