ckyang1124 - Overview

Chih-Kai Yang ckyang1124

Pinned

  1. Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information" (Interspeech 2025)

    Python 22 3

  2. Collection of works for evaluating (and analyzing) large audio-language models (LALMs)

    40 1

  3. This is the repository for the paper "Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper"

    Python 2

  4. This is the official implementation of the work "Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-…

    Python 1

  5. Official Repository for the ASRU 2025 paper "AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models"

    Python 6