Skip to content
@Audio-WestlakeU

Audio-WestlakeU

Audio Signal and Information Processing Lab at Westlake University

Pinned Loading

  1. FullSubNet FullSubNet Public

    PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

    Python 558 158

  2. NBSS NBSS Public

    The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

    Python 240 27

  3. McNet McNet Public

    The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

    Python 110 13

  4. audiossl audiossl Public

    A library built for easier audio self-supervised training, downstream tasks evaluation

    Python 110 10

  5. FN-SSL FN-SSL Public

    The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

    Python 97 10

  6. ATST-SED ATST-SED Public

    This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

    Jupyter Notebook 107 13

Repositories

Showing 10 of 28 repositories
  • UMA-ASR Public

    This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).

    Audio-WestlakeU/UMA-ASR’s past year of commit activity
    Shell 20 5 1 0 Updated Dec 17, 2024
  • RealMAN Public

    A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

    Audio-WestlakeU/RealMAN’s past year of commit activity
    Python 105 11 4 0 Updated Dec 11, 2024
  • FN-SSL Public

    The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

    Audio-WestlakeU/FN-SSL’s past year of commit activity
    Python 97 10 2 0 Updated Dec 9, 2024
  • FS-EEND Public

    The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction"

    Audio-WestlakeU/FS-EEND’s past year of commit activity
    Python 98 MIT 4 2 0 Updated Dec 2, 2024
  • NBSS Public

    The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

    Audio-WestlakeU/NBSS’s past year of commit activity
    Python 240 MIT 27 19 0 Updated Nov 4, 2024
  • ATST-SED Public

    This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

    Audio-WestlakeU/ATST-SED’s past year of commit activity
    Jupyter Notebook 107 MIT 13 2 0 Updated Oct 15, 2024
  • SAR-SSL Public

    A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]

    Audio-WestlakeU/SAR-SSL’s past year of commit activity
    Python 32 MIT 1 2 0 Updated Oct 11, 2024
  • ATST-RCT Public

    ATST-RCT model for DCASE 2022 task4.

    Audio-WestlakeU/ATST-RCT’s past year of commit activity
    Python 2 0 0 0 Updated Sep 19, 2024
  • audiossl Public

    A library built for easier audio self-supervised training, downstream tasks evaluation

    Audio-WestlakeU/audiossl’s past year of commit activity
    Python 110 10 4 1 Updated Aug 27, 2024
  • RVAE-EM Public

    Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

    Audio-WestlakeU/RVAE-EM’s past year of commit activity
    Python 42 MIT 4 0 0 Updated Mar 20, 2024