    Repositories list

    • HTML
      31800 Updated Apr 22, 2026
    • PAVAS

      Public
      [CVPR 2026 (Oral)] PAVAS: Physics-Aware Video-to-Audio Synthesis
      0000 Updated Apr 19, 2026
    • Woosh

      Public
      Public release of the Sound Effect Foundation model by Sony AI.
      Python
      Apache License 2.0
      1019120 Updated Apr 15, 2026
    • LLM2Fx

      Public
      Large Language Models for Music Post Production
      Python
      43910 Updated Mar 31, 2026
    • [TASLP] Open-Vocabulary Sound Event Localization and Detection with Joint Learning of CLAP Embedding and Activity-Coupled Cartesian DOA Vector
      Python
      MIT License
      1800 Updated Mar 25, 2026
    • VibeToken

      Public
      [CVPR 2026] VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
      Python
      MIT License
      0400 Updated Feb 25, 2026
    • Python
      Other
      1300 Updated Feb 12, 2026
    • SAVGBench

      Public
      SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation
      Python
      2500 Updated Feb 4, 2026
    • Official Repository for "Towards blind data cleaning: A case study in music source separation"
      0000 Updated Jan 26, 2026
    • MEGAMI

      Public
      Accompanying repository for the paper "Automatic Music Mixing Using a Generative Model of Effect Embeddings"
      Python
      Other
      33400 Updated Jan 18, 2026
    • Evaluation toolkit for video-to-audio generation on SoundReactor
      Python
      MIT License
      0300 Updated Dec 22, 2025
    • Python
      MIT License
      0100 Updated Dec 4, 2025
    • Continuous waveform-based VAEs for SoundReactor
      Python
      MIT License
      0200 Updated Nov 26, 2025
    • Jupyter Notebook
      Apache License 2.0
      0700 Updated Nov 5, 2025
    • CCStereo

      Public
      [ACMMM 2025] CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation
      Python
      MIT License
      0800 Updated Nov 4, 2025
    • Model evaluation harness for vision model evaluation on the Fair human-centric image dataset for ethical AI benchmarking (FHIBE) dataset developed by the Sony A…
      Python
      Apache License 2.0
      0300 Updated Nov 3, 2025
    • diffvox

      Public
      Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"
      Jupyter Notebook
      MIT License
      43900 Updated Oct 28, 2025
    • stella

      Public
      StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold (NeurIPS 2025 Spotlight)
      Python
      Apache License 2.0
      41600 Updated Oct 20, 2025
    • Holds the code needed for the denoising challenge we organized in association with ICCV 2025
      JavaScript
      MIT License
      31300 Updated Oct 15, 2025
    • [ICCV 2025] Official implementation of the paper "Beyond RGB: Adaptive Parallel Processing for RAW Object Detection"
      Python
      MIT License
      22300 Updated Oct 8, 2025
    • Python
      MIT License
      1800 Updated Oct 6, 2025
    • This repo contains a Python package for extracting skin tone from images.
      Python
      MIT License
      1000 Updated Sep 29, 2025
    • ReRAW

      Public
      Implementation of ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge
      Python
      MIT License
      33010 Updated Sep 24, 2025
    • "Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"
      Python
      Other
      15000 Updated Aug 23, 2025
    • IISA

      Public
      [ICCV 2025] - Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution
      Python
      11620 Updated Aug 16, 2025
    • raw_bench

      Public
      RAW-Bench: the Robust Audio Watermarking Benchmark, a comprehensive real-world assessment of audio watermarking algorithms
      Python
      MIT License
      3900 Updated Aug 5, 2025
    • Unified framework for post-hoc explainability in Knowledge Graph Completion that standardizes explanation generation and evaluation to improve reproducibility a…
      Jupyter Notebook
      MIT License
      0100 Updated Jul 22, 2025
    • Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
      Python
      MIT License
      01500 Updated Jul 17, 2025
    • Implementation of the paper "ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors"
      Python
      Other
      02210 Updated Jul 3, 2025
    • Noise Modeling in One Hour: Minimizing Preparation Efforts for Self-supervised Low-Light RAW Image Denoising
      Python
      MIT License
      67600 Updated Jun 18, 2025