Publications

(2024). Vision Language Models Are Few-Shot Audio Spectrogram Classifiers. In NeurIPS Audio Imagination Workshop 2024.

arXiv PDF BibTeX

(2024). Local Deployment of Large-Scale Music AI Models on Commodity Hardware. In ISMIR LBD 2024.

arXiv PDF BibTeX 🕹ī¸ Demo

(2024). Just Label the Repeats for In-The-Wild Audio-to-Score Alignment. In ISMIR 2024.

arXiv PDF BibTeX Code Video Examples

(2024). Hookpad Aria: A Copilot for Songwriters. In ISMIR LBD 2024.

arXiv PDF BibTeX Project Page

(2024). Towards Music-Aware Virtual Assistants. In UIST 2024.

PDF BibTeX DOI Video

(2024). Do Music Generation Models Encode Music Theory?. In ISMIR 2024.

arXiv PDF BibTeX Sound 🔊

(2024). The Impact of Element Ordering on LM Agent Performance. In NeurIPS Workshops 2024.

arXiv PDF BibTeX

(2024). Adaptive Accompaniment with ReaLchords. In ICML 2024.

arXiv PDF BibTeX

(2024). V2Meow: Meowing to the Visual Beat via Video-to-Music Generation. In AAAI 2024.

arXiv PDF BibTeX Music Samples

(2023). Music ControlNet: Multiple Time-varying Controls for Music Generation. In TASLP 2024.

arXiv PDF BibTeX 🔊 Examples Video

(2023). Anticipatory Music Transformer. In TMLR 2024.

arXiv PDF BibTeX 🔊 Examples Code

(2023). SingSong: Generating Musical Accompaniments from Singing.

arXiv PDF BibTeX 🔊 Examples

(2022). Melody Transcription via Generative Pre-training. In ISMIR.

arXiv PDF BibTeX 🔊 Examples Code Dataset Video

(2022). It's Raw! Audio Generation with State-Space Models. In ICML (Long Talk; Top 2%).

arXiv PDF BibTeX 🔊 Examples Code

(2021). Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music. In ISMIR.

arXiv PDF BibTeX 🔊 Examples Code

(2021). Codified Audio Language Modeling Learns Useful Representations for Music Information Retrieval. In ISMIR (Best Paper Runner-up).

arXiv PDF BibTeX Code

(2021). Swords ⚔ī¸: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality. In NAACL.

arXiv PDF BibTeX Code Dataset

(2020). Enabling Language Models to Fill in the Blanks. In ACL.

arXiv PDF BibTeX Code Demo

(2019). LakhNES: Improving Multi-instrumental Music Generation with Cross-domain Pre-training. In ISMIR.

arXiv PDF BibTeX 🔊 Examples Code

(2019). Expediting TTS Synthesis with Adversarial Vocoding. In INTERSPEECH (Oral).

arXiv PDF BibTeX 🔊 Examples Code

(2019). GANSynth: Adversarial Neural Audio Synthesis. In ICLR.

arXiv PDF BibTeX 🔊 Examples Code Blog

(2019). Adversarial Audio Synthesis. In ICLR.

arXiv PDF BibTeX 🔊 Examples Code 🕹ī¸ Demo Notebook

(2019). Piano Genie. In ACM IUI.

arXiv PDF BibTeX Code 🕹ī¸ Demo Video Blog

(2018). Semantically Decomposing the Latent Spaces of Generative Adversarial Networks. In ICLR.

arXiv PDF BibTeX Code 🕹ī¸ Demo

(2017). Dance Dance Convolution. In ICML.

arXiv PDF BibTeX Code Dataset 🕹ī¸ Demo